Codellama 7B Hf ReFT GSM8k by lqtrung1998

 »  All LLMs  »  lqtrung1998  »  Codellama 7B Hf ReFT GSM8k   URL Share it on

Codellama 7B Hf ReFT GSM8k is an open-source language model by lqtrung1998. Features: 7b LLM, VRAM: 13.5GB, Context: 16K, License: llama2, Code Generating, LLM Explorer Score: 0.15, Arc: 43.5, HellaSwag: 64.5, MMLU: 40.9, GSM8K: 23.7.

  Arxiv:2401.08967   Codegen   Endpoints compatible   Llama   Pytorch   Region:us   Sharded

Codellama 7B Hf ReFT GSM8k Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

Codellama 7B Hf ReFT GSM8k Parameters and Internals

Model Type 
Reasoning, Code Understanding
Use Cases 
Areas:
Commercial, Research
Applications:
Code synthesis, Understanding tasks
Primary Use Cases:
Python programming language
Limitations:
Use is limited to English, Use that violates applicable laws, Outside scenarios covered in testing
Considerations:
Developers should perform safety testing and tuning.
Additional Notes 
The models are tuned on Codellama and applicable licenses are inherited.
Supported Languages 
English (Proficient)
Training Details 
Data Sources:
GSM8k dataset
Methodology:
Reinforced Fine-Tuning after Supervised Fine-Tuning
Responsible Ai Considerations 
Mitigation Strategies:
Developers should perform safety testing and tuning tailored to their specific applications of the model.
Input Output 
Input Format:
Question followed by 'Answer reasoning:'
Accepted Modalities:
text
Output Format:
Code reasoning in Python format
LLM NameCodellama 7B Hf ReFT GSM8k
Repository 🤗https://huggingface.co/lqtrung1998/Codellama-7b-hf-ReFT-GSM8k 
Model Size7b
Required VRAM13.5 GB
Updated2026-06-09
Maintainerlqtrung1998
Model Typellama
Model Files  10.0 GB: 1-of-2   3.5 GB: 2-of-2
Generates CodeYes
Model ArchitectureLlamaForCausalLM
Licensellama2
Context Length16384
Model Max Length16384
Transformers Version4.34.1
Tokenizer ClassCodeLlamaTokenizer
Padding Token<s>
Vocabulary Size32016
Torch Data Typebfloat16

Best Alternatives to Codellama 7B Hf ReFT GSM8k

Best Alternatives
Context / RAM
Downloads
Likes
CodeLlama 7B Hf16K / 13.5 GB340456376
Deepseek Coder 6.7B Instruct16K / 13.5 GB66787495
CodeLlama 7B Instruct Hf16K / 13.5 GB42986258
GetCode Slerp16K / 13.6 GB1881
Wizardllama 7B16K / 13.5 GB590
CodeLlama 7B Python Hf16K / 13.5 GB3028144
GoLLIE 7B16K / 40.5 GB277932
MathCoder2 CodeLlama 7B16K / 13.5 GB45
CodeLlama 7B Hf16K / 13.5 GB3937122
Stack Codellama 7B Inst16K / 13.5 GB660
Note: green Score (e.g. "73.2") means that the model is better than lqtrung1998/Codellama-7b-hf-ReFT-GSM8k.

Rank the Codellama 7B Hf ReFT GSM8k Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 54415 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a