Codellama 7B Hf ReFT GSM8k by lqtrung1998

 »  All LLMs  »  lqtrung1998  »  Codellama 7B Hf ReFT GSM8k   URL Share it on

Codellama 7B Hf ReFT GSM8k is an open-source language model by lqtrung1998. Features: 7b LLM, VRAM: 13.5GB, Context: 16K, License: llama2, Code Generating, HF Score: 45.7, LLM Explorer Score: 0.16, Arc: 43.5, HellaSwag: 64.5, MMLU: 40.9, TruthfulQA: 37.3, WinoGrande: 64.3, GSM8K: 23.7.

  Arxiv:2401.08967   Codegen   Endpoints compatible   Llama   Pytorch   Region:us   Sharded

Codellama 7B Hf ReFT GSM8k Benchmarks

Codellama 7B Hf ReFT GSM8k Parameters and Internals

Model Type 
Reasoning, Code Understanding
Use Cases 
Areas:
Commercial, Research
Applications:
Code synthesis, Understanding tasks
Primary Use Cases:
Python programming language
Limitations:
Use is limited to English, Use that violates applicable laws, Outside scenarios covered in testing
Considerations:
Developers should perform safety testing and tuning.
Additional Notes 
The models are tuned on Codellama and applicable licenses are inherited.
Supported Languages 
English (Proficient)
Training Details 
Data Sources:
GSM8k dataset
Methodology:
Reinforced Fine-Tuning after Supervised Fine-Tuning
Responsible Ai Considerations 
Mitigation Strategies:
Developers should perform safety testing and tuning tailored to their specific applications of the model.
Input Output 
Input Format:
Question followed by 'Answer reasoning:'
Accepted Modalities:
text
Output Format:
Code reasoning in Python format
LLM NameCodellama 7B Hf ReFT GSM8k
Repository 🤗https://huggingface.co/lqtrung1998/Codellama-7b-hf-ReFT-GSM8k 
Model Size7b
Required VRAM13.5 GB
Updated2026-04-27
Maintainerlqtrung1998
Model Typellama
Model Files  10.0 GB: 1-of-2   3.5 GB: 2-of-2
Generates CodeYes
Model ArchitectureLlamaForCausalLM
Licensellama2
Context Length16384
Model Max Length16384
Transformers Version4.34.1
Tokenizer ClassCodeLlamaTokenizer
Padding Token<s>
Vocabulary Size32016
Torch Data Typebfloat16

Best Alternatives to Codellama 7B Hf ReFT GSM8k

Best Alternatives
Context / RAM
Downloads
Likes
Deepseek Coder 6.7B Instruct16K / 13.5 GB121189486
CodeLlama 7B Instruct Hf16K / 13.5 GB224984255
CodeLlama 7B Hf16K / 13.5 GB63145375
GetCode Slerp16K / 13.6 GB4531
Wizardllama 7B16K / 13.5 GB1110
CodeLlama 7B Hf16K / 13.5 GB6577121
MathCoder2 CodeLlama 7B16K / 13.5 GB125
CodeLlama 7B Python Hf16K / 13.5 GB2488144
GoLLIE 7B16K / 40.5 GB207030
Stack Codellama 7B Inst16K / 13.5 GB920
Note: green Score (e.g. "73.2") means that the model is better than lqtrung1998/Codellama-7b-hf-ReFT-GSM8k.

Rank the Codellama 7B Hf ReFT GSM8k Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 53254 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Check out Ag3ntum — our secure, self-hosted AI agent for server management.
Release v20260328a