Code 33B by ajibawa-2023

 ยป  All LLMs  ยป  ajibawa-2023  ยป  Code 33B   URL Share it on

  Autotrain compatible   Code Dataset:ajibawa-2023/code-74k-...   En   Endpoints compatible   Llama   Pytorch   Region:us   Sharded
Model Card on HF ๐Ÿค—: https://huggingface.co/ajibawa-2023/Code-33B 

Code 33B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Code 33B (ajibawa-2023/Code-33B)
๐ŸŒŸ Advertise your project ๐Ÿš€

Code 33B Parameters and Internals

Model Type 
code generation
Use Cases 
Areas:
research, code assistance
Applications:
code generation, programming tutorials
Additional Notes 
Special thanks to the Open Source community and TheBloke for quantized models.
Supported Languages 
en (High proficiency)
Training Details 
Data Sources:
Code-74k-ShareGPT, Python-Code-23k-ShareGPT
Data Volume:
74000 sets of codes, each having 2 conversations
Methodology:
Full fine-tuning with detailed code explanations
Training Time:
6 days & 5 hours for 3 epochs
Hardware Used:
4 x A100 80GB GPUs
Input Output 
Input Format:
Vicuna/ShareGPT format v1.1
LLM NameCode 33B
Repository ๐Ÿค—https://huggingface.co/ajibawa-2023/Code-33B 
Model Size33b
Required VRAM65.3 GB
Updated2025-08-20
Maintainerajibawa-2023
Model Typellama
Model Files  8.5 GB: 1-of-8   8.6 GB: 2-of-8   8.6 GB: 3-of-8   8.6 GB: 4-of-8   8.6 GB: 5-of-8   8.6 GB: 6-of-8   8.6 GB: 7-of-8   5.2 GB: 8-of-8
Supported Languagesen
Model ArchitectureLlamaForCausalLM
Licensecc-by-nc-nd-4.0
Context Length2048
Model Max Length2048
Transformers Version4.28.1
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32000
Torch Data Typebfloat16

Quantized Models of the Code 33B

Model
Likes
Downloads
VRAM
Code 33B GGUF519913 GB
Code 33B AWQ3617 GB
Code 33B GPTQ2616 GB

Best Alternatives to Code 33B

Best Alternatives
Context / RAM
Downloads
Likes
...angled Llama 33M 32K Base V0.132K / 0.1 GB221
ReflectionCoder DS 33B16K / 67 GB88994
Deepseek Wizard 33B Slerp16K / 35.3 GB70
ValidateAI 33B Slerp16K / 35.4 GB50
Deepseek Coder 33B Instruct16K / 66.5 GB15258537
Chronos Divergence 33B16K / 65 GB530
WhiteRabbitNeo 33B V116K / 67 GB156587
ValidateAI 3 33B Ties16K / 66.5 GB60
ValidateAI 2 33B AT16K / 66.5 GB50
...dy Deepseekcoder 33B V16.1 32K16K / 67.1 GB17770
Note: green Score (e.g. "73.2") means that the model is better than ajibawa-2023/Code-33B.

Rank the Code 33B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 50767 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124