Open Llama 3B V2 8K GPTQ by openerotica


  Arxiv:2302.13971   4bit   Autotrain compatible   Dataset:bigcode/starcoderdata Dataset:tiiuae/falcon-refinedw... Dataset:togethercomputer/redpa...   Endpoints compatible   Ext 8k   Gptq   Llama   Quantized   Region:us


Open Llama 3B V2 8K GPTQ Parameters and Internals

Model Type 
large language model
Additional Notes 
Avoid the Hugging Face fast tokenizer for now, as it sometimes produces incorrect tokenizations. Pass 'use_fast=False' when loading the tokenizer to get the slow implementation instead.
Training Details 
Data Sources:
tiiuae/falcon-refinedweb, bigcode/starcoderdata, togethercomputer/RedPajama-Data-1T
Data Volume:
1 trillion tokens
Methodology:
Pre-trained with open datasets rather than the original LLaMA dataset, using the EasyLM framework.
Hardware Used:
cloud TPU-v4s
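
The tokenizer note above can be sketched in code as follows. This is a minimal illustration, assuming the `transformers` library (and the `sentencepiece` package that the slow Llama tokenizer typically needs) is installed. The helper name `load_slow_tokenizer` and the constant names are hypothetical and chosen only for this example; the repository id is the one listed on this page.

```python
# Repository id as listed on this page.
REPO_ID = "openerotica/open_llama_3b_v2-8k-GPTQ"

# use_fast=False asks transformers for the slow tokenizer implementation,
# which is what the note above recommends.
TOKENIZER_KWARGS = {"use_fast": False}


def load_slow_tokenizer(repo_id: str = REPO_ID):
    """Load the tokenizer with the fast implementation disabled.

    Hypothetical helper for illustration; it downloads tokenizer files
    from the Hugging Face Hub on first use.
    """
    # Imported lazily so the module can be read and imported
    # even where transformers is not installed.
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(repo_id, **TOKENIZER_KWARGS)


# Example call (requires network access):
#   tokenizer = load_slow_tokenizer()
#   ids = tokenizer("Hello world").input_ids
```

Keeping the keyword arguments in one place makes it harder to forget the flag when the tokenizer is loaded from several call sites.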
LLM Name: Open Llama 3b V2 8K GPTQ
Repository: https://huggingface.co/openerotica/open_llama_3b_v2-8k-GPTQ
Model Size: 3b
Required VRAM: 2 GB
Updated: 2025-08-22
Maintainer: openerotica
Model Type: llama
Model Files: 2.0 GB
GPTQ Quantization: Yes
Quantization Type: gptq|4bit
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 8192 (8k)
Model Max Length: 8192
Transformers Version: 4.31.0
Tokenizer Class: LlamaTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Unk Token: <unk>
Vocabulary Size: 32000
Torch Data Type: float16
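
As a rough sanity check on the sizes listed above, the weight footprint can be estimated from the parameter count and the bits per weight. The sketch below assumes a nominal 3 billion parameters (taken from the "3b" label, not an exact count) and ignores quantization metadata, unquantized layers, activations, and the KV cache, so it is a lower bound rather than a VRAM prediction. The function name `weight_bytes` is hypothetical.

```python
def weight_bytes(num_params: float, bits_per_weight: int) -> float:
    """Bytes needed to store num_params weights at the given bit width."""
    return num_params * bits_per_weight / 8


# Assumption: nominal parameter count implied by the "3b" model size label.
NOMINAL_PARAMS = 3e9

# float16 is the listed torch data type; 4-bit is the listed GPTQ setting.
fp16_gb = weight_bytes(NOMINAL_PARAMS, 16) / 1e9
gptq4_gb = weight_bytes(NOMINAL_PARAMS, 4) / 1e9

print(f"float16 weights: ~{fp16_gb:.1f} GB")  # ~6.0 GB
print(f"4-bit weights:   ~{gptq4_gb:.1f} GB")  # ~1.5 GB
```

The 4-bit estimate of about 1.5 GB sits below the listed 2.0 GB of model files. The gap is plausibly explained by the real parameter count being somewhat above the nominal figure and by per-group quantization data and layers that are stored at higher precision, though this page does not break the file size down.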

Best Alternatives to Open Llama 3B V2 8K GPTQ

Best Alternatives | Context / RAM | Downloads, Likes
...ma 3.2 3B Instruct XMADai INT4 | 128K / 3 GB | 346
Orca2myth7.2 GPTQ | 4K / 10.9 GB | 64
...B GPTQ 4bit GPTQ Code Instruct | 2K / 2 GB | 50
...lInstruct Lora Merged 4bit 32g | 2K / 2.3 GB | 132
...zard Evol Instuct V2 196K GPTQ | 2K / 2.1 GB | 322
...ama 3B Mathwizard Quantized V2 | 2K / 3.7 GB | 60
SauerkrautLM 3B V1 GPTQ | 2K / 2.3 GB | 92
Marx 3B GPTQ | 2K / 2.1 GB | 73
Griffin 3B GPTQ | 2K / 2.1 GB | 92
Puma 3B GPTQ | 2K / 2.1 GB | 101

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124