LLM Name | Tinyllama 1B AWQ Gemv |
Repository | https://huggingface.co/casperhansen/tinyllama-1b-awq-gemv |
Model Size | 1b |
Required VRAM | 0.8 GB |
Updated | 2025-08-17 |
Maintainer | casperhansen |
Model Type | llama |
AWQ Quantization | Yes |
Quantization Type | awq |
Model Architecture | LlamaForCausalLM |
License | apache-2.0 |
Context Length | 2048 |
Model Max Length | 2048 |
Transformers Version | 4.33.2 |
Tokenizer Class | LlamaTokenizer |
Beginning of Sentence Token | <s> |
End of Sentence Token | </s> |
Unk Token | <unk> |
Vocabulary Size | 32000 |
Torch Data Type | float16 |
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Tinyllama 1B AWQ | 2K / 0.8 GB | 3983 | 0 |
Tinyllama 2 1B Miniguanaco AWQ | 2K / 0.8 GB | 11 | 3 |
...2 1B Instruct Unsloth Bnb 4bit | 128K / 1.1 GB | 44353 | 3 |
Llama 3.2 1B Unsloth Bnb 4bit | 128K / 1.1 GB | 12954 | 2 |
...truct Gptqmodel 4bit Vortex V1 | 128K / 1.6 GB | 1054 | 2 |
Llama 3.2 1B Instruct 4bit | 128K / 0.7 GB | 24720 | 15 |
Llama 3.2 1B Instruct Bnb 4bit | 128K / 1 GB | 19920 | 17 |
Llama 3.2 1B Instruct Chat Sft | 128K / 2.5 GB | 33 | 0 |
Llama 3.2 1B Bnb 4bit | 128K / 1 GB | 14294 | 15 |
Orca Mini V9 5 1B Instruct | 128K / 2.5 GB | 6 | 4 |
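
The Required VRAM figure above can be sanity-checked with a rough weights-only estimate. The sketch below is an approximation under stated assumptions, not the method the listing uses: the parameter count of about 1.1e9 and the 4-bit weight width are assumptions (the table only says "1b" and "awq"), and `weight_memory_gb` is a hypothetical helper name.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in decimal gigabytes."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: roughly 1.1e9 parameters (the table only lists the size as "1b").
NUM_PARAMS = 1.1e9

# float16 uses 16 bits per weight (see the Torch Data Type row).
fp16_gb = weight_memory_gb(NUM_PARAMS, 16)

# Assumption: AWQ weights stored at 4 bits each (the table does not state the bit width).
awq4_gb = weight_memory_gb(NUM_PARAMS, 4)

print(f"float16 weights: {fp16_gb:.2f} GB")
print(f"4-bit weights:   {awq4_gb:.2f} GB")
```

Under these assumptions the listed 0.8 GB falls between the two estimates. The gap above the 4-bit figure would be consistent with quantization metadata and any layers kept in float16, though the listing does not say how its number was derived.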