Llama 31 Hhrlhf Squad Rlhf Policy Model is an open-source language model by almorin. Features: 1b LLM, VRAM: 2.5GB, Context: 128K, Instruction-Based, LLM Explorer Score: 0.19.
| LLM Name | Llama 31 Hhrlhf Squad Rlhf Policy Model |
| Repository 🤗 | https://huggingface.co/almorin/llama-31-hhrlhf-squad-rlhf-policy-model |
| Model Size | 1b |
| Required VRAM | 2.5 GB |
| Updated | 2025-09-23 |
| Maintainer | almorin |
| Model Type | llama |
| Instruction-Based | Yes |
| Model Files | |
| Model Architecture | LlamaForCausalLM |
| Context Length | 131072 |
| Model Max Length | 131072 |
| Transformers Version | 4.44.0 |
| Tokenizer Class | PreTrainedTokenizerFast |
| Padding Token | <|eot_id|> |
| Vocabulary Size | 128256 |
| Torch Data Type | float16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
|---|---|---|---|
| Llama 3.2 1B Instruct | 128K / 2.5 GB | 6336819 | 1393 |
| ...2 1B Instruct Only Mask W Item | 128K / 3 GB | 1425 | 0 |
| ...Instruct Only Mask W Item Mesh | 128K / 3 GB | 377 | 0 |
| Llama 3.2 1B Instruct Bf16 | 128K / 2.5 GB | 1158 | 5 |
| Llama3.2 1B FantasySciFi | 128K / 2.5 GB | 304 | 0 |
| Shield Llama 3.2 1B Full FT CE | 128K / 2.5 GB | 795 | 0 |
| Llama 3.2 1B Instruct | 128K / 2.5 GB | 13 | 0 |
| Llama 3.2 1B Instruct | 128K / 2.5 GB | 157691 | 92 |
| ....2 1B Sarcasm Rewriter Context | 128K / 2.5 GB | 252 | 0 |
| Llama 3.2 OctoThinker INano 1B | 128K / 3 GB | 242 | 1 |
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟