| LLM Name | Nllb 3.3B Salt Lr2e 4 Fp16 |
| Repository ๐ค | https://huggingface.co/jq/nllb-3.3b-salt-lr2e-4-fp16 |
| Model Size | 3.3b |
| Required VRAM | 13.4 GB |
| Updated | 2025-09-23 |
| Maintainer | jq |
| Model Type | m2m_100 |
| Model Files | |
| Quantization Type | fp16 |
| Model Architecture | TrainableM2MForConditionalGeneration |
| Context Length | 1024 |
| Model Max Length | 1024 |
| Transformers Version | 4.50.3 |
| Tokenizer Class | NllbTokenizer |
| Padding Token | <pad> |
| Vocabulary Size | 256206 |
| Torch Data Type | float32 |
| Activation Function | relu |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐