Fox 1 1.6B by tensoropera


Fox 1 1.6B is an open-source language model by tensoropera. Features: 1.6B parameters, 3.3 GB VRAM, 8K context, apache-2.0 license, LLM Explorer Score 0.16.
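
The VRAM figure follows roughly from the parameter count. A minimal sketch of that arithmetic, assuming 2 bytes per parameter (the bfloat16 data type listed for this model) and counting weights only; the function name `weight_memory_gb` is hypothetical, and activations and the KV cache are not included:

```python
def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for the weights alone, in decimal GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Nominal size from the listing: 1.6 billion parameters stored as bfloat16.
nominal = weight_memory_gb(1.6e9)
print(f"{nominal:.1f} GB")  # prints "3.2 GB"
```

The result is slightly below the listed 3.3 GB, which is expected when the nominal "1.6b" is a rounded parameter count. Passing `bytes_per_param=4` gives the float32 footprint for comparison.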

Tags: Arxiv:2411.05281, Conversational, Deploy:azure, En, Endpoints compatible, Llama, Model-index, Region:us, Safetensors


Fox 1 1.6B Parameters and Internals

Model Type: text-generation
Additional Notes: Fox-1 is a base pretrained model that requires further fine-tuning for most use cases.
Training Details:
  Data Sources: text, code
  Data Volume: 3 trillion tokens
  Methodology: 3-stage data curriculum
  Context Length: 8000
  Hardware Used: 8 H100 GPUs
  Model Architecture: decoder-only transformer-based small language model (SLM)
LLM Name: Fox 1 1.6B
Repository: https://huggingface.co/tensoropera/Fox-1-1.6B
Model Size: 1.6b
Required VRAM: 3.3 GB
Updated: 2026-05-11
Maintainer: tensoropera
Model Type: llama
Model Files: 3.3 GB
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.39.3
Tokenizer Class: GemmaTokenizer
Padding Token: <pad>
Vocabulary Size: 256000
Torch Data Type: bfloat16
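
The fields above (repository, `LlamaForCausalLM`, bfloat16, 8192-token maximum length) are enough to sketch how the checkpoint might be loaded with the Hugging Face `transformers` library. This is an illustration rather than the maintainer's documented procedure: it assumes `torch` and `transformers` are installed and that the repository is reachable, and the helper names `load_fox1`, `complete`, and `remaining_context` are hypothetical. Because Fox-1 is a base model, the output is a plain text continuation, not an instruction-following reply.

```python
MODEL_ID = "tensoropera/Fox-1-1.6B"  # repository name taken from the listing
MAX_LENGTH = 8192                    # "Model Max Length" from the listing


def remaining_context(prompt_tokens: int, max_length: int = MAX_LENGTH) -> int:
    """Tokens left for generation once the prompt is counted (never negative)."""
    return max(0, max_length - prompt_tokens)


def load_fox1(model_id: str = MODEL_ID):
    """Load tokenizer and model in bfloat16; downloads the weights on first use."""
    # Imported lazily so the helpers above work without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    return tokenizer, model


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Continue a prompt with the base model, capped by the remaining context."""
    tokenizer, model = load_fox1()
    inputs = tokenizer(prompt, return_tensors="pt")
    budget = remaining_context(inputs["input_ids"].shape[1])
    output_ids = model.generate(**inputs, max_new_tokens=min(max_new_tokens, budget))
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("The capital of France is")` would download the weights and return the prompt followed by the model's continuation. The `torch_dtype` argument matches the Transformers version listed here; newer releases may prefer a different spelling of that argument.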

Best Alternatives to Fox 1 1.6B

Best Alternatives | Context / RAM | Downloads, Likes
1.5 Pints 16K V0.1 | 16K / 3.1 GB | 1316
Fox 1 1.6B Instruct V0.1 | 8K / 3.3 GB | 2214
Subnet6 001 | 8K / 16.1 GB | 480
6 2 | 4K / 3.3 GB | 2910
6 1 | 4K / 3.3 GB | 430
Model 6 | 4K / 3.3 GB | 140
SN6 | 4K / 3.3 GB | 60
Mymodel | 4K / 3.3 GB | 50
Chuxin 1.6B Base | 4K / 3.3 GB | 2716
Chuxin 1.6B 1M | 4K / 3.3 GB | 111


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20260328a