MiniLLM 0.2B NoWudao by Tongjilibo

 Β»  All LLMs  Β»  Tongjilibo  Β»  MiniLLM 0.2B NoWudao   URL Share it on

  Autotrain compatible   Endpoints compatible   Llama   Pytorch   Region:us

MiniLLM 0.2B NoWudao Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
MiniLLM 0.2B NoWudao (Tongjilibo/MiniLLM-0.2B-NoWudao)
🌟 Advertise your project πŸš€

MiniLLM 0.2B NoWudao Parameters and Internals

Model Type 
Text Generation, Chatbot
Use Cases 
Areas:
Research, Shot-term chatbot functionalities
Primary Use Cases:
Simple chat or conversation scenarios
Limitations:
Limited to simple chat tasks, Cannot handle complex queries
Considerations:
The model's chat capabilities are limited due to size constraints and available training data.
Additional Notes 
The model is aimed at demonstrating feasible LLM deployment with budget-friendly resources
Supported Languages 
zh (δΈ­ζ–‡)
Training Details 
Data Sources:
WikiδΈ­ζ–‡η™Ύη§‘, BaiduBaiKe, C4_zh, WuDaoCorpora, shibing624/medical
Data Volume:
634 Billion Tokens
Methodology:
Pre-training followed by instruction fine-tuning
Training Time:
NoWudao: 20h, WithWudao: 3.79d
Hardware Used:
4Γ—A800 (80G)
Model Architecture:
bert4torch framework used for training
Input Output 
Input Format:
Pre-tokenized input suitable for Llama models
Accepted Modalities:
Text
Output Format:
Textual responses
Performance Tips:
Recommended to follow the installation and usage guidelines for optimal performance.
Release Notes 
Version:
20240316
Notes:
Initial model release including pre-trained versions with and without Wudao datasets.
LLM NameMiniLLM 0.2B NoWudao
Repository πŸ€—https://huggingface.co/Tongjilibo/MiniLLM-0.2B-NoWudao 
Model Size0.2b
Required VRAM0.9 GB
Updated2025-03-22
MaintainerTongjilibo
Model Typellama
Model Files  0.9 GB   0.9 GB
Model ArchitectureLlamaForCausalLM
Licenseapache-2.0
Context Length1024
Model Max Length1024
Tokenizer ClassChatGLMTokenizer
Vocabulary Size64793

Best Alternatives to MiniLLM 0.2B NoWudao

Best Alternatives
Context / RAM
Downloads
Likes
Informer 0.2B 4K4K / 0.5 GB64
MiniLLM 0.2B WithWudao SFT1K / 0.9 GB81
MiniLLM 0.2B SFT1K / 0.9 GB81
MiniLLM 0.2B WithWudao1K / 0.9 GB1272
MiniLLM 0.2B Base1K / 0.9 GB112
MiniLLM 0.2B NoWudao Base1K / 0.9 GB81
Note: green Score (e.g. "73.2") means that the model is better than Tongjilibo/MiniLLM-0.2B-NoWudao.

Rank the MiniLLM 0.2B NoWudao Capabilities

πŸ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51534 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124