TinyLlama 1.1B Chat V1.0 Marlin By neuralmagic: Benchmarks, Features and Detailed Analysis. Insights on TinyLlama 1.1B Chat V1.0 Marlin.

Arxiv:2210.17323 4-bit Autotrain compatible Base model:quantized:tinyllama... Base model:tinyllama/tinyllama... Conversational Endpoints compatible Gptq Int4 Llama Marlin Nm-vllm Quantized Region:us Safetensors

Model Card on HF 🤗: https://huggingface.co/neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin

TinyLlama 1.1B Chat V1.0 Marlin Benchmarks

LLME Score: 0.16564

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

TinyLlama 1.1B Chat V1.0 Marlin (neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin)

🌟 Advertise your project 🚀

TinyLlama 1.1B Chat V1.0 Marlin Parameters and Internals

Model Type

llama

Additional Notes

Model files are optimized using nm-vllm, a high-throughput serving engine for compressed LLMs, employing Marlin format for efficient 4-bit inference.

Input Output

Input Format:

Messages formatted as user roles and content

Output Format:

Generated text based on the input prompt

Performance Tips:

Use nm-vllm for optimized performance

LLM Name	TinyLlama 1.1B Chat V1.0 Marlin
Repository 🤗	https://huggingface.co/neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin
Base Model(s)	TinyLlama/TinyLlama-1.1B-Chat-v1.0 TinyLlama/TinyLlama-1.1B-Chat-v1.0
Model Size	1.1b
Required VRAM	0.8 GB
Updated	2025-04-08
Maintainer	neuralmagic
Model Type	llama
Model Files	0.8 GB
GPTQ Quantization	Yes
Quantization Type	gptq
Model Architecture	LlamaForCausalLM
Context Length	2048
Model Max Length	2048
Transformers Version	4.38.1
Tokenizer Class	LlamaTokenizer
Padding Token	</s>
Vocabulary Size	32000
Torch Data Type	float16

Best Alternatives to TinyLlama 1.1B Chat V1.0 Marlin

Best Alternatives	Context / RAM	Downloads	Likes
Medicine Chat GPTQ	4K / 3.9 GB	9	6
Finance Chat GPTQ	4K / 3.9 GB	17	3
Law Chat GPTQ	4K / 3.9 GB	8	4
Gorilla Openfunctions V1 GPTQ	4K / 3.9 GB	6	5
TinyLlama 1.1B Chat V1.0 GPTQ	2K / 0.8 GB	84884	14
...inyLlama 1.1B Chat V1.0 Marlin	2K / 0.8 GB	957	2
TinyLlama 1.1B Chat V0.3 GPTQ	2K / 0.8 GB	136836	9
...Llama 1.1B Chat V1.0 GPTQ 4bit	2K / 0.8 GB	9617	0
....1B Chat V1.0 GPTQ Marlin 4bit	2K / 0.8 GB	1258	0
Llama Tagger HF GPTQ 4bits	2K / 3.9 GB	42	0

Note: green Score (e.g. "73.2") means that the model is better than neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin.

Rank the TinyLlama 1.1B Chat V1.0 Marlin Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 51566 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241124

Support LLM Explorer

TinyLlama 1.1B Chat V1.0 Marlin by neuralmagic

» All LLMs » neuralmagic » TinyLlama 1.1B Chat V1.0 Marlin URL Share it on

TinyLlama 1.1B Chat V1.0 Marlin Benchmarks

TinyLlama 1.1B Chat V1.0 Marlin Parameters and Internals

Best Alternatives to TinyLlama 1.1B Chat V1.0 Marlin

Rank the TinyLlama 1.1B Chat V1.0 Marlin Capabilities

What open-source LLMs or SLMs are you in search of? 51566 in total.