M3 Open Lr5eminus5 Trainall by anfindsen


  Arxiv:1910.09700   Autotrain compatible   Conversational   Endpoints compatible   Qwen3   Region:us   Safetensors

M3 Open Lr5eminus5 Trainall Benchmarks

Scores in the form nn.n% show how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

M3 Open Lr5eminus5 Trainall Parameters and Internals

LLM Name: M3 Open Lr5eminus5 Trainall
Repository: https://huggingface.co/anfindsen/M3_open_lr5eminus5_trainall
Model Size: 596m
Required VRAM: 1.2 GB
Updated: 2025-06-09
Maintainer: anfindsen
Model Type: qwen3
Model Files: 1.2 GB
Model Architecture: Qwen3ForCausalLM
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.51.3
Tokenizer Class: Qwen2Tokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 151936
Torch Data Type: bfloat16
Errors: replace
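
The listed model size (596m parameters), data type (bfloat16) and required VRAM (1.2 GB) are consistent with a simple weights-only estimate. The sketch below is an illustration under that assumption, not the method the listing itself uses; the function name and the dtype table are hypothetical helpers, and the estimate ignores activations and the KV cache.

```python
# Bytes per parameter for common torch dtypes (standard storage sizes).
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2, "int8": 1}


def estimate_weight_memory_gb(num_params: int, dtype: str = "bfloat16") -> float:
    """Approximate size of the model weights in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Values taken from the listing: 596m parameters stored as bfloat16.
weights_gb = estimate_weight_memory_gb(596_000_000, "bfloat16")
print(f"{weights_gb:.1f} GB")  # prints "1.2 GB"
```

Loading the same weights as float32 would roughly double the figure, which is why the data type matters when comparing the RAM column of similar models.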

Best Alternatives to M3 Open Lr5eminus5 Trainall

Best Alternatives | Context / RAM | Downloads, Likes
Sam Reason A1 | 40K / 2.4 GB | 391
SFT Nochat FULL DATA | 32K / 1.2 GB | 4240
Fft Qwen | 32K / 0 GB | 1290
Sft Scp Epoch1 | 32K / 1.2 GB | 4050
MNLP M3 Mcqa Model | 32K / 1.2 GB | 840
Qwen3 Wiki Sciq Mmlu | 32K / 1.2 GB | 730
MNLP M3 Mcqa Model | 32K / 1.2 GB | 271
... 15000 B4 2E 512T LR1e 05 ACC4 | 32K / 2.4 GB | 1340
...DPO Model Smoltalk Bigger Test | 32K / 1.2 GB | 390
Full Dataset Instruction V4 | 32K / 1.2 GB | 170


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124