Meditron 7B by epfl-llm


Tags: arXiv:2311.16079 · Base model (finetune): meta-llama/Llama-2-7b · Dataset: epfl-llm/guidelines · English · Llama · PyTorch · TensorFlow · Safetensors · Sharded · Autotrain compatible · Endpoints compatible · Region: US
Model Card on HF 🤗: https://huggingface.co/epfl-llm/meditron-7b

Meditron 7B Benchmarks

Scores are shown as percentages (nn.n%) indicating how the model compares to the reference models: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").

Meditron 7B Parameters and Internals

Model Type 
Causal decoder-only transformer language model
Use Cases 
Areas:
Medical exam question answering, Supporting differential diagnosis, Disease information query, General health information query
Limitations:
Not recommended for production use, Unsuitable for professional purposes related to health and medicine
Considerations:
Use in production environments requires rigorous evaluation and alignment processes.
Additional Notes 
Significant research is still required to fully explore potential bias, fairness, and safety issues.
Supported Languages 
English (mainly)
Training Details 
Data Sources:
Clinical Guidelines, Medical Paper Abstracts, Medical Papers, Replay Data
Data Volume:
48.1B tokens
Methodology:
Continued pretraining
Context Length:
2048
Training Completed:
September 2023
Hardware Used:
1 node of 8x NVIDIA A100 (80GB) SXM GPUs
Model Architecture:
Llama 2, Hidden dimension: 4096, Number of attention heads: 32, Number of layers: 32
Input Output 
Input Format:
Text-only data
Accepted Modalities:
Text
Output Format:
Text
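
For reference, here is a minimal usage sketch with the Hugging Face transformers library, assuming access to the gated repository has been granted (the prompt is illustrative only):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "epfl-llm/meditron-7b"

# Requires prior access approval for the gated repo and a login,
# e.g. via `huggingface-cli login`.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the card's bfloat16 weights
    device_map="auto",
)

prompt = "What are the common symptoms of iron-deficiency anemia?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Text in, text out; keep prompt plus generation within the
# 2048-token context window listed above.
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))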
LLM Name: Meditron 7B
Repository 🤗: https://huggingface.co/epfl-llm/meditron-7b
Base Model(s): Llama 2 7B (meta-llama/Llama-2-7b)
Model Size: 7B
Required VRAM: 13.4 GB (see the estimate after this table)
Updated: 2025-06-09
Maintainer: epfl-llm
Model Type: llama
Model Files: 1.9 GB (1-of-8), 1.9 GB (2-of-8), 1.8 GB (3-of-8), 1.9 GB (4-of-8), 1.9 GB (5-of-8), 1.8 GB (6-of-8), 1.9 GB (7-of-8), 0.3 GB (8-of-8)
Supported Languages: en
Gated Model: Yes
Model Architecture: LlamaForCausalLM
License: proprietary
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.35.2
Tokenizer Class: LlamaTokenizer
Padding Token: <PAD>
Vocabulary Size: 32017
Torch Data Type: bfloat16
Meditron 7B (epfl-llm/meditron-7b)
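
The Required VRAM figure above is consistent with the parameter count and data type: Llama 2 7B has roughly 6.74B parameters, and bfloat16 stores each in 2 bytes. A quick back-of-the-envelope check (weights only; KV cache and activations add overhead):

params = 6.74e9          # approximate parameter count of Llama 2 7B
bytes_per_param = 2      # bfloat16 = 2 bytes per parameter
print(f"{params * bytes_per_param / 1e9:.1f} GB")  # -> 13.5 GB, near the 13.4 GB listed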

Quantized Models of the Meditron 7B

Model | Likes | Downloads | VRAM
Meditron 7B AWQ | 3 | 29919 | 3 GB
...editron 7B Lora Finetuned 4bit | 0 | 22 | 3 GB
Meditron 7B GGUF | 23 | 960 | 2 GB
Meditron 7B GPTQ | 3 | 52 | 3 GB
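
The GGUF quantizations can run on CPU or modest GPUs via llama.cpp. Below is a minimal sketch using llama-cpp-python; the local file name is hypothetical and depends on which quant you download:

from llama_cpp import Llama

# Hypothetical path to a downloaded GGUF quant of Meditron 7B.
llm = Llama(
    model_path="./meditron-7b.Q4_K_M.gguf",
    n_ctx=2048,  # matches the model's 2048-token context length
)

out = llm("What are the common symptoms of iron-deficiency anemia?", max_tokens=256)
print(out["choices"][0]["text"])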

Best Alternatives to Meditron 7B

Best Alternatives | Context / RAM | Downloads | Likes
A6 L | 1024K / 16.1 GB | 201 | 0
M | 1024K / 16.1 GB | 127 | 0
157 | 1024K / 16.1 GB | 101 | 0
A3.4 | 1024K / 16.1 GB | 13 | 0
124 | 1024K / 16.1 GB | 93 | 0
A5.4 | 1024K / 16.1 GB | 12 | 0
A2.4 | 1024K / 16.1 GB | 12 | 0
2 Very Sci Fi | 1024K / 16.1 GB | 317 | 0
162 | 1024K / 16.1 GB | 60 | 0
118 | 1024K / 16.1 GB | 15 | 0



Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124