Ct2fast Pythia 12B Sft V8 7K Steps by michaelfeil


Tags: Ctranslate2, En, Endpoints compatible, Float16 - sft, Int8, Region:us


Ct2fast Pythia 12B Sft V8 7K Steps Parameters and Internals

Model Type 
text-generation
Use Cases 
Areas:
research, commercial applications
Additional Notes 
Fast inference with CTranslate2 using int8 quantization and float16 computation.
Supported Languages 
en (proficient)
Training Details 
Data Sources:
oasst_export, vicuna, dolly15k, grade_school_math_instructions, code_alpaca, red_pajama, wizardlm_70k, poem_instructions
Data Volume:
multiple datasets with varying val_split and fraction parameters
Methodology:
supervised fine-tuning
Context Length:
2048
Model Architecture:
Derived from OpenAssistant/pythia-12b-pre-v8-12.5k-steps
Input Output 
Accepted Modalities:
text
Performance Tips:
Use 'device="cuda", compute_type="int8_float16"' for optimal CUDA performance.
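As a rough illustration of the performance tip, the sketch below passes those settings to `ctranslate2.Generator`. It is a minimal sketch, not the maintainer's reference code. Several things are assumptions rather than details from this page: a converted copy of the model already sits in a local directory (`model_dir`), matching tokenizer files are available at `tokenizer_dir`, and plain `int8` is used as a CPU fallback. The helper names `pick_ct2_settings` and `generate` are hypothetical.

```python
from typing import Dict


def pick_ct2_settings(has_cuda: bool) -> Dict[str, str]:
    """Return device/compute_type keyword arguments for CTranslate2."""
    if has_cuda:
        # Settings quoted in the performance tip.
        return {"device": "cuda", "compute_type": "int8_float16"}
    # Assumption: plain int8 as a CPU fallback; this page does not state one.
    return {"device": "cpu", "compute_type": "int8"}


def generate(model_dir: str, tokenizer_dir: str, prompt: str,
             has_cuda: bool = True, max_length: int = 64) -> str:
    """Generate a continuation of `prompt` with a local CTranslate2 model."""
    # Imported lazily so pick_ct2_settings works without these packages.
    import ctranslate2
    from transformers import AutoTokenizer

    generator = ctranslate2.Generator(model_dir, **pick_ct2_settings(has_cuda))
    tokenizer = AutoTokenizer.from_pretrained(tokenizer_dir)

    # CTranslate2 generators take token strings, not token ids.
    start_tokens = tokenizer.convert_ids_to_tokens(tokenizer.encode(prompt))
    results = generator.generate_batch(
        [start_tokens],
        max_length=max_length,
        include_prompt_in_result=False,
    )
    return tokenizer.decode(results[0].sequences_ids[0])
```

Calling `generate("path/to/model", "path/to/tokenizer", "Hello")` would need both directories to exist locally and, with `has_cuda=True`, a GPU with enough memory for the weights.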
LLM Name: Ct2fast Pythia 12B Sft V8 7K Steps
Repository: https://huggingface.co/michaelfeil/ct2fast-pythia-12b-sft-v8-7k-steps
Model Size: 12b
Required VRAM: 23.7 GB
Updated: 2025-06-09
Maintainer: michaelfeil
Model Files: 23.7 GB
Supported Languages: en
Model Architecture: AutoModel
License: apache-2.0
Tokenizer Class: GPTNeoXTokenizer
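For a sense of scale, the listed 23.7 GB can be compared with a back-of-the-envelope estimate of weight storage. The sketch below assumes a nominal 12e9 parameters (read off the "12b" label, not an exact count) and counts weights only. Activations and the attention cache would add to the real VRAM requirement. The function name `weight_footprint_gb` is hypothetical.

```python
def weight_footprint_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate size of the weights alone, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


# Assumption: a nominal 12e9 parameters, taken from the "12b" size label.
fp16_gb = weight_footprint_gb(12e9, 2)  # float16 stores 2 bytes per weight
int8_gb = weight_footprint_gb(12e9, 1)  # int8 stores 1 byte per weight

print(f"float16: about {fp16_gb:.1f} GB, int8: about {int8_gb:.1f} GB")
```

Under these assumptions the float16 figure lands near the listed size, while int8 storage would need roughly half of it.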

Best Alternatives to Ct2fast Pythia 12B Sft V8 7K Steps

Best Alternatives | Context / RAM | Downloads Likes
...tral Nemo 12B Abliterated LORA | 0K / 0.5 GB | 03
... 12b Reasoning Psychology Lora | 0K / 0.2 GB | 02
Mistral FreeLiPPA LoRA 12B | 0K / 1.8 GB | 61
Ct2fast M2m100 12B Last Ckpt | 0K / 23.6 GB | 157
Ct2fast Dolly V2 12B | 0K / 11.9 GB | 143
Llama3 12B Wwe GGUF | 0K / 5.3 GB | 320
Calme 12B Instruct V0.1 GGUF | 0K / 4.7 GB | 1282
Merlyn Education Safety GGUF | 0K / 4.9 GB | 861
Dolly V2 GGML | 0K / 1.6 GB | 232

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124