Saiga2 13B Lora by IlyaGusev

 ยป  All LLMs  ยป  IlyaGusev  ยป  Saiga2 13B Lora   URL Share it on

  Adapter   Conversational Dataset:ilyagusev/oasst1 ru ma... Dataset:ilyagusev/ru sharegpt ... Dataset:ilyagusev/ru turbo alp... Dataset:ilyagusev/ru turbo alp... Dataset:ilyagusev/ru turbo sai...   Dataset:lksy/ru instruct gpt4   Finetuned   Instruct   Lora   Region:us   Ru

Saiga2 13b Lora Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Saiga2 13B Lora (IlyaGusev/saiga2_13b_lora)
๐ŸŒŸ Advertise your project ๐Ÿš€

Saiga2 13B Lora Parameters and Internals

Model Type 
conversational
Additional Notes 
Based on LLaMA-2 13B HF with an adapter-only version for Russian language.
Supported Languages 
ru (Russian)
Training Details 
Data Sources:
IlyaGusev/ru_turbo_alpaca, IlyaGusev/ru_turbo_saiga, IlyaGusev/ru_sharegpt_cleaned, IlyaGusev/oasst1_ru_main_branch, IlyaGusev/ru_turbo_alpaca_evol_instruct, lksy/ru_instruct_gpt4
Methodology:
Adapter-only version trained using the development version of `transformers` and `peft`.
Model Architecture:
LLaMA-2 based chatbot
Input Output 
Input Format:
~~{role} {content}~~
Accepted Modalities:
text
Output Format:
text
Release Notes 
Version:
v1
Notes:
Trained on multiple datasets including ru_turbo_alpaca and ru_turbo_saiga.
LLM NameSaiga2 13b Lora
Repository ๐Ÿค—https://huggingface.co/IlyaGusev/saiga2_13b_lora 
Model Size13b
Required VRAM0.1 GB
Updated2025-09-21
MaintainerIlyaGusev
Instruction-BasedYes
Model Files  0.1 GB
Supported Languagesru
Model ArchitectureAdapter
Licensecc-by-4.0
Model Max Length2048
Is Biasednone
Tokenizer ClassLlamaTokenizer
PEFT TypeLORA
LoRA ModelYes
PEFT Target Modulesq_proj|v_proj|k_proj|o_proj
LoRA Alpha16
LoRA Dropout0.05
R Param16

Best Alternatives to Saiga2 13B Lora

Best Alternatives
Context / RAM
Downloads
Likes
...10 2024 06 23 06 24 24 35586350K / 0.8 GB50
...10 2024 06 22 21 11 23 35586240K / 0.8 GB50
Typescript Instruct 20K V40K / 26 GB52
Typescript Instruct 20K V20K / 26 GB62
Llama 2 13B Instruct V0.20K / 0.2 GB1010
... 13B Instruct Lora Jaster V1.00K / 0.1 GB52
...t Lora Jaster Dolly Oasst V1.00K / 0.1 GB81
...Instruct Lora Dolly Oasst V1.00K / 0.1 GB41
RuGPT 3.5 13B Lora0K / 0.1 GB712
...lama 13B Instruction Finetune20K / 0 GB61
Note: green Score (e.g. "73.2") means that the model is better than IlyaGusev/saiga2_13b_lora.

Rank the Saiga2 13B Lora Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 51507 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124