Fin Llama 33B Merged by bavest


Tags: Autotrain compatible, Dataset:bavest/fin-llama-datas..., Endpoints compatible, Finance, Llama, Pytorch, Region:us, Sharded, Trading


Fin Llama 33B Merged Parameters and Internals

Model Type: finance, llm, llama, trading
Use Cases:
  Areas: finance
  Limitations: 4-bit inference is slow; the 'fp16' compute type can be unstable; tokenizer.bos_token_id must be 1
Additional Notes: Acknowledges Meta for the LLaMA models; builds on several repositories, including Stanford Alpaca, QLoRA, Chinese-Guanaco, and LMSYS FastChat
Training Details:
  Data Sources: bavest/fin-llama-dataset
  Methodology: QLoRA finetuning with 4-bit quantization
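
The limitations listed above suggest a few loading precautions. The following is a minimal sketch, not the maintainers' documented procedure. It assumes a recent transformers release with bitsandbytes and accelerate installed, and a GPU that supports bfloat16. The NF4 quantization type, double quantization, and the bfloat16 compute dtype are assumptions chosen to avoid the fp16 instability; only the repository id and the bos_token_id value come from this listing.

```python
MODEL_ID = "bavest/fin-llama-33b-merged"  # repository id from this listing
REQUIRED_BOS_TOKEN_ID = 1                 # from the Limitations entry


def load_fin_llama(model_id: str = MODEL_ID):
    """Load the model in 4-bit with a bfloat16 compute dtype (assumed settings)."""
    # Imported lazily so this module can be read without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    quant_config = BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",              # assumption: NF4, common with QLoRA
        bnb_4bit_use_double_quant=True,         # assumption
        bnb_4bit_compute_dtype=torch.bfloat16,  # assumption: avoids fp16 compute
    )

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # The listing states that the BOS token id must be 1, so set it explicitly.
    tokenizer.bos_token_id = REQUIRED_BOS_TOKEN_ID

    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=quant_config,
        device_map="auto",  # let accelerate place the shards on available devices
    )
    return model, tokenizer
```

Calling load_fin_llama() downloads all seven shards, so it is best run on a machine with enough disk space and GPU memory for a 33B model in 4-bit.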
LLM Name: Fin Llama 33B Merged
Repository: https://huggingface.co/bavest/fin-llama-33b-merged
Model Size: 33B
Required VRAM: 65.2 GB
Updated: 2025-08-20
Maintainer: bavest
Model Type: llama
Model Files: 9.8 GB (1 of 7), 10.0 GB (2 of 7), 9.9 GB (3 of 7), 9.9 GB (4 of 7), 9.9 GB (5 of 7), 10.0 GB (6 of 7), 5.7 GB (7 of 7)
Model Architecture: LlamaForCausalLM
License: gpl
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.29.2
Tokenizer Class: LlamaTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Unk Token: <unk>
Vocabulary Size: 32000
Torch Data Type: float16
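
As a quick consistency check, the seven shard sizes can be summed and compared with the listed VRAM requirement. The sketch below assumes that 1 GB means 10**9 bytes and that the shards contain only float16 weights; both are assumptions, not statements from the listing.

```python
# Shard sizes in GB, copied from the "Model Files" entry.
SHARD_SIZES_GB = [9.8, 10.0, 9.9, 9.9, 9.9, 10.0, 5.7]
BYTES_PER_FP16_PARAM = 2  # float16 stores each parameter in 2 bytes

# Total size of the weights on disk, rounded to the listing's precision.
total_gb = round(sum(SHARD_SIZES_GB), 1)

# Assumption: 1 GB = 10**9 bytes and the shards hold only fp16 weights,
# so GB divided by bytes per parameter gives billions of parameters.
implied_params_billion = round(total_gb / BYTES_PER_FP16_PARAM, 1)

print(f"total shard size: {total_gb} GB")
print(f"implied parameter count: ~{implied_params_billion}B")
```

The total equals the 65.2 GB listed as Required VRAM, and the implied parameter count is in line with the 33B model size.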

Quantized Models of the Fin Llama 33B Merged

Model   Likes/Downloads   VRAM
Fin Llama 33B GGUF   2233   13 GB
Fin Llama 33B AWQ   513   17 GB
Fin Llama 33B GPTQ   722   16 GB
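
To compare these variants against a GPU memory budget, a small helper can be sketched as follows. The VRAM figures (13, 17, and 16 GB) are an interpretation of the listing, whose digits run together in the source. The helper name variants_that_fit and the headroom_gb allowance for the KV cache and activations are hypothetical, introduced only for illustration.

```python
# Approximate VRAM figures in GB, as interpreted from the quantized-model listing.
QUANTIZED_VRAM_GB = {"GGUF": 13, "AWQ": 17, "GPTQ": 16}
FULL_FP16_VRAM_GB = 65.2  # "Required VRAM" of the merged fp16 model


def variants_that_fit(budget_gb: float, headroom_gb: float = 2.0) -> list[str]:
    """Return the variants whose listed VRAM plus headroom fits in the budget.

    headroom_gb is a hypothetical allowance for the KV cache and activations.
    """
    return sorted(
        name
        for name, vram in QUANTIZED_VRAM_GB.items()
        if vram + headroom_gb <= budget_gb
    )


# Show each variant's footprint relative to the unquantized fp16 model.
for name, vram in QUANTIZED_VRAM_GB.items():
    print(f"{name}: {vram} GB, {vram / FULL_FP16_VRAM_GB:.0%} of the fp16 footprint")
```

For example, variants_that_fit(24) returns all three variants, while variants_that_fit(16) leaves only GGUF once the assumed 2 GB of headroom is added.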

Best Alternatives to Fin Llama 33B Merged

Best Alternatives   Context / RAM   Downloads/Likes
...angled Llama 33M 32K Base V0.1   32K / 0.1 GB   221
ReflectionCoder DS 33B   16K / 67 GB   88994
Deepseek Wizard 33B Slerp   16K / 35.3 GB   70
ValidateAI 33B Slerp   16K / 35.4 GB   50
Deepseek Coder 33B Instruct   16K / 66.5 GB   15258537
Chronos Divergence 33B   16K / 65 GB   530
WhiteRabbitNeo 33B V1   16K / 67 GB   156587
ValidateAI 3 33B Ties   16K / 66.5 GB   60
ValidateAI 2 33B AT   16K / 66.5 GB   50
...dy Deepseekcoder 33B V16.1 32K   16K / 67.1 GB   17770

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124