Anima 7B 100K by lyogavin


  100k   7b   Autotrain compatible   Custom code   En   Endpoints compatible   Llama   Llama2   Pytorch   Region:us   Sharded
Model Card on HF: https://huggingface.co/lyogavin/Anima-7B-100K


Anima 7B 100K Parameters and Internals

Model Type 
Causal Language Model
Use Cases 
Areas:
Research, Commercial applications
Applications:
Long context input processing, QA systems for large token inputs
Primary Use Cases:
Efficient handling of long inputs, enabled by memory optimizations
Limitations:
Limited information on performance benchmarks for 100k tokens
Additional Notes 
The model supports inputs of up to 100k tokens, far longer than the context window of typical language models.
Supported Languages 
en (proficient)
Training Details 
Data Sources:
Long QA datasets
Data Volume:
Datasets with sequence lengths of 30k to 100k tokens
Methodology:
Trained on curated long QA datasets, with memory optimizations to support 100k-token inputs
Context Length:
100000
Model Architecture:
Extension and optimization based on Llama2 7B
Input Output 
Input Format:
Long-form prompts of up to 100K tokens
Accepted Modalities:
text
Output Format:
Textual generated responses
Performance Tips:
Avoid using high-memory accelerators; if an out-of-memory (OOM) error is encountered, adjust the cache usage settings
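
The tip about cache settings is easier to weigh with a rough number for how much memory a key/value cache needs at this input length. The sketch below is a back-of-the-envelope estimate only. It assumes the standard Llama 2 7B shape (32 layers, hidden size 4096, float16 values), which is not read from this model's config, and the helper name `kv_cache_bytes` is hypothetical.

```python
# Rough KV-cache size estimate for a Llama 2 7B-shaped model.
# ASSUMPTION: standard Llama 2 7B dimensions; Anima's custom code may differ.
NUM_LAYERS = 32        # transformer blocks
HIDDEN_SIZE = 4096     # 32 attention heads x 128 dimensions each
BYTES_PER_VALUE = 2    # float16


def kv_cache_bytes(num_tokens: int, batch_size: int = 1) -> int:
    """Bytes needed to cache keys and values for `num_tokens` tokens."""
    # One key vector and one value vector per layer, per token.
    per_token = 2 * NUM_LAYERS * HIDDEN_SIZE * BYTES_PER_VALUE
    return per_token * num_tokens * batch_size


if __name__ == "__main__":
    for n in (4_096, 32_768, 100_000):
        print(f"{n:>7} tokens: {kv_cache_bytes(n) / 1e9:.1f} GB")
```

Under these assumptions a full 100,000-token cache comes to about 52.4 GB on top of the model weights, which is one reason cache settings matter when OOM errors appear.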
LLM Name: Anima 7B 100K
Repository: https://huggingface.co/lyogavin/Anima-7B-100K
Model Size: 7b
Required VRAM: 13.5 GB
Updated: 2025-09-23
Maintainer: lyogavin
Model Type: llama
Model Files: 10.0 GB (1-of-2), 3.5 GB (2-of-2)
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.31.0
Tokenizer Class: LlamaTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Unk Token: <unk>
Vocabulary Size: 32000
Torch Data Type: float16
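
The "Required VRAM" figure lines up with the size of the weights alone, which can be checked with simple arithmetic. The sketch below assumes the standard Llama 2 7B parameter count (6,738,415,616); that number is not taken from this listing, and the helper name `weight_gb` is hypothetical.

```python
# Estimate weight memory from parameter count and data type.
# ASSUMPTION: parameter count of standard Llama 2 7B with a 32000-token vocabulary.
LLAMA2_7B_PARAMS = 6_738_415_616
BYTES_PER_DTYPE = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def weight_gb(num_params: int, dtype: str = "float16") -> float:
    """Memory for the weights alone, in decimal gigabytes."""
    return num_params * BYTES_PER_DTYPE[dtype] / 1e9


if __name__ == "__main__":
    print(f"float16 weights: {weight_gb(LLAMA2_7B_PARAMS):.1f} GB")
    print(f"sum of listed shards: {10.0 + 3.5:.1f} GB")
```

Both values come to roughly 13.5 GB. The estimate covers weights only; activations and any key/value cache for long inputs need additional memory.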

Best Alternatives to Anima 7B 100K

Best Alternatives    Context / RAM      Downloads / Likes
A6 L                 1024K / 16.1 GB    2010
A3.4                 1024K / 16.1 GB    130
A5.4                 1024K / 16.1 GB    120
A2.4                 1024K / 16.1 GB    120
M                    1024K / 16.1 GB    1270
157                  1024K / 16.1 GB    1010
124                  1024K / 16.1 GB    930
162                  1024K / 16.1 GB    600
2 Very Sci Fi        1024K / 16.1 GB    3170
118                  1024K / 16.1 GB    150

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124