Ov Opt 350M 8bit Kv Cache is an open-source language model by vuiseng9. Features: 350m LLM, VRAM: 0.4GB, Context: 2K, License: other, Quantized, LLM Explorer Score: 0.08.
Ov Opt 350M 8bit Kv Cache Parameters and Internals
Model Type
text-generation
Use Cases
Considerations:
More information needed
Additional Notes
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 5e-05
- train_batch_size: 8
- eval_batch_size: 1
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- training_steps: 1
### Training results
### Framework versions
- Transformers 4.30.2
- Pytorch 2.0.1+cu117
- Datasets 2.13.1
- Tokenizers 0.13.3
Note: green Score (e.g. "73.2") means that the model is better than vuiseng9/ov-opt-350m-8bit-kv-cache.
Rank the Ov Opt 350M 8bit Kv Cache Capabilities
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation
What open-source LLMs or SLMs are you in search of? 52473 in total.