Starcoderbase Triviaqa by vwxyzjn


  Autotrain compatible   Codegen   Dataset:trivia qa   En   Endpoints compatible   Gpt bigcode   Pytorch   Region:us   Rlhf   Safetensors   Sharded   Tensorflow   Trl

Starcoderbase Triviaqa (vwxyzjn/starcoderbase-triviaqa)

Starcoderbase Triviaqa Parameters and Internals

Model Type 
transformers, reinforcement learning
Use Cases 
Areas:
Research, Commercial Applications
Limitations:
May generate incorrect or misleading answers; may copy answers from the training data verbatim; may generate hateful, discriminatory, or otherwise offensive language.
Considerations:
Answers should be validated through external sources.
Supported Languages 
en (English)
Training Details 
Data Sources:
TriviaQA
Methodology:
Fine-tuned using reinforcement learning via TRL's `TextEnvironment`.
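The listing does not say which reward signal drove the reinforcement learning step. As a rough illustration only, a trivia-style reward could score a generated answer by checking whether it contains any accepted answer alias. The function names `normalize` and `exact_match_reward` below are hypothetical; they are not taken from this model's training code or from TRL's API.

```python
# Hypothetical sketch: a containment-based reward for trivia answers.
# This is an assumption about what such a reward might look like,
# not the reward actually used to train this model.

def normalize(text):
    """Lowercase and collapse whitespace so casing and spacing are ignored."""
    return " ".join(text.lower().split())


def exact_match_reward(responses, answers):
    """Return 1.0 for each response containing an accepted alias, else 0.0.

    responses: list of generated answer strings.
    answers:   list of alias lists, one list of accepted answers per response.
    """
    rewards = []
    for response, aliases in zip(responses, answers):
        normalized_response = normalize(response)
        # Skip empty aliases, which would otherwise match every response.
        hit = any(
            normalize(alias) in normalized_response
            for alias in aliases
            if normalize(alias)
        )
        rewards.append(1.0 if hit else 0.0)
    return rewards


if __name__ == "__main__":
    print(exact_match_reward(
        ["The capital is Paris.", "I think it is Rome"],
        [["Paris"], ["Madrid", "Lisbon"]],
    ))
```

A binary reward like this is easy to compute but coarse: it gives no credit for partially correct answers and can be fooled by a response that merely mentions the alias.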
Responsible AI Considerations
Mitigation Strategies:
Disparities between the data contributors and users should inform developers in assessing appropriate use cases. Further research is needed to attribute model generations to sources in the training data.
LLM Name: Starcoderbase Triviaqa
Repository: https://huggingface.co/vwxyzjn/starcoderbase-triviaqa
Model Size: 15.5B
Required VRAM: 62.1 GB
Updated: 2025-08-22
Maintainer: vwxyzjn
Model Type: gpt_bigcode
Model Files: 9.9 GB (1 of 7), 9.9 GB (2 of 7), 9.8 GB (3 of 7), 9.9 GB (4 of 7), 9.8 GB (5 of 7), 9.9 GB (6 of 7), 2.9 GB (7 of 7)
Supported Languages: en
Generates Code: Yes
Model Architecture: GPTBigCodeForCausalLM
License: bigscience-openrail-m
Transformers Version: 4.30.2
Tokenizer Class: GPT2Tokenizer
Vocabulary Size: 49152
Torch Data Type: float32
Activation Function: gelu
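The 62.1 GB VRAM figure can be cross-checked against the other listed values. The sketch below sums the seven shard sizes from the Model Files entry and, separately, multiplies the parameter count by the bytes per parameter for the stored data type. It assumes "15.5B" means 15.5 billion parameters and that GB means 10^9 bytes; the helper names are illustrative, and the estimate covers weights only, not activations or the KV cache.

```python
# Shard sizes in GB, as listed under "Model Files" (seven shards).
SHARD_SIZES_GB = [9.9, 9.9, 9.8, 9.9, 9.8, 9.9, 2.9]

# Bytes needed to store one parameter in common data types.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def total_checkpoint_gb(shards):
    """Sum the shard sizes, rounded to one decimal like the listing."""
    return round(sum(shards), 1)


def estimate_weights_gb(n_params, dtype):
    """Estimate memory for the weights alone, in GB (10^9 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    print(total_checkpoint_gb(SHARD_SIZES_GB))        # sum of listed shards
    print(estimate_weights_gb(15.5e9, "float32"))     # weights as stored
    print(estimate_weights_gb(15.5e9, "float16"))     # weights at half precision
```

Both routes land close to the listed requirement, which suggests the figure reflects the float32 checkpoint size. Loading in a 16-bit type would roughly halve it.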

Best Alternatives to Starcoderbase Triviaqa

Best Alternatives   Context / RAM   Downloads   Likes
Starchat Alpha   0K / 31.2 GB   1816232
Starchat Beta   0K / 31.2 GB   1933263
...der15B Personal Copilot Merged   0K / 30.9 GB   111
Octocoder   0K / 62.1 GB   10468
...der15B Personal Copilot Merged   0K / 31.2 GB   263



Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124