Gemma 3 1B It Reasoning Grpo Lora by codelion


Gemma 3 1B It Reasoning Grpo Lora is an open-source LoRA adapter for google/gemma-3-1b-it, published by codelion. Model size: 1B; required VRAM: 0.2 GB; license: apache-2.0; LLM Explorer Score: 0.2.

Tags: Adapter, Base model:adapter:google/gemm..., Base model:google/gemma-3-1b-i..., Chain-of-thought, Dataset:codelion/gemma-3-1b-it..., Ellora, En, Finetuned, Gemma, Grpo, Lora, Peft, Preference-learning, Reasoning, Region:us, Safetensors, Self-improvement, Thinking

Gemma 3 1B It Reasoning Grpo Lora Parameters and Internals

LLM Name: Gemma 3 1B It Reasoning Grpo Lora
Repository: https://huggingface.co/codelion/gemma-3-1b-it-reasoning-grpo-lora
Base Model(s): Gemma 3 1B It (google/gemma-3-1b-it)
Model Size: 1b
Required VRAM: 0.2 GB
Updated: 2026-05-14
Maintainer: codelion
Model Type: gemma
Model Files: 0.2 GB
Supported Languages: en
Model Architecture: Adapter
License: apache-2.0
Is Biased: none
PEFT Type: LORA
LoRA Model: Yes
PEFT Target Modules: up_proj|v_proj|o_proj|gate_proj|down_proj|q_proj|k_proj
LoRA Alpha: 128
LoRA Dropout: 0.2
R Param: 64
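
The LoRA settings listed above (R Param 64, LoRA Alpha 128, seven target modules) can be turned into a rough size estimate. The sketch below is illustrative only: it assumes the conventional LoRA scaling of alpha / r, and the layer shapes for google/gemma-3-1b-it (hidden size 1152, MLP size 6912, 26 layers, 4 query heads and 1 key/value head of dimension 256) are assumptions that do not come from this listing and should be checked against the base model's config. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """Hyperparameters copied from the listing above."""
    r: int = 64
    alpha: int = 128
    dropout: float = 0.2

    @property
    def scaling(self) -> float:
        # Conventional LoRA scaling applied to the low-rank update (assumed here).
        return self.alpha / self.r


def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    """Parameters in one LoRA pair: A is (r x d_in), B is (d_out x r)."""
    return r * (d_in + d_out)


# ASSUMED base-model shapes, not stated in the listing.
HIDDEN, INTERMEDIATE, LAYERS = 1152, 6912, 26
Q_OUT, KV_OUT = 4 * 256, 1 * 256

# (input dim, output dim) for each target module named in the listing.
TARGET_SHAPES = {
    "q_proj": (HIDDEN, Q_OUT),
    "k_proj": (HIDDEN, KV_OUT),
    "v_proj": (HIDDEN, KV_OUT),
    "o_proj": (Q_OUT, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def adapter_param_count(settings: LoraSettings, shapes: dict, layers: int) -> int:
    """Total LoRA parameters when every target module in every layer is adapted."""
    per_layer = sum(lora_param_count(d_in, d_out, settings.r)
                    for d_in, d_out in shapes.values())
    return per_layer * layers


settings = LoraSettings()
total = adapter_param_count(settings, TARGET_SHAPES, LAYERS)
print(f"scaling = {settings.scaling}")
print(f"adapter parameters = {total:,}")
print(f"approx. size at 4 bytes per parameter = {total * 4 / 1e9:.2f} GB")
```

Under these assumptions the estimate lands close to the 0.2 GB reported for the model files, which would be consistent with adapter weights stored at 32-bit precision; the listing itself does not state the storage format.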

Best Alternatives to Gemma 3 1B It Reasoning Grpo Lora

Model | Context / RAM | Downloads, Likes
Openhermes 1B Olmo Sft Qlora | 0K / 0 GB | 70
Zephyr 1B Olmo Sft Qlora | 0K / 0 GB | 51
...iabolic6045 ELN AOC CAIN QLoRA | 0K / 0 GB | 50
Llama 3.2 1B Airoboros Merged | 0K / 0.1 GB | 70
...la Alpaca Orca Instruct V0.0.1 | 0K / 1.1 GB | 100
Peft Starcoder Lora Python | 0K / 0 GB | 140
Prem 1B Chat 32K | 0K / 1.4 GB | 91
Prem 1B 32K | 0K / 1.4 GB | 21
...ama 3 1M Context Gradient Lora | 0K / 3.5 GB | 012
Falcon Arxiv Long Summary 1B | 0K / 0.2 GB | 111

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20260328a