What are the hardware requirements for Reasoning Gym Chain Sum Qwen3 1 7B Grpo?

Reasoning Gym Chain Sum Qwen3 1 7B Grpo requires approximately 6.9 GB of VRAM and supports a context window of 40K tokens. Quantized variants may run on less VRAM; see the Quantized Models section on this page.

Who developed Reasoning Gym Chain Sum Qwen3 1 7B Grpo and how large is it?

Reasoning Gym Chain Sum Qwen3 1 7B Grpo is developed by ermiaazarkhalili, a model with 7b parameters. The model is published as open weights on Hugging Face and indexed on LLM Explorer with full benchmark history.

Where can I download or evaluate Reasoning Gym Chain Sum Qwen3 1 7B Grpo?

Reasoning Gym Chain Sum Qwen3 1 7B Grpo is hosted on Hugging Face and linked from this page. LLM Explorer also lists quantized variants and similar alternatives if available.

Reasoning Gym Chain Sum Qwen3 1 7B Grpo by ermiaazarkhalili — VRAM 6.9GB, 40K context

Name: Reasoning Gym Chain Sum Qwen3 1 7B Grpo
Author: ermiaazarkhalili

Reasoning Gym Chain Sum Qwen3 1 7B Grpo is an open-source language model by ermiaazarkhalili. Features: 7b LLM, VRAM: 6.9GB, Context: 40K, LLM Explorer Score: 0.27.

Arxiv:2402.03300 Base model:finetune:qwen/qwen3... Base model:qwen/qwen3-1.7b Conversational Endpoints compatible Generated from trainer Grpo Qwen3 Region:us Safetensors Trackio Trackio:https://huggingface.co... Trl

Model Card on HF 🤗: https://huggingface.co/ermiaazarkhalili/reasoning-gym-chain-sum-qwen3-1-7b-grpo

Reasoning Gym Chain Sum Qwen3 1 7B Grpo Benchmarks

LLME Score: 0.2688

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Reasoning Gym Chain Sum Qwen3 1 7B Grpo (ermiaazarkhalili/reasoning-gym-chain-sum-qwen3-1-7b-grpo)

🌟 Advertise your project 🚀

Reasoning Gym Chain Sum Qwen3 1 7B Grpo Parameters and Internals

LLM Name	Reasoning Gym Chain Sum Qwen3 1 7B Grpo
Repository 🤗	https://huggingface.co/ermiaazarkhalili/reasoning-gym-chain-sum-qwen3-1-7b-grpo
Model Name	reasoning-gym-chain-sum-qwen3-1-7b-grpo
Base Model(s)	Qwen/Qwen3-1.7B Qwen/Qwen3-1.7B
Model Size	7b
Required VRAM	6.9 GB
Updated	2026-06-30
Maintainer	ermiaazarkhalili
Model Type	qwen3
Model Files	6.9 GB 0.0 GB
Model Architecture	Qwen3ForCausalLM
Context Length	40960
Model Max Length	40960
Transformers Version	5.12.1
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|endoftext\|>
Vocabulary Size	151936
Errors	replace

Rank the Reasoning Gym Chain Sum Qwen3 1 7B Grpo Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 54677 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Check out Ag3ntum — our secure, self-hosted AI agent for server management.

Release v20260328a

Support LLM Explorer

Reasoning Gym Chain Sum Qwen3 1 7B Grpo by ermiaazarkhalili

» All LLMs » ermiaazarkhalili » Reasoning Gym Chain Sum Qwen3 1 7B Grpo URL Share it on

Reasoning Gym Chain Sum Qwen3 1 7B Grpo Benchmarks

Reasoning Gym Chain Sum Qwen3 1 7B Grpo Parameters and Internals

Rank the Reasoning Gym Chain Sum Qwen3 1 7B Grpo Capabilities

What open-source LLMs or SLMs are you in search of? 54677 in total.