Models
GitHub
Discord
Turbo
Sign in
Download
Models
Download
GitHub
Discord
Sign in
mychen76
/
qwen2.5-3b-think-r1
:latest
220
Downloads
Updated
6 months ago
A regular model convert into Reasoning/Think Model fine-tuned using DeepSeek GRPO algorithm without using distilled data from R1.
A regular model convert into Reasoning/Think Model fine-tuned using DeepSeek GRPO algorithm without using distilled data from R1.
Cancel
tools
qwen2.5-3b-think-r1:latest
...
/
params
41604d919ec8 · 32B
{
"min_p": 0.1,
"temperature": 1.5
}