Models
GitHub
Discord
Turbo
Sign in
Download
Models
Download
GitHub
Discord
Sign in
mychen76
/
qwen2.5-3b-think-r1
:latest
220
Downloads
Updated
6 months ago
A regular model convert into Reasoning/Think Model fine-tuned using DeepSeek GRPO algorithm without using distilled data from R1.
A regular model convert into Reasoning/Think Model fine-tuned using DeepSeek GRPO algorithm without using distilled data from R1.
Cancel
tools
qwen2.5-3b-think-r1:latest
...
/
system
886ec394c1a4 · 84B
Respond in the following format:
<thinking>
...
</thinking>
<answer>
...
</answer>