Models
GitHub
Discord
Turbo
Sign in
Download
Models
Download
GitHub
Discord
Sign in
mychen76
/
qwen2.5-3b-think-r1
219
Downloads
Updated
6 months ago
A regular model convert into Reasoning/Think Model fine-tuned using DeepSeek GRPO algorithm without using distilled data from R1.
A regular model convert into Reasoning/Think Model fine-tuned using DeepSeek GRPO algorithm without using distilled data from R1.
Cancel
tools
Name
2 models
Size
Context
Input
qwen2.5-3b-think-r1:latest
0937053a5fde
• 3.3GB • 32K context window •
Text input • 6 months ago
Text input • 6 months ago
qwen2.5-3b-think-r1:latest
3.3GB
32K
Text
0937053a5fde
· 6 months ago
qwen2.5-3b-think-r1:q8
latest
0937053a5fde
• 3.3GB • 32K context window •
Text input • 6 months ago
Text input • 6 months ago
qwen2.5-3b-think-r1:q8
latest
3.3GB
32K
Text
0937053a5fde
· 6 months ago