Tags · mychen76/qwen2.5-3b-think-r1

mychen76/ qwen2.5-3b-think-r1

268 Downloads Updated 1 year ago

A regular model convert into Reasoning/Think Model fine-tuned using DeepSeek GRPO algorithm without using distilled data from R1.

tools

Name

2 models

Size / Usage

Context

Input

qwen2.5-3b-think-r1:latest

0937053a5fde • 3.3GB • 32K context window • Text input • 1 year ago

Text input • 1 year ago

qwen2.5-3b-think-r1:latest

3.3GB

32K

Text

0937053a5fde · 1 year ago

qwen2.5-3b-think-r1:q8 latest

0937053a5fde • 3.3GB • 32K context window • Text input • 1 year ago

Text input • 1 year ago

qwen2.5-3b-think-r1:q8 latest

3.3GB

32K

Text

0937053a5fde · 1 year ago