qwen3-coder:30b-a3b-q8_0

319.4K 2 weeks ago

Alibaba's performant long context models for agentic and coding tasks.

30b 480b

1 month ago

bdecf39e3ee4 · 32GB

qwen3moe
·
30.5B
·
Q8_0
{{- if .System }}<|im_start|>system {{ .System }}<|im_end|> {{ end }} {{- range $i, $_ := .Messages
Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US
{ "repeat_penalty": 1.05, "stop": [ "<|im_start|>", "<|im_end|>", "<

Readme

Qwen 3 logo Qwen3-Coder is the most agentic code model to date in the Qwen series.

Overview

qwen3-coder:30b offers 30B total parameters with only 3.3B activated, delivering strong performance while maintaining efficiency.

  • Exceptional agentic capabilities for real-world software engineering tasks through advanced long-horizon reinforcement learning on SWE-Bench and similar benchmarks.
  • Long context support with 256K tokens natively and up to 1M tokens using extrapolation methods, optimized for repository-scale understanding.
  • Scaled pretraining on 7.5T tokens (70% code ratio) while preserving strong general and mathematical abilities.
  • Execution-driven reinforcement learning that significantly boosts code execution success rates across diverse real-world coding tasks.

image.png

Reference