Tools models · Ollama

nemotron3

NVIDIA Nemotron 3 Nano Omni is a multimodal large language model that unifies video, audio, image, and text understanding to support enterprise-grade Q&A, summarization, transcription, and document intelligence workflows.

vision tools thinking audio 33b

588.8K Pulls 4 Tags Updated 3 weeks ago

granite4.1

IBM Granite Models are a family of enterprise-ready, open foundation models that support multilingual capabilities, coding, retrieval-augmented generation (RAG), tool use, and structured JSON output. Released under Apache 2.0 license.

tools 3b 8b 30b

96.1K Pulls 48 Tags Updated 2 days ago

deepseek-v4-flash

DeepSeek-V4-Flash is a preview of the DeepSeek-V4 series, a Mixture-of-Experts model with 284B total parameters and 13B activated, built for efficient reasoning across a 1M-token context window.

tools thinking cloud

82.2K Pulls 1 Tag Updated 4 weeks ago

deepseek-v4-pro

DeepSeek-V4-Pro is a frontier Mixture-of-Experts model with a 1M-token context window and three reasoning modes.

tools thinking cloud

69.4K Pulls 1 Tag Updated 3 weeks ago

mistral-medium-3.5

Mistral Medium 3.5 is the first flagship model of Mistral AI that merged instruction-following, reasoning, and coding in a single set of 128B weights.

vision tools thinking 128b

24.7K Pulls 5 Tags Updated 2 weeks ago

gemma4

Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

vision tools thinking audio cloud e2b e4b 26b 31b

9.8M Pulls 34 Tags Updated 22 hours ago

qwen3.5

Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

12.2M Pulls 64 Tags Updated 22 hours ago

glm-5.1

GLM-5.1 is our next-generation flagship model for agentic engineering, with significantly stronger coding capabilities than its predecessor. It achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5 by a wide margin.

tools thinking cloud

2.1M Pulls 1 Tag Updated 1 month ago

qwen3.6

Qwen3.6 delivers substantial upgrades in agentic coding and thinking preservation than previous Qwen models.

vision tools thinking 27b 35b

1.5M Pulls 24 Tags Updated 22 hours ago

minimax-m2.7

MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

tools thinking cloud

2.1M Pulls 1 Tag Updated 2 months ago

nemotron-3-super

NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

tools thinking cloud 120b

2.3M Pulls 7 Tags Updated 2 months ago

glm-5

A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

tools thinking cloud

2.2M Pulls 1 Tag Updated 3 months ago

minimax-m2.5

MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

tools thinking cloud

2.1M Pulls 1 Tag Updated 3 months ago

lfm2

LFM2 is a family of hybrid models designed for on-device deployment. LFM2-24B-A2B is the largest model in the family, scaling the architecture to 24 billion parameters while keeping inference efficient.

tools 24b

1.1M Pulls 6 Tags Updated 2 months ago

qwen3-coder-next

Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

tools cloud

1.3M Pulls 4 Tags Updated 3 months ago

glm-4.7

Advancing the Coding Capability

tools thinking cloud

2.1M Pulls 1 Tag Updated 4 months ago

kimi-k2.6

Kimi K2.6 is an open-source, native multimodal agentic model that advances practical capabilities in long-horizon coding, coding-driven design, proactive autonomous execution, and swarm-based task orchestration.

vision tools thinking cloud

257.5K Pulls 1 Tag Updated 1 month ago

gemini-3-flash-preview

Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

vision tools thinking cloud

2.1M Pulls 2 Tags Updated 5 months ago

minimax-m2.1

Exceptional multilingual capabilities to elevate code engineering

tools cloud

2M Pulls 1 Tag Updated 5 months ago

deepseek-v3.2

DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.

tools thinking cloud

2M Pulls 1 Tag Updated 5 months ago