-
llama3.2
Meta's Llama 3.2 goes small with 1B and 3B models.
Tools 1B 3B271.9K Pulls 63 Tags Updated 7 days ago
-
llama3.1
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
Tools 8B 70B 405B5.8M Pulls 94 Tags Updated 2 weeks ago
-
gemma2
Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.
2B 9B 27B1.4M Pulls 94 Tags Updated 2 weeks ago
-
qwen2.5
Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.
Tools 0.5B 1.5B 3B 7B 14B 32B 72B649.3K Pulls 133 Tags Updated 2 weeks ago
-
phi3.5
A lightweight AI model with 3.8 billion parameters with performance overtaking similarly and larger sized models.
3B68.8K Pulls 17 Tags Updated 6 weeks ago
-
nemotron-mini
A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.
Tools13.3K Pulls 17 Tags Updated 13 days ago
-
mistral-small
Mistral Small is a lightweight model designed for cost-effective use in tasks like translation and summarization.
Tools 22B14.7K Pulls 17 Tags Updated 2 weeks ago
-
mistral-nemo
A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.
Tools 12B255.8K Pulls 17 Tags Updated 11 days ago
-
deepseek-coder-v2
An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.
Code 16B 236B329.7K Pulls 65 Tags Updated 3 months ago
-
mistral
The 7B model released by Mistral AI, updated to version 0.3.
Tools 7B3.6M Pulls 84 Tags Updated 4 months ago
-
mixtral
A set of Mixture of Experts (MoE) model with open weights by Mistral AI in 8x7b and 8x22b parameter sizes.
Tools 8x7B 8x22B434.4K Pulls 69 Tags Updated 5 months ago
-
codegemma
CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.
Code 2B 7B308.6K Pulls 85 Tags Updated 5 months ago
-
command-r
Command R is a Large Language Model optimized for conversational interaction and long context tasks.
Tools 35B218.6K Pulls 32 Tags Updated 4 weeks ago
-
command-r-plus
Command R+ is a powerful, scalable large language model purpose-built to excel at real-world enterprise use cases.
Tools 104B97.7K Pulls 21 Tags Updated 4 weeks ago
-
llava
🌋 LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.
Vision 7B 13B 34B1.3M Pulls 98 Tags Updated 8 months ago
-
llama3
Meta Llama 3: The most capable openly available LLM to date
8B 70B6.3M Pulls 68 Tags Updated 4 months ago
-
gemma
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1
2B 7B4.1M Pulls 102 Tags Updated 5 months ago
-
qwen
Qwen 1.5 is a series of large language models by Alibaba Cloud spanning from 0.5B to 110B parameters
0.5B 1.8B 4B 32B 72B 110B4M Pulls 379 Tags Updated 3 months ago
-
qwen2
Qwen2 is a new series of large language models from Alibaba group
Tools 0.5B 1.5B 7B 72B3.8M Pulls 97 Tags Updated 3 months ago
-
phi3
Phi-3 is a family of lightweight 3B (Mini) and 14B (Medium) state-of-the-art open models by Microsoft.
3B 14B2.5M Pulls 72 Tags Updated 4 months ago
-
llama2
Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.
7B 13B 70B2.2M Pulls 102 Tags Updated 7 months ago
-
codellama
A large language model that can use text prompts to generate and discuss code.
Code 7B 13B 34B 70B1.4M Pulls 199 Tags Updated 4 months ago
-
nomic-embed-text
A high-performing open embedding model with a large token context window.
Embedding600.8K Pulls 3 Tags Updated 7 months ago
-
dolphin-mixtral
Uncensored, 8x7b and 8x22b fine-tuned models based on the Mixtral mixture of experts models that excels at coding tasks. Created by Eric Hartford.
8x7B 8x22B402.8K Pulls 87 Tags Updated 5 months ago
-
mxbai-embed-large
State-of-the-art large embedding model from mixedbread.ai
Embedding375.4K Pulls 4 Tags Updated 6 months ago