403 Downloads · Updated 5 months ago
3 models

Name                            Size    Context    Input    Updated
Mistral-Small-3.1-24B:Q4_K_S    15GB    128K       Text     5 months ago
Mistral-Small-3.1-24B:Q5_K_M    18GB    128K       Text     5 months ago
Mistral-Small-3.1-24B:Q6_K      20GB    128K       Text     5 months ago

On an RTX 4090 with 24GB of VRAM, leaving 800MB to 1GB of VRAM free as a buffer, these context sizes fit:
Q6_K: 35K context
Q5_K_M: 64K context
Q4_K_S: 100K context
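
As a rough illustration of applying the context sizes above, the sketch below builds a request body for Ollama's `/api/generate` endpoint with `options.num_ctx` set per quantization. Several things here are assumptions and not stated on this page: that "K" means 1024 tokens, that the model tag needs no registry namespace prefix, and the helper name `build_generate_request`, which is hypothetical. It only constructs the payload and makes no network call.

```python
import json

# Context sizes taken from the RTX 4090 (24GB VRAM) notes above.
# Assumption: "K" is read as 1024 tokens; the page does not say.
CONTEXT_BY_QUANT = {
    "Q6_K": 35 * 1024,
    "Q5_K_M": 64 * 1024,
    "Q4_K_S": 100 * 1024,
}

# Assumption: the tag is used as listed; prepend a namespace if your registry needs one.
MODEL_BASE = "Mistral-Small-3.1-24B"


def build_generate_request(quant: str, prompt: str) -> dict:
    """Return a JSON-serializable body for POST /api/generate (hypothetical helper)."""
    if quant not in CONTEXT_BY_QUANT:
        raise ValueError(f"unknown quantization: {quant!r}")
    return {
        "model": f"{MODEL_BASE}:{quant}",
        "prompt": prompt,
        "stream": False,
        # num_ctx sets the context window the server allocates for this request.
        "options": {"num_ctx": CONTEXT_BY_QUANT[quant]},
    }


if __name__ == "__main__":
    print(json.dumps(build_generate_request("Q5_K_M", "Hello"), indent=2))
```

The resulting body could be sent to a local Ollama server (by default at `http://localhost:11434/api/generate`). On other GPUs, or with other applications using VRAM, the values in `CONTEXT_BY_QUANT` would likely need to be lowered.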