403 pulls · 5 months ago

Q6_K / Q5_K_M / Q4_K_S | mistral-small3.1:24b-instruct-2503

tools · 09467e913633 · 20GB · mistral3 · 24B · Q6_K
Default parameters: { "num_ctx": 4096 }


Extra quants for Mistral-Small-3.1-24B

Q6_K / Q5_K_M / Q4_K_S

These were quantized with the Ollama client, so the quants retain Vision support.
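For reference, quantizing through the Ollama client generally follows the pattern below. This is a minimal sketch, not the author's exact commands: the source path and output tag are illustrative, and `--quantize` accepts the quant types listed in this README (`q6_K`, `q5_K_M`, `q4_K_S`).

```shell
# Hypothetical Modelfile pointing at the full-precision weights
# (path is a placeholder, not from this README):
#
#   FROM /path/to/Mistral-Small-3.1-24B-Instruct-2503
#   PARAMETER num_ctx 4096

# Quantize while importing; repeat with q5_K_M / q4_K_S for the other tags.
ollama create --quantize q6_K mistral-small3.1-extra:q6_K -f Modelfile
```

Because the conversion goes through the Ollama client rather than a text-only GGUF pipeline, the vision projector is carried along with the quantized weights.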


On an RTX 4090 with 24GB of VRAM, with the Q8 KV cache enabled, and leaving 800MB–1GB of VRAM free as a buffer, the usable context per quant is:


Q6_K: 35K context

Q5_K_M: 64K context

Q4_K_S: 100K context
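The spread above makes sense as a VRAM trade-off: a smaller quant file leaves more of the 24GB for the KV cache, hence more context. A rough sketch of the cache-size arithmetic, assuming commonly reported geometry for this model family (40 layers, 8 KV heads, head dim 128 — these numbers are assumptions, not stated in this README) and 1 byte per element for a Q8 KV cache:

```python
def kv_cache_bytes(n_ctx, n_layers=40, n_kv_heads=8, head_dim=128, bytes_per_elem=1):
    """Estimate KV cache size: 2x (keys + values) per layer per KV head.

    Architecture defaults are assumptions about Mistral Small 3.1, not
    values taken from this README. bytes_per_elem=1 models a Q8 cache.
    """
    return 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem * n_ctx

# Approximate cache cost at each quant's listed context limit.
for ctx in (35_000, 64_000, 100_000):
    gib = kv_cache_bytes(ctx) / 2**30
    print(f"{ctx:>7} tokens -> ~{gib:.1f} GiB KV cache")
```

Under these assumptions the cache costs roughly 2.7 / 4.9 / 7.6 GiB at 35K / 64K / 100K tokens, which lines up with the Q6_K file (~20GB) only having room for the smallest context while Q4_K_S can afford the largest.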