
Storytelling mixture-of-experts model for consumer GPUs. Made by DavidAU (Hugging Face).

Tags: tools

Model: ecb74ebe7a60 · 13GB · llama · 24.9B · IQ4_XS
System: Write {{char}}'s next reply in this fictional roleplay with {{user}}.
Params: { "stop": [ "<|im_start|>", "<|im_end|>" ] }
Template: {{- if .Suffix }}<|fim_prefix|>{{ .Prompt }}<|fim_suffix|>{{ .Suffix }}<|fim_middle|> {{- else if .M
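
For anyone importing the GGUF into Ollama by hand rather than pulling this upload, the system prompt and stop tokens above translate into a Modelfile along these lines. This is a minimal sketch: the .gguf filename is a placeholder, and the template layer is left out (Ollama falls back to a default for the architecture).

    # Minimal Modelfile sketch; the .gguf filename is hypothetical
    FROM ./Dark-Planet-Rebel-Fury-25B.IQ4_XS.gguf
    # System prompt and stop tokens as listed in the model details above
    SYSTEM "Write {{char}}'s next reply in this fictional roleplay with {{user}}."
    PARAMETER stop "<|im_start|>"
    PARAMETER stop "<|im_end|>"

Building and running it locally would then be "ollama create <model-name> -f Modelfile" followed by "ollama run <model-name>".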

Readme

DARK PLANET REBEL FURY / I-MATRIX / 25B (4X8B) / I-QUANT

A more recent model from DavidAU’s “Dark Planet” line, and one the creator themselves favored among their similar MoE (mixture-of-experts) models. This model was uploaded for the speed MoEs provide relative to their size, and because I have had good experience with the Dark Planet series. If a model with more active parameters is preferred, a Mixtral model with 13 billion active parameters is also available. To pack as many parameters into as little VRAM as possible, weighted K-quants and I-quants will be listed.

Note that I-quants give up some token-generation speed relative to K-quants in exchange for storage efficiency. Either of the 4-bit quantizations is recommended for 16GB GPUs. These models were taken from GGUF files hosted on Hugging Face.
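
As a rough check on the 13GB figure, and on why it targets 16GB cards, assuming IQ4_XS averages about 4.25 bits per weight (its nominal rate in llama.cpp):

    24.9 \times 10^{9} \text{ weights} \times 4.25 \text{ bits/weight} \div 8 \text{ bits/byte} \approx 13.2 \text{ GB}

The KV cache and context buffers then sit on top of the weights, which accounts for most of the remaining headroom on a 16GB card.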

Original model (DavidAU):

GGUF weighted quantizations (mradermacher):

[No obligatory model picture. Ollama did not like it.]