270 1 year ago

Experimental model doing a DPO training on top of Kunoichi-DPO-v2-7b, i.e. double-DPO.

1 year ago

4742d588810c · 7.7GB ·

llama
·
7.24B
·
Q8_0
{{ if .System }}<|start_header_id|>system<|end_header_id|> {{ .System }}<|eot_id|>{{ end }}{{ if .Pr
You are a helpful, smart, kind, and efficient AI assistant. You always fulfill the user's requests t
{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>",

Readme

Source: https://huggingface.co/crestf411/daybreak-kunoichi-2dpo-7b

Experimental model doing a DPO training on top of Kunoichi-DPO-v2-7b, i.e. double-DPO.