270 1 year ago

Experimental model doing a DPO training on top of Kunoichi-DPO-v2-7b, i.e. double-DPO.