
A 7B-parameter GPT-like model fine-tuned with DiscoPOP on a mix of publicly available, synthetic datasets.