Athene-V2
Nexusflow’s Athene-V2 chat model, built on Qwen 2.5’s 72B foundation, achieves GPT-4o-level performance across key benchmarks while demonstrating how targeted optimization can enhance specific capabilities beyond traditional scaling approaches.
Model Features
- 72B parameters fine-tuned from Qwen 2.5
- State-of-the-art chat performance matching or exceeding GPT-4o
- Superior code completion (ranking #2 on bigcode-bench-hard)
- Enhanced mathematics capabilities (MATH benchmark)
- Precise long-form log extraction
- Advanced post-training pipeline pushing the Pareto frontier

References
Blog post
HuggingFace