Nexusflow has released Athene-Llama3-70B (Athene-70B), an open-weight chat model fine-tuned from Meta AI’s Llama-3-70B. Athene-70B achieved an Arena-Hard-Auto score of 77.8%, rivaling proprietary models such as GPT-4o and Claude-3.5-Sonnet. This is a significant improvement over its predecessor, Llama-3-70B-Instruct, which scored 46.6%. The gains come from Nexusflow’s targeted post-training pipeline, which is designed to improve specific model behaviors. Athene-70B is currently in public testing on the Chatbot Arena.
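To put the two reported scores in perspective, the short Python sketch below computes the gap between them. The only inputs are the two Arena-Hard-Auto figures quoted above; the helper name `score_gain` is hypothetical and used purely for illustration.

```python
# Arena-Hard-Auto scores as reported in the article, in percent.
ATHENE_70B = 77.8
LLAMA3_70B_INSTRUCT = 46.6


def score_gain(new: float, old: float) -> tuple[float, float]:
    """Return (absolute gain in percentage points, relative gain in percent).

    Hypothetical helper for illustration; not part of any Nexusflow tooling.
    """
    absolute = new - old
    relative = absolute / old * 100
    return absolute, relative


absolute, relative = score_gain(ATHENE_70B, LLAMA3_70B_INSTRUCT)
print(f"Absolute gain: {absolute:.1f} percentage points")
print(f"Relative gain: {relative:.1f}%")
```

The gap works out to about 31.2 percentage points, or roughly a two-thirds relative increase over the Llama-3-70B-Instruct score.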
To unlock the full potential of Llama-3-70B, Nexusflow developed internal benchmarks to evaluate LLM capabilities in instruction following, coding, creative writing, and multilingual tasks. Based on these evaluations, high-quality preference data was curated for targeted reinforcement learning from human feedback (RLHF). This pipeline produced significant performance improvements over Llama-3-70B-Instruct. The improvements span several key areas: accurate instruction following, mathematics and reasoning, comprehensive coding assistance, creative writing, and multilingual ability.
Athene-70B demonstrates Nexusflow’s ability to customize models to specific enterprise requirements through targeted post-training. Building on its earlier success with Starling-7B and NexusRaven-V2, Nexusflow aims to bring models up to enterprise-grade application standards. The company offers customized solutions to help enterprises gain an edge with GenAI copilot and agent technology. Nexusflow invites organizations to contact it for more information and for collaboration opportunities to explore how Athene-70B can enhance their AI initiatives.
In summary, Athene-Llama3-70B, an open-weight chat model developed by Nexusflow, shows significant improvements over earlier models and achieves performance competitive with proprietary models on the Arena-Hard-Auto benchmark. Nexusflow’s targeted post-training pipeline combines internal benchmarks with reinforcement learning from human feedback to improve the model’s capabilities in a variety of domains, including instruction following, mathematics and reasoning, coding, creative writing, and multilingual tasks. This progress demonstrates Nexusflow’s ability to build on its success so far and customize models to fit the needs of enterprises. The company positions itself as a provider of customized, enterprise-grade AI solutions and invites organizations to explore the potential of Athene-70B in their AI initiatives.
Check out the Model card. All credit for this research goes to the researchers of this project.
Asjad is an intern consultant at Marktechpost. He is pursuing a B.Tech in Mechanical Engineering at the Indian Institute of Technology, Kharagpur. Asjad is a machine learning and deep learning enthusiast who is always exploring applications of machine learning in healthcare.


