Nexusflow has launched Athene-Llama3-70B, an open-weight chat mannequin fine-tuned from Meta AI’s Llama-3-70B. Athene-70B has achieved an Enviornment-Exhausting-Auto rating of 77.8%, rivaling proprietary fashions like GPT-4o and Claude-3.5-Sonnet. This marks a major enchancment from its predecessor, Llama-3-70B-Instruct, which scored 46.6%. The enhancement stems from Nexusflow’s focused post-training pipeline, designed to enhance particular mannequin behaviors. Athene-70B is presently present process public testing on Chatbot Enviornment.
To maximise Llama-3-70B’s potential, Nexusflow developed inner benchmarks evaluating LLM capabilities in instruction following, coding, artistic writing, and multilingual duties. Primarily based on these evaluations, high-quality desire information was curated for focused Reinforcement Studying from Human Suggestions (RLHF). This pipeline resulted in substantial efficiency enhancements in comparison with Llama-3-70B-Instruct. The enhancements span key points equivalent to exact instruction following, math and reasoning, complete coding help, impressed artistic writing, and multilingual mastery.
Athene-70B demonstrates Nexusflow’s functionality to customise fashions for particular enterprise necessities by focused post-training. Constructing on earlier successes with Starling-7B and NexusRaven-V2, Nexusflow goals to advance its fashions to satisfy enterprise-grade utility requirements. The corporate presents tailor-made options to assist companies excel in GenAI copilot and agent applied sciences. Nexusflow invitations organizations to discover how Athene-70B can improve their AI initiatives by contacting them for additional data and collaboration alternatives.
Athene-Llama3-70B, an open-weights chat mannequin developed by Nexusflow, demonstrates important enhancements over its predecessor. The mannequin achieves aggressive efficiency in comparison with proprietary fashions within the Enviornment-Exhausting-Auto benchmark. Nexusflow’s focused post-training pipeline, using inner benchmarks and Reinforcement Studying from Human Suggestions, has enhanced the mannequin’s capabilities throughout numerous domains, together with instruction following, math and reasoning, coding, artistic writing, and multilingual duties. This development showcases Nexusflow’s skill to tailor fashions for enterprise wants, constructing on their earlier successes. The corporate positions itself as a supplier of personalized enterprise-grade AI options, inviting organizations to discover the potential of Athene-70B for his or her AI initiatives.
Take a look at the Mannequin Card. All credit score for this analysis goes to the researchers of this venture. Additionally, don’t neglect to comply with us on Twitter and be part of our Telegram Channel and LinkedIn Group. In the event you like our work, you’ll love our e-newsletter..
Don’t Overlook to affix our 46k+ ML SubReddit
Discover Upcoming AI Webinars right here