In language models and artificial intelligence, users often face challenges when training and using models for various tasks. The need for a versatile, high-performance model that can understand and generate content across different domains is evident. Existing solutions offer some level of performance, but they fall short of next-generation results and adaptability. What is missing is an advanced language model that excels at understanding and generating content across many tasks. While other models are available, existing options only partially meet the criteria of cutting-edge performance and versatility.
NousResearch has just launched Nous-Hermes-2-Mixtral-8x7B, released in two versions: an SFT version and a DPO version. Nous Hermes 2 Mixtral 8x7B DPO aims to address these challenges by offering a state-of-the-art solution. This model, trained on a vast dataset primarily comprising data generated by GPT-4 and supplemented with high-quality information from open AI datasets, exhibits exceptional performance on a variety of tasks. It introduces a new SFT + DPO version and, for those who prefer a different approach, an SFT-only version is also available.
The Nous Hermes 2 Mixtral 8x7B SFT is a specialized version of Nous Research's latest model, trained exclusively with supervised fine-tuning. It is built on the Mixtral 8x7B MoE LLM architecture. The model has been trained on over a million examples, predominantly generated by GPT-4, along with other high-quality data from various open datasets in the AI field. It demonstrates exceptional performance across a wide range of tasks, setting new industry benchmarks.
The Nous-Hermes-2-Mixtral-8x7B model has been benchmarked on GPT4All, AGIEval, and BigBench tasks. The results demonstrate significant improvements over the base Mixtral model, even outperforming MistralAI's flagship Mixtral finetune. The average scores on these benchmarks are an impressive 75.70 on GPT4All, 46.05 on AGIEval, and 49.70 on BigBench.
The adoption of ChatML as the prompt format allows for more structured and engaging interaction with the model, particularly in multi-turn chat dialogues. System prompts enable steerability, giving users a nuanced way to guide the model's responses with roles, rules, and stylistic choices. This format, which is compatible with OpenAI's API endpoints, improves the user experience and makes the model more accessible.
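As a minimal sketch, the multi-turn ChatML structure described above can be assembled like this. The `<|im_start|>`/`<|im_end|>` tags follow the ChatML convention; the helper function name and the example system prompt and messages here are hypothetical, not taken from the model's documentation:

```python
def build_chatml_prompt(system_prompt, turns):
    """Assemble a multi-turn ChatML prompt string.

    `turns` is a list of (role, content) pairs, e.g. ("user", "...").
    A trailing open assistant tag cues the model to generate its reply.
    """
    # The system prompt comes first and sets roles, rules, and style.
    parts = [f"<|im_start|>system\n{system_prompt}<|im_end|>"]
    for role, content in turns:
        parts.append(f"<|im_start|>{role}\n{content}<|im_end|>")
    # Leave the assistant turn open for the model to complete.
    parts.append("<|im_start|>assistant\n")
    return "\n".join(parts)

prompt = build_chatml_prompt(
    "You are Hermes 2, a helpful assistant.",
    [("user", "Explain mixture-of-experts in one sentence.")],
)
print(prompt)
```

The resulting string would then be tokenized and passed to the model; each prior assistant reply is appended as another `("assistant", ...)` turn to continue a multi-turn dialogue.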
In conclusion, Nous Hermes 2 Mixtral 8x7B DPO is a powerful answer to the challenges of training and using language models. Its comprehensive training data, innovative architecture, and impressive benchmark results make it a versatile, high-performing model. With a focus on user interaction through ChatML and a commitment to surpassing existing benchmarks, this model stands out as an advanced and effective tool in artificial intelligence.
Niharika is a Technical Consulting Intern at Marktechpost. She is a third-year student pursuing her B.Tech degree at the Indian Institute of Technology (IIT), Kharagpur. She is a very enthusiastic person with a keen interest in machine learning, data science, and artificial intelligence, and an avid reader of the latest developments in these fields.