FreeWilly1 and its successor, FreeWilly2, are powerful new open-source Large Language Models (LLMs) developed by the CarperAI team at Stability AI. Both models show strong reasoning ability across a range of benchmarks. FreeWilly1 is built on the original LLaMA 65B base model and was trained with supervised fine-tuning (SFT) on data in the standard Alpaca format. FreeWilly2 is built on the LLaMA 2 70B base model and achieves performance on par with GPT-3.5 on some tasks.
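To make the Alpaca format concrete, here is a minimal sketch of how a single SFT training example might be rendered as text. The template follows the one published with Stanford Alpaca; the exact template used for FreeWilly1 is not given in this article, so treat this as an assumption. The function name `format_alpaca` and the sample record are hypothetical.

```python
# Sketch of Alpaca-style prompt formatting for supervised fine-tuning.
# Assumption: the template matches the Stanford Alpaca release; the article
# does not show the exact template used for FreeWilly1.

PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)

PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def format_alpaca(instruction: str, output: str, input_text: str = "") -> str:
    """Render one training example as prompt + target response (hypothetical helper)."""
    if input_text:
        prompt = PROMPT_WITH_INPUT.format(instruction=instruction, input=input_text)
    else:
        prompt = PROMPT_NO_INPUT.format(instruction=instruction)
    # During SFT the model learns to continue the prompt with the response.
    return prompt + output


# Hypothetical sample record, for illustration only.
example = format_alpaca(
    instruction="Summarize the text in one sentence.",
    input_text="FreeWilly2 is built on the LLaMA 2 70B base model.",
    output="FreeWilly2 is a fine-tuned LLaMA 2 70B model.",
)
print(example)
```

In practice the loss is often computed only on the response tokens, but that is a training detail the article does not specify.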
The training of the FreeWilly models was heavily influenced by Microsoft’s approach, described in the paper “Orca: Progressive Learning from Complex Explanation Traces of GPT-4”. The team prompted language models with high-quality instructions to generate its own variant of the dataset, which contains 600,000 data points (about 10% of the size of the dataset used in the original Orca work).
Using this method, the researchers generated 500,000 examples with a simpler LLM and an additional 100,000 with a more sophisticated LLM. They carefully filtered these datasets, removing examples that originated from evaluation benchmarks to ensure valid comparisons. The strong performance of the FreeWilly models across multiple benchmarks, despite training on only one-tenth of the sample size used in the original Orca paper, validates this approach to synthetically generated datasets.
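The benchmark filtering step described above can be illustrated with a small sketch. The article does not say how the researchers detected overlap, so the method here (exact matching on normalized text) is an assumption chosen for simplicity, and the names `normalize` and `decontaminate` and the `"instruction"` field are hypothetical.

```python
import re

# Sketch of benchmark decontamination for a synthetic training set.
# Assumption: overlap is detected by exact match after light normalization.
# The article does not describe the researchers' actual filtering method.


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return " ".join(text.split())


def decontaminate(train_examples, benchmark_prompts):
    """Drop training examples whose instruction matches a benchmark prompt."""
    blocked = {normalize(p) for p in benchmark_prompts}
    return [
        ex for ex in train_examples
        if normalize(ex["instruction"]) not in blocked
    ]


# Hypothetical data, for illustration only.
train = [
    {"instruction": "What is 2 + 2?"},
    {"instruction": "Name a prime number."},
]
benchmark = ["what is 2+2"]

clean = decontaminate(train, benchmark)
print(len(train) - len(clean), "example(s) removed")
```

Exact matching misses paraphrased or partially overlapping items; n-gram overlap checks are a common stricter alternative.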
The researchers evaluated the models with EleutherAI’s lm-eval-harness, to which they added AGIEval. The findings show that both FreeWilly models excel at solving difficult problems in specialized disciplines such as law and mathematics, engaging in complex reasoning, and recognizing nuances of language.
The team believes that the two models advance natural language understanding and open up possibilities that were previously out of reach. They look forward to seeing the innovative uses of these models across artificial intelligence.
Check out the reference article and the project pages for FreeWilly1 and its successor, FreeWilly2. All credit for this research goes to the researchers of this project.
Dhanshree Shenwai is a computer engineer with experience at FinTech companies across the finance, cards and payments, and banking domains, and a strong interest in AI applications. She is enthusiastic about exploring new technologies and advancements that make everyone’s life easier in today’s changing world.