Stability AI Team Introduces FreeWilly1 and FreeWilly2: New Open Access Large Language Models (LLMs)

FreeWilly1 and its successor, FreeWilly2, are powerful new open source Large Language Models (LLMs) developed by the CarperAI team at Stability AI. Both models perform exceptionally well in reasoning skills using many different metrics. Supervised fine tuning (SFT) in the industry standard Alpaca format was used to fit the FreeWilly1 model, built on the original LLaMA 65B base model. FreeWilly2 uses the base model LLaMA 2 70B to achieve performance on par with GPT-3.5 in some tasks.

The training of FreeWilly models was heavily influenced by Microsoft’s innovative approach, described in the article “Orca: Progressive Learning from GPT-4 Complex Explanation Traces”. The team drove high-quality instruction language models to generate our copy of the dataset, which contains 600,000 data points (about 10% of the size of the dataset used in Orca’s original work).

Using this method, the researchers generated 500,000 cases with a less complex LLM model and an additional 100,000 with a more complex LLM model. They scrutinized these data sets, removing cases that originated from the evaluation benchmarks to ensure valid comparisons. His approach to synthetically generated data sets is validated by FreeWilly models performing exceptionally well across multiple benchmarks despite training at only one-tenth of the sample size used in the original Orca paper.

🚀 Create high-quality training data sets with Kili technology and solve NLP machine learning challenges to develop powerful machine learning applications

The researchers used EleutherAI’s lm-eval-harness, to which they added AGIEval, to perform evaluations of these models. The findings show that both FreeWilly models are top-notch when solving difficult problems in specialized disciplines such as law and mathematics, engaging in complex reasoning, and recognizing nuances of language.

The team believes that the two models improve our ability to understand spoken language and open up possibilities that were previously impossible. They look forward to seeing all the innovative uses of these models in artificial intelligence.

review the Reference article and project page for FreeWilly1 and his successor FreeWilly2. All credit for this research goes to the researchers of this project. Also, don’t forget to join our 26k+ ML SubReddit, discord channel, and electronic newsletterwhere we share the latest AI research news, exciting AI projects, and more.

🚀 Check out over 900 AI tools at AI Tools Club

Dhanshree Shenwai is a Computer Engineer and has good experience in FinTech companies covering Finance, Cards & Payments and Banking domain with strong interest in AI applications. She is enthusiastic about exploring new technologies and advancements in today’s changing world, making everyone’s life easier.

🔥 Get a competitive edge with data – actionable market intelligence for global brands, retailers, analysts and investors. (Sponsored)

Stability AI Team Introduces FreeWilly1 and FreeWilly2: New Open Access Large Language Models (LLMs)

Technical Terrence Team

UCLA Researchers Propose PhyCV: A Physics-Inspired Machine Vision Python Library

Leave a Reply Cancel reply

Recommended.

After Mocking Pricing Model, Crypto Advocates Discuss Reinstating Bitcoin’s Rainbow Chart – Bitcoin Spotlight

What is Storybird and How Does It Work?

Billionaire ‘King of Bonds’ Jeffrey Gundlach Warns of ‘Painful Results’ in Coming Downturn Cryptocurrencies and ICOs

I'd aim for a million by buying less than a dozen shares

Meme Moguls (MGLS) Launches With Exclusive P2E Meme Sharing Game, Set to Rival Established Memes

Categories

Important Links

Stability AI Team Introduces FreeWilly1 and FreeWilly2: New Open Access Large Language Models (LLMs)

Related

Technical Terrence Team

UCLA Researchers Propose PhyCV: A Physics-Inspired Machine Vision Python Library

Leave a Reply Cancel reply

Recommended.

After Mocking Pricing Model, Crypto Advocates Discuss Reinstating Bitcoin’s Rainbow Chart – Bitcoin Spotlight

What is Storybird and How Does It Work?

Billionaire ‘King of Bonds’ Jeffrey Gundlach Warns of ‘Painful Results’ in Coming Downturn Cryptocurrencies and ICOs

I'd aim for a million by buying less than a dozen shares

Meme Moguls (MGLS) Launches With Exclusive P2E Meme Sharing Game, Set to Rival Established Memes

Categories

Important Links

Get daily news updates to your inbox!