Stability AI and CarperAI Lab Introduce Open-Source LLM FreeWilly with Enhanced Reasoning Capabilities

Stability AI and its CarperAI lab have unveiled two new open-source Large Language Models (LLMs) named FreeWilly1 and FreeWilly2. These models stand out in the field of LLMs due to their enhanced reasoning capabilities.

FreeWilly1 is constructed on the LLaMA 65B model and has undergone fine-tuning with a synthetically generated dataset. FreeWilly2 is built on the LLaMA 2 70B model and exhibits performance comparable to GPT-3.5 for certain tasks. The training methodologies for these models were influenced by Microsoft’s research, as detailed in their paper titled “Orca: Progressive Learning from Complex Explanation Traces of GPT-4.” Stability AI’s approach involved prompting language models with high-quality instructions to create a dataset containing 600,000 data points. This dataset size is approximately 10% of what was used in the original Orca research. Despite this reduced dataset size, the FreeWilly models have shown exceptional performance across various benchmarks.

The data generation process involved creating 500,000 examples with a simpler LLM and an additional 100,000 examples with a more sophisticated LLM. To ensure valid comparisons, the datasets were carefully screened to remove examples that originated from evaluation benchmarks. The effectiveness of this synthetically generated dataset is evident in the FreeWilly models’ performance, even though they were trained on a dataset only a tenth the size of the one used in the original Orca paper.
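The screening step described above can be illustrated with a small sketch. The article does not say how Stability AI performed the filtering, so the following is only an assumed approach: it drops any training example whose prompt matches a benchmark prompt after simple normalization. The function names, the `prompt` field, and the toy data are all hypothetical.

```python
import re


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    text = re.sub(r"[^a-z0-9\s]", " ", text.lower())
    return " ".join(text.split())


def decontaminate(examples, benchmark_prompts):
    """Keep only examples whose normalized prompt is not a benchmark prompt.

    Assumption: exact match after normalization. A real pipeline might
    use n-gram overlap or fuzzy matching instead.
    """
    blocked = {normalize(p) for p in benchmark_prompts}
    return [ex for ex in examples if normalize(ex["prompt"]) not in blocked]


# Hypothetical toy data, not taken from any real benchmark or dataset.
benchmark_prompts = ["What is 2 + 2?"]
examples = [
    {"prompt": "what is 2 + 2 ?", "response": "4"},
    {"prompt": "Explain why the sky is blue.", "response": "..."},
]

clean = decontaminate(examples, benchmark_prompts)
print(len(clean))  # 1: the near-duplicate of the benchmark prompt is dropped
```

Exact matching after normalization is cheap but only catches near-verbatim leaks; paraphrased benchmark items would pass through this filter.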

For evaluation, the researchers used EleutherAI’s lm-eval-harness, supplemented with AGIEval. The findings indicate that both FreeWilly models excel at challenging problems in specialized fields such as law and mathematics. They also demonstrate intricate reasoning and a keen grasp of linguistic nuance. The CarperAI team is optimistic about the potential of these models to advance natural language understanding and is eager to see their innovative applications in artificial intelligence.

For a comprehensive understanding of FreeWilly1 and FreeWilly2, the Reference Article and Project Page provide detailed insights.

LLaMa-2: A New Era in Public Domain Language Models

LLaMa-2 stands as one of the leading openly available language models today, paving the way for the continued evolution and deployment of Large Language Models (LLMs) across various products. Its predecessor, LLaMa-1, laid the foundation by inspiring numerous impactful projects. With the introduction of LLaMa-2, the prospects for use in diverse applications are even greater, especially because its license permits free commercial use.

In a recent interview with the BBC, Nick Clegg, Meta’s president of global affairs, discussed the decision to release LLMs as open source. According to Clegg, such a move makes these models safer, primarily because it allows outside researchers to study and analyze them in depth.

Some key observations from Clegg include:

Meta’s commitment to transparency and contribution to the broader community is evident in their decade-long track record. Over the last ten years, the company has made available over 1000 models, libraries, and datasets for public use. Prominent releases include React, PyTorch, and the more recent ‘Segment Anything’ model.

Source: mPost
