Microsoft unveils 2.7B parameter language model Phi-2

Microsoft’s 2.7 billion-parameter model Phi-2 showcases outstanding reasoning and language understanding capabilities, setting a new standard for performance among base language models with less than 13 billion parameters.

Phi-2 builds upon the success of its predecessors, Phi-1 and Phi-1.5, by matching or surpassing models up to 25 times larger—thanks to innovations in model scaling and training data curation.

The compact size of Phi-2 makes it an ideal playground for researchers, facilitating exploration in mechanistic interpretability, safety improvements, and fine-tuning experimentation across various tasks.

Phi-2’s achievements are underpinned by two key aspects:

Performance evaluation

Phi-2 has undergone rigorous evaluation across various benchmarks, including Big Bench Hard, commonsense reasoning, language understanding, math, and coding.

With only 2.7 billion parameters, Phi-2 outperforms larger models – including Mistral and Llama-2 – and matches or outperforms Google’s recently-announced Gemini Nano 2:

Beyond benchmarks, Phi-2 showcases its capabilities in real-world scenarios. Tests involving prompts commonly used in the research community reveal Phi-2’s prowess in solving physics problems and correcting student mistakes, showcasing its versatility beyond standard evaluations:

Phi-2 is a Transformer-based model with a next-word prediction objective, trained on 1.4 trillion tokens from synthetic and web datasets. The training process – conducted on 96 A100 GPUs over 14 days – focuses on maintaining a high level of safety and claims to surpass open-source models in terms of toxicity and bias.

With the announcement of Phi-2, Microsoft continues to push the boundaries of what smaller base language models can achieve.

(Image Credit: Microsoft)

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with Digital Transformation Week.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.

Source: artificialintelligence-news

This Week in Crypto Games: Dr. Disrespect Dumped, Pixelverse and Catizen Tokens, Notcoin ‘Fresh Start’

Biggest Video Games Releasing in July 2024

Checkmate? Using AI to Build a Better, More Creative Chess Foe

Breachers hands-on: A top-notch tactical VR shooter in the style of Rainbow Six Siege

AI Featured Posts

Google’s AI-empowered search feature goes global with expansion to 120 countries

Enhance Your Presence Online With This AI Webcam App for $49.99

Flipkart Ventures to Invest $500,000 In Algomage

ChatGPT maker OpenAI launches GPT Store and a subscription tier for teams

Metaverse Featured Posts

Google’s AR Efforts Stumble as Vice President of Engineering Departs

Reddam House School in England Pioneers Metaverse Education with VR

Meta’s Flamera headset prototype fights distortion with “bug eyes”

Apple’s Vision Pro gets tortured with extreme drop test

NFTs Featured Posts

Futureverse Wants to Democratize High Music Quality With JEN 1

Fractionalized Ownership Will Drive the Next Wave of NFTs

Polygon-based Y00ts NFTs to migrate to Ethereum, return $3 million grant

Existing laws sufficient to handle NFT copyright concerns, say US authorities

Let's Get Social

Microsoft unveils 2.7B parameter language model Phi-2

Performance evaluation

4 Ways This SEO Expert Uses AI to Create Content — and How You Can, Too

Trump Cuts Up His Suit in New NFT Card Collection

Leave a Reply Cancel reply

This Week in Crypto Games: Dr. Disrespect Dumped, Pixelverse and Catizen Tokens, Notcoin ‘Fresh Start’

Biggest Video Games Releasing in July 2024

Checkmate? Using AI to Build a Better, More Creative Chess Foe

Breachers hands-on: A top-notch tactical VR shooter in the style of Rainbow Six Siege

Frame gets smarter: Brilliant Labs pushes its AI smart glasses with new features

AI Featured Posts

Metaverse Featured Posts

NFTs Featured Posts

Let's Get Social

Microsoft unveils 2.7B parameter language model Phi-2

Performance evaluation

Share this article

4 Ways This SEO Expert Uses AI to Create Content — and How You Can, Too

Trump Cuts Up His Suit in New NFT Card Collection

Leave a Reply Cancel reply

Read next