Microsoft’s New Love

AIM Events

Hosting the World’s Most Impactful AI Conferences & Events. For Brand collaborations write to info@aimmediahouse.com

Published Nov 17, 2023

First came large language models (LLMs) that led to the formation of ChatGPT, built on GPT-3.5 and trained on 175B parameters. Though effective, LLMs are expensive to train, run and can be challenging to customise for specific tasks. Enter small language models (SLMs), which are more efficient to train, deploy and also more accurate. Additionally, they can also run on local infrastructure without resorting to GPU-rich third parties.

Realising the potential of SLMs, enterprises are rushing to develop new small language models.

At Ignite 2023, tech giant Microsoft released the Phi small language model series, termed Phi-2. Phi-2 boasts an impressive 2.7 billion parameters and showcases top-tier performance across benchmark criteria, excelling in areas like common sense, language comprehension, and logical reasoning.

In a blog post, Microsoft said that with the right fine-tuning and customisation, these SLMs can be incredibly powerful tools for applications both on cloud and on the edge. Phi-2 is also available to enterprises in the Azure AI catalogue.

Another interesting fact is that Microsoft claims Phi-2 is open source, which makes it a direct competitor to the LLaMA series of models. Earlier this year, Microsoft claimed that Phi-1.5, which has 1.3 billion parameters, outperformed LLaMA 2’s 7-billion parameters model on several benchmarks.

Recently, Microsoft has been extremely active in the open source space. In June, the tech giant released Orca, an open source AI model designed to learn by emulating the reasoning of larger AI models like GPT-4. The model has 13 billion parameters and is smaller than large models like GPT-4 or GPT-3.5, but it is tailored for specific use cases.

While the discussion about open source continues, experts have pointed out that Microsoft’s Phi-2 is not open source in the real sense as the licence revealed that the model is for ‘research purposes only’ for now.

For Microsoft to replicate LLaMA's success with Phi-2, it must consider making the model accessible for commercial use.

Read the full story here.

Lentra Equips Banks with AI

Since the advancement of AI, the digital lending industry in India has been trying to make the most of it. It is adopting ML models that understand data and decipher the best outcome from it. Lentra AI, a Bangalore-based platform, has been empowering major banks, including HDFC, Standard Chartered, Federal Bank, and many more in India.

Microsoft’s New Love

AIM Events

Hosting the World’s Most Impactful AI Conferences & Events. For Brand collaborations write to info@aimmediahouse.com

Recommended by LinkedIn

Sector 6

6,744 followers

More articles by this author

Insights from the community

Others also viewed

⚙️ 3 Ways to Efficient AI

LLM Pulse - September 16, 2024

The Significance of Human Input in Generative AI

SLM and LLM... My Top 10 in July 2024

Meta Llama 3.1 vs. GPT-4: An Open-Source Contender in the AI Arena

Insider's Edit: OpenAI's Tips for Writing Better Prompts

The Future of Artificial Intelligence: Navigating Small and Large Language Models

Major Changes in Large Language Models (LLMs) You Need to Know in 2024

How Gemini Pro 1.5 Predicts Your Next Move

Unlabeled Data: The Secret Behind Large Language Models

Explore topics

Recommended by LinkedIn

Sector 6

6,744 followers

Indian Founders’ New Found Obsession

Jan 5, 2025

Bengaluru vs the Rest of India

Dec 26, 2024

Indian IT’s Love-Hate Affair with GCCs

Dec 24, 2024

Roti, Kapda, aur ChatGPT

Dec 19, 2024

Why is Indian IT Obsessing Over AI Agents?

Dec 17, 2024

Good News for IT Employees

Dec 12, 2024

Indian IT GenAI Hangover Begins

Oct 27, 2024

Yann LeCun ♥ India | Jensen Huang♥ India

Oct 25, 2024

Indian IT ♥️ GCCs

Oct 22, 2024

Building AI for India? Or Bharat?

Oct 9, 2024