Why Groq Loves Mixture of Experts Models

Groq's LPUs thrive on MoE inference while GPUs struggle with memory bottlenecks.
Mixture-of-Experts (MoE) architectures power most of today’s frontier AI models, or at least the ones whose internals are publicly known thanks to their open weights. These include models from DeepSeek, Moonshot AI’s Kimi, and OpenAI’s recently announced gpt-oss series.

For context, an MoE architecture activates only a small subset of its parameters for each token while retaining a very large total parameter count (a minimal sketch of this routing pattern follows below). For a company like Groq, which has built its entire business around inference, MoE models are a perfect match for its LPU (Language Processing Unit) chips, according to CEO Jonathan Ross. Groq’s LPUs are hardware systems designed specifically for AI inference, and they outperform traditional GPU systems in output speed.

Ross was in Benga
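To make the “subset of parameters per token” idea concrete, here is a minimal, illustrative sketch of top-k expert routing in Python with numpy. The sizes, the expert count, and the moe_layer function are invented for illustration; they do not describe Groq’s hardware, any named model, or any specific library’s API.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes only; not taken from any real model or from Groq hardware.
d_model, n_experts, top_k = 64, 8, 2

# One small feed-forward "expert" per slot; only top_k of them run for a given token.
W_in = rng.standard_normal((n_experts, d_model, 4 * d_model)) * 0.02
W_out = rng.standard_normal((n_experts, 4 * d_model, d_model)) * 0.02
W_router = rng.standard_normal((d_model, n_experts)) * 0.02

def moe_layer(x):
    """Route one token vector x through its top_k experts and mix their outputs."""
    logits = x @ W_router                      # router score for each expert
    chosen = np.argsort(logits)[-top_k:]       # indices of the top_k experts
    weights = np.exp(logits[chosen] - logits[chosen].max())
    weights /= weights.sum()                   # softmax over the chosen experts only

    out = np.zeros_like(x)
    for w, e in zip(weights, chosen):
        hidden = np.maximum(x @ W_in[e], 0.0)  # expert MLP with ReLU
        out += w * (hidden @ W_out[e])
    return out, chosen

token = rng.standard_normal(d_model)
y, chosen = moe_layer(token)
print(f"experts used for this token: {chosen}, "
      f"~{top_k / n_experts:.0%} of expert parameters active")
```

The point of the sketch is the ratio in the last line: only a fraction of the expert parameters do any arithmetic per token, yet all expert weights still have to sit in fast memory and be reachable at low latency, which is the memory-access pattern behind the GPU bottleneck mentioned above.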