Published on February 20, 2024
In AI Features

Project Vaani to English Gyani, This IISc Professor is Going Places

Professor Prasanta Kumar Ghosh from IISc Bangalore has figured out a unique way to collect speech data in different languages and dialects.

Image by Nikhil Kumar

By Mohit Pandey

Professor Prasanta Kumar Ghosh from IISc Bangalore has figured out a unique way to collect speech data in different languages and dialects. In the first phase, his team travelled to 80 districts, showed local people a picture, asked them to describe it and recorded their responses, which has given the Google-funded Project Vaani around 16,000 hours of speech data. The team is open-sourcing the Vaani corpus and is manually transcribing around 10% of the data. The aim is to collect 150,000 hours of speech data from 773 districts of India. “India is not one language and not even several dialects, it's a continuum of languages, which requires a lot of research and development,” said Ghosh in an exclusive interview with AIM. Apart from this, Ghosh and his research team are also working on RESPIN


Mohit writes about AI in simple, explainable, and often funny words. He's especially passionate about chatting with those building AI for Bharat, with the occasional detour into AGI.
