IIT Gandhinagar Channels Rivers Ganga, Yamuna into LLMs

IIT Gandhinagar develops Hindi LLM 'Ganga' from scratch for just INR 10 lakh
Image by Nikhil Kumar
IIT Gandhinagar recently released Ganga-1B, a pre-trained large language model (LLM) for Hindi as part of its Unity project. Built from scratch using the largest curated Hindi dataset, Ganga-1B outperforms all open-source LLMs supporting Hindi, up to 7B in size.  https://twitter.com/mayank_iitgn/status/1808450615208804750 The Unity project is building Indic LLMs and is looking to release the largest-ever curated datasets and SOTA models for other Indian languages. It’s a part of Lingo, the research group from IIT Gandhinagar, that engages in various activities, projects, and collaborations to advance the fields of natural language processing (NLP) and AI.  “We are neither a non-profit organisation nor a for-profit organisation. We are a small research group at IIT
Subscribe or log in to Continue Reading

Uncompromising innovation. Timeless influence. Your support powers the future of independent tech journalism.

Already have an account? Sign In.

📣 Want to advertise in AIM? Book here

Picture of Siddharth Jindal
Siddharth Jindal
Siddharth is a media graduate who loves to explore tech through journalism and putting forward ideas worth pondering about in the era of artificial intelligence.
Related Posts
AIM Print and TV
Don’t Miss the Next Big Shift in AI.
Get one year subscription for ₹5999
Download the easiest way to
stay informed