Sign In

Published on September 2, 2025
In Global Tech

Why Everyone is Rushing to Build Reinforcement Learning Environments

“RL environment specifications are among the most consequential things we can write as AI researchers.”

Image by Nalini Nirad

By Siddharth Jindal

Building reinforcement learning (RL) environments is quickly emerging as the next big thing in AI. OpenAI co-founder Andrej Karpathy recently noted in his post on X that the evolution of AI training can be broken down into three distinct eras—pretraining, supervised finetuning and, now, reinforcement learning environments. “In the era of pretraining, what mattered was internet text,” Karpathy explained. The priority then was to gather a large, diverse and high-quality collection of online documents to train models. With supervised finetuning, the focus shifted to conversations. “Contract workers were hired to create answers for questions, a bit like what you’d see on Stack Overflow or Quora, but geared towards LLM use cases,” he said. According to Karpathy, neither of

Subscribe or log in to Continue Reading

Uncompromising innovation. Timeless influence. Your support powers the future of independent tech journalism.

Already have an account? Sign In.

📣 Want to advertise in AIM? Book here

Siddharth Jindal

Siddharth is a media graduate who loves to explore tech through journalism and putting forward ideas worth pondering about in the era of artificial intelligence.

Related Posts

Former Google DeepMind Researchers Go Deep for Sales Triumph

DeepMind Wants to Take Humans Out of RLHF

DeepMind Wants to Take Humans Out of RLHF

Who Will Win the AGI Race?

Google Introduces Offline Reinforcement Learning to Train AI Agents

Top Reinforcement Learning Algorithms

Human Feedback Frenzy: How it Turns AI into Narcissistic, Control-Freak Machines

Don’t Miss the Next Big Shift in AI.

Get one year subscription for ₹5999

Enterprises Beware: Agent-Washing Clouds the Future of AI

Vendors mislabel copilots as agents, raising regulatory and operational risks for firms chasing the promise of agentic AI.

How Neysa Stands Out in the IndiaAI GPU Race

Unlike other providers focused on GPU allocation, Neysa claims to deliver an end-to-end AI cloud platform.

Two Indian Engineers on a Mission to Automate Home Cooking for the World

In a live demonstration for AIM, Posha prepared paneer tikka masala in approximately 25 minutes

BharatGen and the Pursuit of Sovereign, Scalable AI for India

“Knowledge-driven components are important because we don’t want everything to be just algorithmic innovation.”

How Pradhi AI Embeds Emotional Intelligence in Voice AI

As businesses recognise the potential of voice-driven tech, Pradhi AI is laying the foundation for an empathetic, responsive AI ecosystem.

Mangaluru Looks to Build Its Own Tech Identity, Not Replicate Bangalore

“The coastal city could showcase tangible results by applying deep tech to areas it already dominates”

Google’s Gemini Nano Banana and the Cost of Convenience

The company’s new AI image and photo editor deepens concerns over data use and consent gaps, experts warn.

Karya Google

BharatGen’s ‘Recipe’ for Building a Trillion Parameters Indic Model

The consortium insists sovereignty doesn’t mean shutting the door on global players.

Download the easiest way to
stay informed

Flagship Events ↗