The Breakthrough AI Scaling Desperately Needed

TokenFormer enables AI models to scale by preserving existing knowledge while seamlessly integrating new information, redefining long-context modelling and continual learning.
When Transformers were introduced, the entire AI ecosystem was reshaped. But there was a problem. Once a model grew large enough and researchers wanted to modify or extend a specific part of it, the only option was to retrain the entire model from scratch. This was a critical issue.

To address it, researchers from Google, the Max Planck Institute, and Peking University introduced a new approach called TokenFormer. The innovation lies in treating model parameters as tokens themselves, allowing input tokens and model parameters to interact dynamically through an attention mechanism rather than through fixed linear projections.

The traditional Transformer architecture faces a significant challenge when scaling: it requires complete retraining from scratch whenever architectural modifications are made.
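In rough code, the idea can be sketched as a cross-attention layer whose keys and values are learnable "parameter tokens" rather than a fixed weight matrix. The sketch below is a minimal illustration under that reading, not TokenFormer's exact implementation; the class name PAttention, the plain scaled softmax, and the grow() helper are assumptions made for readability.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class PAttention(nn.Module):
    """Minimal sketch of token-parameter attention.

    A fixed linear projection (x @ W) is replaced by attention between
    the input tokens and a set of learnable key/value "parameter tokens".
    """

    def __init__(self, d_in: int, d_out: int, num_param_tokens: int):
        super().__init__()
        # Learnable parameter tokens: keys live in the input space,
        # values live in the output space.
        self.param_keys = nn.Parameter(torch.randn(num_param_tokens, d_in) * 0.02)
        self.param_values = nn.Parameter(torch.randn(num_param_tokens, d_out) * 0.02)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_in)
        scores = x @ self.param_keys.t() / (x.shape[-1] ** 0.5)   # (batch, seq, num_param_tokens)
        weights = F.softmax(scores, dim=-1)                       # attend over parameter tokens
        return weights @ self.param_values                        # (batch, seq, d_out)

    @torch.no_grad()
    def grow(self, extra_tokens: int) -> None:
        # Scale the layer up by appending new parameter tokens while keeping
        # the existing ones. (The paper pairs this with a modified softmax so
        # that zero-initialised additions leave the model's output unchanged;
        # with the plain softmax above that holds only approximately.)
        new_keys = torch.zeros(extra_tokens, self.param_keys.shape[1])
        new_values = torch.zeros(extra_tokens, self.param_values.shape[1])
        self.param_keys = nn.Parameter(torch.cat([self.param_keys, new_keys]))
        self.param_values = nn.Parameter(torch.cat([self.param_values, new_values]))


# Usage: grow the layer instead of retraining from scratch.
layer = PAttention(d_in=256, d_out=256, num_param_tokens=512)
y = layer(torch.randn(2, 16, 256))   # (2, 16, 256)
layer.grow(256)                      # now 768 parameter tokens
```

Because the projection is expressed as attention over these parameter tokens, scaling such a model means appending more of them rather than rebuilding and retraining the whole network.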