Can Language Models Level Up Like Super Mario? | by Andreas Maier | Apr, 2025

A Simple Discovery That Might Redefine How AI Learns New Abilities

How to level up language models. Image created with DALL-E.

In the iconic Super Mario games, the hero gains new abilities not through practice or repetition, but simply by touching the right power-up. A fire flower lets him hurl fireballs. A star makes him invincible. One touch, and Mario is transformed. What if artificial intelligence could do the same?

This delightful metaphor is more than whimsy. It captures the essence of a groundbreaking new paper from ICML 2024. Titled “Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch”, it presents a radical idea: language models can gain new capabilities by “absorbing” other models without retraining, extra data, or even GPUs. In a field where progress is often measured in millions of compute hours, this discovery lands like a fire flower.

Why This Work Feels Like Magic

Traditionally, if we want a language model to follow instructions, solve math problems, and write code, we must fine-tune it separately for each task. This means multiple training runs, huge computational costs, and careful curation of training data. Each capability becomes a silo, isolated in its own model and optimized for one narrow purpose.
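To make the contrast with separate fine-tuning concrete, here is a rough sketch of what "absorbing" could look like. It assumes the drop-and-rescale idea (DARE) from the cited paper: take the difference between each fine-tuned model and the shared base model, randomly zero most of that difference, rescale what survives, and add the result back to the base. Plain Python lists stand in for weight tensors. The names `drop_and_rescale`, `absorb`, and `drop_rate` are hypothetical, and the simple summation of deltas is an assumption, not the paper's exact merging recipe.

```python
import random


def drop_and_rescale(delta, drop_rate, rng):
    """Zero each delta entry with probability drop_rate and rescale survivors.

    Survivors are multiplied by 1 / (1 - drop_rate), so the expected value
    of every entry stays the same. Requires 0 <= drop_rate < 1.
    """
    keep_scale = 1.0 / (1.0 - drop_rate)
    return [0.0 if rng.random() < drop_rate else d * keep_scale for d in delta]


def absorb(base, finetuned_models, drop_rate=0.9, seed=0):
    """Merge several fine-tuned models into one (illustrative assumption).

    Every model in finetuned_models must share the same base weights,
    i.e. the models are homologous. No gradient step is taken anywhere.
    """
    rng = random.Random(seed)
    merged = list(base)
    for weights in finetuned_models:
        # Delta parameters: what fine-tuning changed relative to the base.
        delta = [w - b for w, b in zip(weights, base)]
        sparse_delta = drop_and_rescale(delta, drop_rate, rng)
        merged = [m + s for m, s in zip(merged, sparse_delta)]
    return merged


# Toy weights standing in for a base model and two fine-tuned variants.
base_weights = [1.0, 2.0, 3.0, 4.0]
math_weights = [1.1, 2.0, 2.9, 4.0]
code_weights = [1.0, 2.2, 3.0, 3.8]
merged_weights = absorb(base_weights, [math_weights, code_weights], drop_rate=0.5)
```

The whole procedure is element-wise arithmetic on existing weights, which is why no training run is involved. How aggressively `drop_rate` can be raised before quality suffers is an empirical question that this toy example does not answer.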
