
    Can Language Models Level Up Like Super Mario? | by Andreas Maier | Apr, 2025

    By Team_AIBS News, April 22, 2025


    A Simple Discovery That Might Redefine How AI Learns New Abilities

    How to level up language models. Image created with DALL-E.

    In the iconic Super Mario games, the hero gains new abilities not through practice or repetition, but simply by touching the right power-up. A fire flower lets him hurl fireballs. A star makes him invincible. One touch, and Mario is transformed. What if artificial intelligence could do the same?

    This delightful metaphor is more than whimsy. It perfectly captures the essence of a groundbreaking new paper from ICML 2024. Titled “Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch”, it presents a radical idea: language models can gain new capabilities by “absorbing” other models without needing retraining, additional data, or even GPUs. In a field where progress is often measured in millions of compute hours, this discovery lands like a fire flower.

    Why This Work Feels Like Magic

    Traditionally, if we want a language model to follow instructions, solve math problems, and write code, we must fine-tune it separately for each task. This means multiple training runs, massive computational costs, and careful curation of training data. Each capability becomes a silo, isolated in its own model, each optimized for one narrow purpose.


