
    Apollo and Design Choices of Video Large Multimodal Models (LMMs) | by Matthew Gunton | Jan, 2025

    By Team_AIBS News, January 24, 2025


    Let’s explore the major design choices from Meta’s Apollo paper

    Towards Data Science

    Image by Author (Flux.1 Schnell)

    As we’ve been expecting, models are becoming increasingly capable of understanding different types of inputs. We’ve seen image transformer models (see my blogs on fine-tuning Flux and the research behind MM1) and now we’re beginning to see video models hit the scene.

    In December of 2024, Meta unveiled their new Apollo family of models. When they unveiled these, they also published a paper detailing their research and work around Large Multimodal Models (LMMs). The paper is full of great details, so rather than try to cover it all I’ll be focusing on the 4 major design choices they highlighted when making their model.

    Let’s dive in!

    Embedding

    Let’s first lay out some quick ideas that are important for understanding what’s going on here. Every Transformer relies on embeddings for its input. However, user input is typically first converted from something user-understood (text, videos) to tokens and then to embeddings. To convert to embeddings, we use an embedding model. For multi-modal inputs, we typically use a different encoder for each input type.
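To make the text-to-tokens-to-embeddings flow concrete, here is a minimal toy sketch in plain Python. It is not Apollo's architecture: the vocabulary, the tiny embedding width, the random weights, and the names `tokenize`, `embed_text`, `embed_video`, and `build_input` are all assumptions made up for illustration. It only shows the general idea that each input type gets its own encoder and that both encoders emit vectors of the same width, so the Transformer can consume them as one sequence.

```python
import random

EMBED_DIM = 4  # hypothetical shared embedding width (real models use thousands)


def make_table(rows, cols, seed):
    """Build a rows x cols matrix of random weights (stand-in for learned ones)."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


# Text path: text -> token ids -> embeddings via table lookup.
VOCAB = {"<unk>": 0, "a": 1, "cat": 2, "jumps": 3}  # toy vocabulary
TEXT_TABLE = make_table(len(VOCAB), EMBED_DIM, seed=0)


def tokenize(text):
    """Map each whitespace-separated word to a token id (unknown words -> 0)."""
    return [VOCAB.get(word, VOCAB["<unk>"]) for word in text.lower().split()]


def embed_text(text):
    """Look up one embedding vector per token."""
    return [TEXT_TABLE[token_id] for token_id in tokenize(text)]


# Video path: a separate encoder. Here each frame is a flat list of pixel
# values that a linear projection maps into the same EMBED_DIM space.
FRAME_PIXELS = 6  # hypothetical flattened frame size
VIDEO_PROJ = make_table(FRAME_PIXELS, EMBED_DIM, seed=1)


def embed_video(frames):
    """Project each frame to one embedding vector (one 'video token' per frame)."""
    embeddings = []
    for frame in frames:
        if len(frame) != FRAME_PIXELS:
            raise ValueError("each frame must have FRAME_PIXELS values")
        embeddings.append([
            sum(pixel * VIDEO_PROJ[i][j] for i, pixel in enumerate(frame))
            for j in range(EMBED_DIM)
        ])
    return embeddings


def build_input(text, frames):
    """Concatenate video and text embeddings into a single input sequence."""
    return embed_video(frames) + embed_text(text)


frames = [[0.1] * FRAME_PIXELS, [0.5] * FRAME_PIXELS]  # two fake frames
sequence = build_input("a cat jumps", frames)
print(len(sequence), len(sequence[0]))
```

The key design point is the shared width: because both encoders output `EMBED_DIM`-sized vectors, the two frame embeddings and three text embeddings can sit side by side in one sequence.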


