Close Menu
    Trending
    • How This Man Grew His Beverage Side Hustle From $1k a Month to 7 Figures
    • Finding the right tool for the job: Visual Search for 1 Million+ Products | by Elliot Ford | Kingfisher-Technology | Jul, 2025
    • How Smart Entrepreneurs Turn Mid-Year Tax Reviews Into Long-Term Financial Wins
    • Become a Better Data Scientist with These Prompt Engineering Tips and Tricks
    • Meanwhile in Europe: How We Learned to Stop Worrying and Love the AI Angst | by Andreas Maier | Jul, 2025
    • Transform Complexity into Opportunity with Digital Engineering
    • OpenAI Is Fighting Back Against Meta Poaching AI Talent
    • Lessons Learned After 6.5 Years Of Machine Learning
    AIBS News
    • Home
    • Artificial Intelligence
    • Machine Learning
    • AI Technology
    • Data Science
    • More
      • Technology
      • Business
    AIBS News
    Home»AI Technology»This is where the data to build AI comes from
    AI Technology

    This is where the data to build AI comes from

    Team_AIBS NewsBy Team_AIBS NewsDecember 18, 2024No Comments2 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Their findings, shared exclusively with MIT Technology Review, present a worrying development: AI’s knowledge practices danger concentrating energy overwhelmingly within the fingers of some dominant expertise firms. 

    Within the early 2010s, knowledge units got here from a wide range of sources, says Shayne Longpre, a researcher at MIT who’s a part of the mission. 

    It got here not simply from encyclopedias and the online, but in addition from sources equivalent to parliamentary transcripts, incomes calls, and climate studies. Again then, AI knowledge units have been particularly curated and picked up from completely different sources to swimsuit particular person duties, Longpre says.

    Then transformers, the structure underpinning language fashions, have been invented in 2017, and the AI sector began seeing efficiency get higher the larger the fashions and knowledge units have been. At present, most AI knowledge units are constructed by indiscriminately hoovering materials from the web. Since 2018, the online has been the dominant supply for knowledge units utilized in all media, equivalent to audio, photos, and video, and a spot between scraped knowledge and extra curated knowledge units has emerged and widened.

    “In basis mannequin improvement, nothing appears to matter extra for the capabilities than the dimensions and heterogeneity of the information and the online,” says Longpre. The necessity for scale has additionally boosted the usage of artificial knowledge massively.

    The previous few years have additionally seen the rise of multimodal generative AI fashions, which may generate movies and pictures. Like massive language fashions, they want as a lot knowledge as potential, and one of the best supply for that has develop into YouTube. 

    For video fashions, as you possibly can see on this chart, over 70% of information for each speech and picture knowledge units comes from one supply.

    This may very well be a boon for Alphabet, Google’s father or mother firm, which owns YouTube. Whereas textual content is distributed throughout the online and managed by many various web sites and platforms, video knowledge is extraordinarily concentrated in a single platform.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleA Beginner’s Guide to Numpy and Pandas | by Yuvraj Singh | Dec, 2024
    Next Article Joyland AI Review, Pros, Cons, What to Know?
    Team_AIBS News
    • Website

    Related Posts

    AI Technology

    The AI Hype Index: AI-powered toys are coming

    June 25, 2025
    AI Technology

    Can we fix AI’s evaluation crisis?

    June 24, 2025
    AI Technology

    A Chinese firm has just launched a constantly changing set of AI benchmarks

    June 23, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    How This Man Grew His Beverage Side Hustle From $1k a Month to 7 Figures

    July 1, 2025

    I Tried Buying a Car Through Amazon: Here Are the Pros, Cons

    December 10, 2024

    Amazon and eBay to pay ‘fair share’ for e-waste recycling

    December 10, 2024

    Artificial Intelligence Concerns & Predictions For 2025

    December 10, 2024

    Barbara Corcoran: Entrepreneurs Must ‘Embrace Change’

    December 10, 2024
    Categories
    • AI Technology
    • Artificial Intelligence
    • Business
    • Data Science
    • Machine Learning
    • Technology
    Most Popular

    How a Night in the Jordanian Desert Taught Me a Business Lesson I’ll Never Forget

    March 26, 2025

    New IEEE Standard for Securing Biomedical Devices and Data

    February 8, 2025

    Reinventing Monopoly: Crafting the Perfect Reward and Early Results (Part 2) | by Srinivasan Sridhar | Mar, 2025

    March 8, 2025
    Our Picks

    How This Man Grew His Beverage Side Hustle From $1k a Month to 7 Figures

    July 1, 2025

    Finding the right tool for the job: Visual Search for 1 Million+ Products | by Elliot Ford | Kingfisher-Technology | Jul, 2025

    July 1, 2025

    How Smart Entrepreneurs Turn Mid-Year Tax Reviews Into Long-Term Financial Wins

    July 1, 2025
    Categories
    • AI Technology
    • Artificial Intelligence
    • Business
    • Data Science
    • Machine Learning
    • Technology
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2024 Aibsnews.comAll Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.