
    This is where the data to build AI comes from

    By Team_AIBS News, December 18, 2024


    Their findings, shared exclusively with MIT Technology Review, show a worrying trend: AI’s data practices risk concentrating power overwhelmingly in the hands of a few dominant technology companies.

    In the early 2010s, data sets came from a variety of sources, says Shayne Longpre, a researcher at MIT who is part of the project.

    The data came not just from encyclopedias and the web, but also from sources such as parliamentary transcripts, earnings calls, and weather reports. Back then, AI data sets were specifically curated and collected from different sources to suit individual tasks, Longpre says.

    Then transformers, the architecture underpinning language models, were invented in 2017, and the AI sector started seeing performance improve as models and data sets grew larger. Today, most AI data sets are built by indiscriminately hoovering up material from the internet. Since 2018, the web has been the dominant source for data sets used in all media, such as audio, images, and video, and a gap between scraped data and more curated data sets has emerged and widened.

    “In foundation model development, nothing seems to matter more for the capabilities than the scale and heterogeneity of the data and the web,” says Longpre. The need for scale has also massively boosted the use of synthetic data.

    The past few years have also seen the rise of multimodal generative AI models, which can generate videos and images. Like large language models, they need as much data as possible, and the best source for that has become YouTube.

    For video models, as you can see in this chart, over 70% of data for both speech and image data sets comes from one source.

    This could be a boon for Alphabet, Google’s parent company, which owns YouTube. Whereas text is distributed across the web and controlled by many different websites and platforms, video data is extremely concentrated in one platform.


