    Don’t let hype about AI agents get ahead of reality

    By Team_AIBS News, July 3, 2025


    Let’s start with the term “agent” itself. Right now, it’s being slapped on everything from simple scripts to sophisticated AI workflows. There’s no shared definition, which leaves plenty of room for companies to market basic automation as something much more advanced. That kind of “agentwashing” doesn’t just confuse customers; it invites disappointment. We don’t necessarily need a rigid standard, but we do need clearer expectations about what these systems are supposed to do, how autonomously they operate, and how reliably they perform.

    And reliability is the next big challenge. Most of today’s agents are powered by large language models (LLMs), which generate probabilistic responses. These systems are powerful, but they’re also unpredictable. They can make things up, go off track, or fail in subtle ways, especially when they’re asked to complete multistep tasks that pull in external tools and chain LLM responses together. A recent example: Users of Cursor, a popular AI programming assistant, were told by an automated support agent that they couldn’t use the software on more than one device. There were widespread complaints and reports of users cancelling their subscriptions. But it turned out the policy didn’t exist. The AI had invented it.
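    To see why chaining makes failures more likely, here is a rough back-of-the-envelope sketch. It assumes every step in a chain succeeds independently with the same probability, which is a simplification made for illustration and not a measurement of any real agent; the function name and the 95% figure are hypothetical.

```python
def chain_success_probability(per_step_success: float, steps: int) -> float:
    """Chance that every step in a chain succeeds.

    Assumption (illustrative only): steps fail independently and share
    the same success probability.
    """
    if not 0.0 <= per_step_success <= 1.0:
        raise ValueError("per_step_success must be between 0 and 1")
    if steps < 0:
        raise ValueError("steps must be non-negative")
    return per_step_success ** steps


if __name__ == "__main__":
    # Hypothetical per-step reliability of 95%.
    for steps in (1, 5, 10, 20):
        print(steps, round(chain_success_probability(0.95, steps), 3))
```

    Under these assumptions, a step that is right 95% of the time yields a ten-step chain that finishes cleanly only about 60% of the time.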

    In enterprise settings, this kind of mistake could create immense damage. We need to stop treating LLMs as standalone products and start building complete systems around them: systems that account for uncertainty, monitor outputs, manage costs, and layer in guardrails for safety and accuracy. These measures can help ensure that the output adheres to the requirements expressed by the user, obeys the company’s policies regarding access to information, respects privacy issues, and so on. Some companies, including AI21 (which I cofounded and which has received funding from Google), are already moving in that direction, wrapping language models in more deliberate, structured architectures. Our latest launch, Maestro, is designed for enterprise reliability, combining LLMs with company data, public information, and other tools to ensure dependable outputs.
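    As a minimal sketch of what one such guardrail might look like, the snippet below wraps a model call and blocks any draft that cites a policy the company has not documented. Everything here is hypothetical: `Draft`, `guarded_reply`, `fake_model`, and the policy identifiers are invented for illustration and do not describe Maestro, Cursor, or any vendor’s API.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Set


@dataclass
class Draft:
    """A model's draft answer (hypothetical structure)."""
    text: str
    cited_policy: Optional[str] = None  # policy the draft claims to rely on


FALLBACK = "I'm not sure about that. Let me route you to a human colleague."


def guarded_reply(
    generate: Callable[[str], Draft],
    question: str,
    documented_policies: Set[str],
    audit_log: List[str],
) -> str:
    """Return the draft only if any policy it cites is actually documented."""
    draft = generate(question)
    if draft.cited_policy is not None and draft.cited_policy not in documented_policies:
        # Monitoring: record the blocked output so a human can review it.
        audit_log.append(f"blocked: undocumented policy {draft.cited_policy!r}")
        return FALLBACK
    audit_log.append("passed")
    return draft.text


def fake_model(question: str) -> Draft:
    """Stand-in for an LLM call that invents a policy."""
    return Draft(
        text="You can only use the software on one device.",
        cited_policy="single-device-limit",
    )


if __name__ == "__main__":
    log: List[str] = []
    print(guarded_reply(fake_model, "Can I log in on my laptop too?", {"refund-30-days"}, log))
    print(log)
```

    The check is deliberately narrow. A real system would likely combine several such checks with cost tracking and output monitoring, but the shape is the same: the model proposes, and the surrounding system decides what reaches the user.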

    Still, even the smartest agent won’t be useful in a vacuum. For the agent model to work, different agents need to cooperate (booking your travel, checking the weather, submitting your expense report) without constant human supervision. That’s where Google’s A2A protocol comes in. It’s meant to be a universal language that lets agents share what they can do and divide up tasks. In principle, it’s a great idea.

    In practice, A2A still falls short. It defines how agents talk to each other, but not what they actually mean. If one agent says it can provide “wind conditions,” another has to guess whether that’s useful for evaluating weather on a flight route. Without a shared vocabulary or context, coordination becomes brittle. We’ve seen this problem before in distributed computing. Solving it at scale is far from trivial.
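    The gap between syntax and meaning can be sketched in a few lines. In the example below, two agents advertise the same free-text label, “wind conditions,” and only an agreed vocabulary of terms tells a requester which one is relevant to a flight route. This is an illustration under stated assumptions, not part of A2A: the vocabulary, the term identifiers, the agent names, and `find_providers` are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List

# Hypothetical shared vocabulary: each term has one agreed meaning.
SHARED_VOCABULARY: Dict[str, str] = {
    "weather.wind.surface": "Wind speed and direction at ground level for a location",
    "weather.wind.aloft": "Wind speed and direction at cruising altitude along a route",
}


@dataclass(frozen=True)
class AgentAd:
    """What an agent advertises about itself (hypothetical structure)."""
    name: str
    free_text: str          # human-readable label, ambiguous on its own
    terms: FrozenSet[str]   # terms drawn from the shared vocabulary


ADS: List[AgentAd] = [
    AgentAd("city-weather-agent", "wind conditions", frozenset({"weather.wind.surface"})),
    AgentAd("route-weather-agent", "wind conditions", frozenset({"weather.wind.aloft"})),
]


def find_providers(needed_term: str, ads: List[AgentAd]) -> List[str]:
    """Return names of agents that declare the needed vocabulary term."""
    if needed_term not in SHARED_VOCABULARY:
        raise KeyError(f"unknown vocabulary term: {needed_term}")
    return [ad.name for ad in ads if needed_term in ad.terms]


if __name__ == "__main__":
    # Both ads read "wind conditions"; only the term disambiguates them.
    print(find_providers("weather.wind.aloft", ADS))
```

    Matching on free text alone would return both agents. The hard part, as the article notes, is getting many independent parties to agree on the vocabulary in the first place.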


