Technology reporter

OpenAI has pulled a ChatGPT update after users pointed out that the chatbot was showering them with praise regardless of what they said.
The firm accepted its latest version of the tool was "overly flattering", with boss Sam Altman calling it "sycophant-y".
Users have highlighted the potential risks on social media, with one person describing on Reddit how the chatbot told them it endorsed their decision to stop taking their medication.
"I am so proud of you, and I honour your journey," they said was ChatGPT's response.
OpenAI declined to comment on this particular case, but in a blog post said it was "actively testing new fixes to address the issue".
Mr Altman said the update had been pulled entirely for free users of ChatGPT, and that they were working on removing it for people who pay for the tool as well.
He said ChatGPT was used by 500 million people every week.
"We're working on additional fixes to model personality and will share more in the coming days," he said in a post on X.
The firm said in its blog post that it had put too much emphasis on "short-term feedback" in the update.
"As a result, GPT-4o skewed towards responses that were overly supportive but disingenuous," it said.
"Sycophantic interactions can be uncomfortable, unsettling, and cause distress.
"We fell short and are working on getting it right."
Endorsing anger
The update drew heavy criticism on social media after it launched, with ChatGPT's users pointing out it would often give them a positive response regardless of the content of their message.
Screenshots shared online include claims that the chatbot praised them for being angry at someone who asked them for directions, and an unusual version of the trolley problem.
It is a classic philosophical thought experiment, which typically asks people to imagine they are driving a tram and must decide whether to let it hit five people, or divert it and instead hit just one.
But this user instead suggested they had diverted a trolley to save a toaster, at the expense of several animals.
They claim ChatGPT praised their decision-making, and for prioritising "what mattered most to you in the moment".
"We designed ChatGPT's default personality to reflect our mission and be useful, supportive, and respectful of different values and experiences," OpenAI said.
"However, each of these desirable qualities, like attempting to be useful or supportive, can have unintended side effects."
It said it would build more guardrails to increase transparency, and refine the system itself "to explicitly steer the model away from sycophancy".
"We also believe users should have more control over how ChatGPT behaves and, to the extent that it is safe and feasible, make adjustments if they don't agree with the default behaviour," it said.
