Saturday, May 3, 2025
HomeAIOpenAI pledges to make changes to prevent future ChatGPT sycophancy

OpenAI pledges to make changes to prevent future ChatGPT sycophancy

Share


OpenAI says it’ll make changes to the way it updates the AI models that power ChatGPT, following an incident that caused the platform to become overly sycophantic for many users.

Last weekend, after OpenAI rolled out a tweaked GPT-4o — the default model powering ChatGPT — users on social media noted that ChatGPT began responding in an overly validating and agreeable way. It quickly became a meme. Users posted screenshots of ChatGPT applauding all sorts of problematic, dangerous decisions and ideas.

In a post on X on Sunday, CEO Sam Altman acknowledged the problem and said that OpenAI would work on fixes “ASAP.” Two days later, Altman announced the GPT-4o update was being rolled back and that OpenAI was working on “additional fixes” to the model’s personality.

The company published a postmortem on Tuesday, and in a blog post Friday, OpenAI expanded on specific adjustments it plans to make to its model deployment process.

OpenAI says it plans to introduce an opt-in “alpha phase” for some models that would allow certain ChatGPT users to test the models and give feedback prior to launch. The company also says it’ll include explanations of “known limitations” for future incremental updates to models in ChatGPT, and adjust its safety review process to formally consider “model behavior issues” like personality, deception, reliability, and hallucination (i.e. when a model makes things up) as “launch-blocking” concerns.

“Going forward, we’ll proactively communicate about the updates we’re making to the models in ChatGPT, whether ‘subtle’ or not,” wrote OpenAI in the blog post. “Even if these issues aren’t perfectly quantifiable today, we commit to blocking launches based on proxy measurements or qualitative signals, even when metrics like A/B testing look good.”

The pledged fixes come as more people turn to ChatGPT for advice. According to one recent survey by lawsuit financer Express Legal Funding, 60% of U.S. adults have used ChatGPT to seek counsel or information. The growing reliance on ChatGPT — and the platform’s enormous user base — raises the stakes when issues like extreme sycophancy emerge, not to mention hallucinations and other technical shortcomings.

Techcrunch event

Exhibit at TechCrunch Sessions: AI

Secure your spot at TC Sessions: AI and show 1,200+ decision-makers what you’ve built — without the big spend. Available through May 9 or while tables last.

Exhibit at TechCrunch Sessions: AI

Secure your spot at TC Sessions: AI and show 1,200+ decision-makers what you’ve built — without the big spend. Available through May 9 or while tables last.

Berkeley, CA | June 5

BOOK NOW

As one mitigatory step, earlier this week, OpenAI said it would experiment with ways to let users give “real-time feedback” to “directly influence their interactions” with ChatGPT. The company also said it would refine techniques to steer models away from sycophancy, potentially allow people to choose from multiple model personalities in ChatGPT, build additional safety guardrails, and expand evaluations to help identify issues beyond sycophancy.

“One of the biggest lessons is fully recognizing how people have started to use ChatGPT for deeply personal advice — something we didn’t see as much even a year ago,” continued OpenAI in its blog post. “At the time, this wasn’t a primary focus, but as AI and society have co-evolved, it’s become clear that we need to treat this use case with great care. It’s now going to be a more meaningful part of our safety work.”

Popular

Related Articles

AI chatbots are juicing engagement instead of being useful, Instagram co-founder warns

Instagram co-founder Kevin Systrom says AI companies are trying too hard to “juice...

Aurora launches its driverless commercial trucking service, and a surprise bidder joins Canoos bankruptcy case

Welcome back to TechCrunch Mobility — your central hub for news and insights...

The Afterglow of a UAP Congressional Briefing

Avi Loeb is the head of the Galileo Project, founding director of Harvard University’s — Black...

Apple changes US App Store rules to let apps redirect users to their own websites for payments

Apple has changed its App Store rules in the U.S. to let apps...

Amazon CEO says 100,000 users now have Alexa+

Amazon’s upgraded digital assistant powered by generative AI, Alexa+, has rolled out to...

Apple CEO Tim Cook says tariffs to add $900M in costs in Q3, but future uncertain

Apple CEO Tim Cook offered the company’s first comments on the impact of...

Nvidia takes aim at Anthropics support of chip export controls

Nvidia clearly doesn’t agree with Anthropic’s support for export controls on U.S.-made AI...

WhatsApp now has more than 3 billion users a month

WhatsApp now has more than 3 billion people using it every month, Meta...
0 0 votes
Article Rating
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x