Measured sycophancy rates on the BrokenMath benchmark. Lower is better. Credit: Petrov et al.
GPT-5 also showed the best “utility” of the tested models, solving 58 percent of the original problems despite the errors introduced into the modified theorems. Across the board, however, the researchers found that LLMs became more sycophantic when the original problem was harder to solve.
While hallucinating proofs for false theorems is obviously a big problem, the researchers…
→ Continue reading at Ars Technica
