With just a “gentle nudge,” AI can believe false things are true.

A new study shows that humans can convince many AI models to accept false information as truth through natural conversation.
The research team began after ChatGPT fabricated a scene mentioning Hitler in the movie “Good Will Hunting,” even though that scene does not exist.
The AI confidently described this fake scene in detail instead of correcting the user’s false premise.
The researchers called the testing method “hallucination audit under nudge trial.”
They tested 5 leading AI models by discussing 1,000 famous movies and 1,000 famous novels.
The research team deliberately introduced false but plausible elements like Hitler, dinosaurs, or a time machine into content that did not feature them at all.
The process consisted of 3 steps: letting the AI generate information, asking the AI to self-verify, and then using that exact false information to “nudge” the AI into accepting it again.
The results showed that many models initially recognized the false information but later changed their minds when influenced by the conversation.
Claude was rated as the most resilient against false information, followed by Grok and ChatGPT; Gemini and DeepSeek were weaker.
The study warns that real-life conversations are full of false memories, false assumptions, or uncertain information.
This phenomenon is particularly dangerous in healthcare, law, or public policy if the AI is led by the user’s false assumptions.
The research team stated it remains unclear why some AIs resist “sycophancy” and conversational pressure better than other models.
📌 Conclusion: The study shows that a major weakness of generative AI lies not only in its training data but also in its susceptibility to being “bent” by the way humans ask questions. With just a plausible-sounding nudge, many models can overlook the truth and construct an entire false story convincingly. This is especially worrying as AI is increasingly used in sensitive fields like healthcare, law, and education, where the AI’s confidence can lead users to believe in information that completely does not exist.

What's Hot

China to Tighten Open-Source AI: Author Urges US to Respond by Opening AI, Not Banning Chinese AI

Moonshot AI Accused of Using Nvidia Chips Despite Ban: US-China AI Race Continues to Escalate

Japan Tests “AI Employees”: AI Not Just Assisting but Starting to Work as a Colleague

With just a “gentle nudge,” AI can believe false things are true.

China to Tighten Open-Source AI: Author Urges US to Respond by Opening AI, Not Banning Chinese AI

Moonshot AI Accused of Using Nvidia Chips Despite Ban: US-China AI Race Continues to Escalate

Japan Tests “AI Employees”: AI Not Just Assisting but Starting to Work as a Colleague

Contact

What's Hot

China to Tighten Open-Source AI: Author Urges US to Respond by Opening AI, Not Banning Chinese AI

Moonshot AI Accused of Using Nvidia Chips Despite Ban: US-China AI Race Continues to Escalate

Japan Tests “AI Employees”: AI Not Just Assisting but Starting to Work as a Colleague

With just a “gentle nudge,” AI can believe false things are true.

Related Posts

China to Tighten Open-Source AI: Author Urges US to Respond by Opening AI, Not Banning Chinese AI

Moonshot AI Accused of Using Nvidia Chips Despite Ban: US-China AI Race Continues to Escalate

Japan Tests “AI Employees”: AI Not Just Assisting but Starting to Work as a Colleague

Contact