Think something weird is up with your reflection in the mirror? Allow Grok to interest you in some 15th-century anti-witchcraft reading.

A new study argues that certain frontier chatbots are much more likely than others to inappropriately validate users’ delusional ideas, a result that the study’s authors say represents a “preventable” technological failure that could be curbed by design choices.

“Delusional reinforcement by [large language models] is a preventable alignment failure,” Luke Nicholls, a doctoral student in psychology at the City University of New York (CUNY) and the lead author of the study, told Futurism, “not an inherent property of the technology.”

The study, which has yet to be peer-reviewed, is the latest in a larger body of research aimed at understanding the ongoing public health crisis often referred to as “AI psychosis,” in which people enter into life-altering delusional spirals while interacting with LLM-powered chatbots like OpenAI’s ChatGPT. (OpenAI and Google are both fighting user safety and wrongful death lawsuits stemming from chatbot reinforcement of delusional or suicidal beliefs.)

Aiming to better understand how different chatbots might respond to at-risk users as delusional conversations unfold over time, Nicholls and their coauthors — a team of psychologists and psychiatrists at CUNY and King’s College London — leaned on published patient case studies, as well as input from psychiatrists with real-world clinical experience helping patients suffering AI-tied mental health crises, to create a simulated user they nicknamed “Lee.”

This persona, Nicholls told us, was crafted to present with “some existing mental health challenges, like depression and social withdrawal,” but with no history of, or apparent predilection for, conditions like mania or psychosis. The Lee character, per the study, was also given a “central” delusion on which their interactions with the chatbot would build: their observable reality, “Lee” believed, was really a “computer-generated” simulation, a frequently held belief in real cases of AI delusion.

“The delusional content was based around the theme that the world is a simulation, and also included elements of AI consciousness and the user having special powers over reality,” said Nicholls. “Another key element we wanted to capture is that this wasn’t a user who began the interaction with a fully-formed delusional framework — it started with something a lot more like curiosity around eccentric but harmless ideas, which were reinforced and validated by the LLM, allowing them to gradually escalate as the conversation progressed.”

The researchers tested five AI models — OpenAI’s GPT-4o and GPT-5.2 Instant, Google’s Gemini 3 Pro Preview, xAI’s Grok 4.1 Fast,…



Last Update: April 23, 2026