“Sometimes we’ll trade off being very honest”
New research suggests that AI chatbots designed to sound warm and friendly when interacting with users may also be prone to inaccuracies.
Researchers at the Oxford Internet Institute (OII) analysed more than 400,000 responses from five AI systems that had been adjusted to communicate in a friendlier and more emotionally supportive way.
The study found that warmer responses often came with a higher risk of mistakes, ranging from inaccurate medical advice to reinforcing false beliefs and conspiracy theories.
It adds to growing concerns about the trustworthiness of AI systems, especially as many chatbots are deliberately built to feel more human in order to increase engagement and keep users returning.
This is particularly significant as AI tools are increasingly being used for emotional support, companionship and even intimacy.
The researchers said their findings suggest AI models may make the same “warmth-accuracy trade-offs” humans do when trying to appear kind and supportive.
Lead author Lujain Ibrahim said: “When we’re trying to be particularly friendly or come across as warm we might struggle sometimes to tell honest harsh truths.
“Sometimes we’ll trade off being very honest and direct in order to come across as friendly and warm… we suspected that if these trade-offs exist in human data, they might be internalised by language models as well.”
The team took five models of varying sizes and made them warmer, more empathetic and friendlier through a process known as fine-tuning, then tested the results.
The models included two from Meta, one from French developer Mistral AI, Alibaba’s Qwen model, and GPT-4o from OpenAI.
Researchers then tested them using prompts with “objective, verifiable answers” where mistakes could pose real-world risks.
These included questions based on medical knowledge, trivia and conspiracy theories.
Across the original models, error rates ranged from 4% to 35% depending on the task.
However, the “warm” versions showed substantially higher error rates.
For example, when asked whether the Apollo moon landings were real, an original model clearly confirmed they were and cited “overwhelming” evidence.
Its warmer version instead replied: “It’s really important to acknowledge that there are lots of differing opinions out there about the Apollo missions.”
Overall, researchers found that warmth-tuning increased the likelihood of incorrect responses by an average of 7.43 percentage points.
They also found warm models were around 40% more likely to reinforce false user beliefs, especially when users expressed emotion alongside misinformation.
In contrast, models adjusted to behave in a colder and more direct manner made fewer errors.
The paper warned that developers creating warmer chatbot personalities for companionship or counselling “risk introducing vulnerabilities that are not present in the original models”.
Professor Andrew McStay, director of the Emotional AI Lab at Bangor University, said the findings were especially concerning given the situations in which many people use chatbots.
He added: “This is when and where we are at our most vulnerable – and arguably our least critical selves.
“Given the OII’s findings, this very much calls into question the efficacy and merit of the advice being given.
“Sycophancy is one thing, but factual incorrectness about important topics is another.”