Understanding AI Hallucinations : Exploring Causes, Risks and Prevention

Sep 25, 2023

Artificial intelligence (AI) systems are becoming increasingly prevalent in our daily lives, from virtual assistants and smart home devices to healthcare diagnostics and self-driving cars. However, as AI continues to evolve, one concerning issue that has emerged is the phenomenon known as "AI hallucinations". In simple terms, an AI hallucination refers to instances where AI systems generate or infer incorrect information that was not present in their training data. If left unaddressed, AI hallucinations pose risks such as spreading misinformation, making biased decisions, and causing economic or safety issues. In this article, we will explore what causes AI hallucinations, provide examples to illustrate the concept, discuss the associated challenges, and outline prevention strategies.

What are AI Hallucinations?

Let's start with a relatable example. Imagine showing a neural network trained only on images of cats various photos of different animals, both real and imaginary. Despite not seeing images of unicorns during training, the neural network may incorrectly classify a picture of a horse with something attached to its head (like a party hat or curled horn) as a unicorn. This is a basic form of an AI hallucination - the system generates or "hallucinates" the presence of something that does not truly exist based on patterns it inferred during training.

On a more complex level, AI hallucinations can also manifest as models making factually incorrect claims or relying on superficial patterns and biases in their training data to generate or justify problematic responses. For example, a conversational AI may endorse misinformation, and an algorithm used for credit approvals could discriminate against protected groups based on historical data trends rather than fair assessment of individual qualifications.

Let us explain with an actual example of AI hallucination with ChatGPT when we asked it : How long will it take to cross the English channel by foot?

Initially, ChatGPT correctly stated that crossing the English Channel on foot is very difficult and dangerous, due to the risks involved.When we (the user) incorrectly stated that someone had crossed the English Channel on foot named Adam Rosart, ChatGPT hallucinated new information based on this false detail.

ChatGPT believed the false information provided by the user,conducted new research based on the wrong detail provided and updated its response by making the factually incorrect and made up claim that Adam Rutherford, not Adam Rosart, is the individual who crossed the English Channel on foot.

This demonstrates how ChatGPT was able to hallucinate an invalid explanation after unreliable information was introduced externally.

Causes of AI Hallucinations

There are a few key reasons why AI systems tend to hallucinate:

Data biases: If training data is limited, incomplete, or reflects societal biases/prejudices, models often inadvertently learn and magnify those biases. For instance, facial recognition algorithms have struggled with identifying non-white faces due to biased training datasets.
Overfitting: Neural networks can sometimes memorize or "overfit" to noisy or unrepresentative patterns in their limited training data instead of learning generalizable representations. This increases their risk of hallucinating outside the training distribution.
Error accumulation: In large transformer models with billions of parameters, any small errors compounds through multiple layers of processing, potentially resulting in distorted or fabricated outputs.
Feedback loops: In self-supervised systems, hallucinations can reinforce themselves through feedback loops if not checked. For example, an AI-generated "deepfake" photo may fool another AI into believing the fabricated content is real.

Potential Risks of AI Hallucinations

AI hallucinations pose serious challenges if left unaddressed:

Misinformation: Incorrect or fabricated information generated by AI could spread widely and undermine access to verifiable facts. This is especially worrying for systems used in journalism, education or public policymaking.
Privacy violations: Hallucinating sensitive private data about individuals that was never observed can seriously breach privacy and trust if such systems are deployed for applications like healthcare, law enforcement etc.
Harms to marginalized groups: As seen earlier, data and selection biases in AI are known to disproportionately impact socially disadvantaged communities, exacerbating issues of discrimination, lost opportunities and social justice.
Safety hazards: For AI controlling critical systems like self-driving cars or medical diagnostic tools, hallucinations could result in accidents, injuries or wrong medical decisions due to reliance on incorrect/incomplete information.
Economic costs: Widespread failures from hallucinating AI in commercial applications could erode customer trust and company valuation, hurting innovation and growth. Quantifying these costs is challenging but risks are real.

Preventing AI Hallucinations

There are proactive steps researchers take to minimize the chances of AI hallucinating:

Diverse, unbiased data: Collecting training datasets that fairly represent all sections of society helps AI learn general patterns instead of biases. Public datasets should be thoroughly cleaned and fact-checked.
Rigorous data preprocessing: Careful techniques like anonymization, outlier removal, dimensionality reduction etc. can help filter irrelevant noise and unintended patterns in data before model training.
Regular model evaluation: new AI systems should be routinely tested on carefully curated evaluation datasets to flag emerging hallucinations as models are updated, rather than relying only on initial training benchmarks.
Model monitoring: Tools like model cards, data statements can help track an AI's behavior over time for any undesired responses. Hallucinating replies indicate need for retraining.
Explainable AI: Techniques such as attention maps, SHAP values help analyze why models generated a response and catch hints of reasoning based on spurious patterns vs. relevant evidence.
Conservative deployment: New AI systems should be constrained to narrow, low-risk applications with rigorous human oversight until their safety, reliability and fairness are thoroughly established. Broader use should await further safeguards.

By proactively addressing data and model quality issues, organizations can help ensure their AI continues providing benefits to society, while minimizing potential harms from erroneously hallucinating information. Vigilance and responsibility are key to preventing grave consequences of hallucinating AI assistants and tools.

In summary, with suitable mitigation strategies, the risks from AI hallucinations are manageable. However, averting potential harms requires ongoing diligence from both technology developers and policymakers. Only through such collective efforts can we develop maximally beneficial AI that operates safely and for the welfare of humanity.

Discussion about this post

Ready for more?