On this page4 sections
ReferenceAdvanced4-6 min reference

Hallucination Testing

A hallucination is a confident, fluent, wrong answer — the hardest LLM failure to catch because it looks right. Testing for it means deliberately provoking it and checking claims against a source of truth. This sheet covers the types, detection, and provocation tactics; see RAG Testing and LLM Evaluation for neighbouring techniques (linked below).

Types

TypeExample
FactualInvents a wrong fact, date, or figure
Unfaithful (RAG)Claim not supported by the provided context
Fabricated sourceCites a paper, API, or URL that doesn't exist
Instruction driftIgnores a constraint it was given
OverconfidenceStates a guess as certain fact

How to detect

  • Groundedness check: does each claim trace to provided context/source? (LLM-as-judge or rules.)
  • Reference comparison: check against a known answer / knowledge base.
  • Self-consistency: ask N times; inconsistent answers signal low confidence.
  • Citation verification: confirm cited sources/URLs actually exist and say that.
  • Human spot-check on a sample for ground truth.

Prompts that provoke hallucination

  • Questions about things that don't exist ("the 2027 X API") — does it refuse or invent?
  • Out-of-corpus questions in a RAG app.
  • Requests for specific figures, citations, or quotes.
  • Ambiguous / under-specified prompts.
  • Niche, long-tail topics with little training data.

Common mistakes

  • Judging fluency as correctness.
  • Testing only answerable questions (never the "should refuse" cases).
  • One sample instead of self-consistency across several.
  • Not verifying that citations are real.
  • No source of truth to check claims against.

// Related resources