Skip to main content
New: Matter-aware research now ships with jurisdiction filtering. Learn more
AI & Law|April 15, 2026|3 min read

Why 36% of AI-Generated Legal Citations Are Wrong

Analysis of AI hallucination rates in legal research and how CaseRead.ai's Dual-RAG architecture eliminates fabricated citations.

AI hallucinationslegal citationslegal researchChatGPT

The Problem Nobody Wants to Talk About

A Stanford and Yale study found that 36% of citations generated by GPT-4 are completely fabricated. They look real — plausible case names, proper citation formats, convincing holdings — but the cases don't exist.

This isn't a theoretical risk. As of 2026, there are now 729+ documented court cases where attorneys submitted AI-generated citations that turned out to be fake.

Real Consequences

The most famous example is Mata v. Avianca (2023), where a New York attorney submitted a brief containing six fabricated cases generated by ChatGPT. The attorney was sanctioned, publicly embarrassed, and the case became a cautionary tale covered by every legal publication.

But it keeps happening. Why?

Why Large Language Models Hallucinate

ChatGPT, Claude, Gemini — they're all language models, not databases. When you ask for a citation, they don't look it up in a case law database. They generate text that looks like a citation based on patterns they learned during training.

The model predicts: "A case about landlord-tenant disputes in Utah would probably be cited as something like Smith v. Johnson, 2019 UT App 47." It sounds real. It follows the right format. But it's fiction.

The Architecture Gap

The fundamental problem is that general-purpose AI tools skip the retrieval step. They go straight from question to answer, generating everything — including citations — from patterns.

Retrieval-Augmented Generation (RAG) solves this by adding a retrieval step before generation:

  1. Your question is converted to an embedding (a mathematical representation)
  2. That embedding is matched against a real database of case law and statutes
  3. The AI generates its answer using only the real sources it found

Every citation in the response is a real case that exists in the database. The AI can't hallucinate a citation because it's not generating citations — it's citing documents it actually retrieved.

Dual-RAG: Searching Two Databases Simultaneously

CaseRead.ai takes RAG a step further with Dual-RAG — simultaneously searching:

  • Public case law and statutes (federal and state databases)
  • Your firm's private documents (briefs, memos, precedents in your secure vault)

This means a single research query can cite both binding precedent and your firm's winning arguments from a similar case three years ago.

What This Means for Your Practice

If you're using ChatGPT or any general-purpose AI for legal research, every citation needs manual verification. That defeats the purpose of using AI to save time.

With a RAG-based system like CaseRead, citations are verified at the architecture level. You still exercise professional judgment about which cases to cite, but you don't have to worry about whether the cases actually exist.

Try It Yourself

Search any legal question on CaseRead.ai. Click any citation. It's real. Every time.

Start free — no credit card required.

CaseRead

CaseRead Team

AI-powered legal research built for practicing attorneys.

Ready to try AI-powered legal research?

Free to start. No credit card required.

Start Free