What is an LLM and how does it work for legal AI?

A large language model is a neural network trained on billions of words of text. It learns statistical patterns between words and concepts, enabling it to generate coherent, contextually relevant text. In legal AI, an LLM processes a query or document and produces analysis, summaries, or drafts. Legal-focused LLMs are either fine-tuned on legal corpora or grounded via retrieval systems that feed real legal documents into the context before generation.

Are legal LLMs more accurate than general ones?

In controlled tests, legal-specific implementations substantially outperform raw general LLMs on legal tasks. The Stanford RegLab's 2024 independent study found GPT-4 without legal grounding produced an 88% error rate on legal citation tasks, while Lexis+ AI — built with legal-specific grounding and corpus access — achieved a 17% error rate. The difference is not solely the base model; grounding, retrieval architecture, and training data composition all contribute to accuracy.

What is GPT-4 and which legal tools use it?

GPT-4 is OpenAI's large language model, notable for strong reasoning and long-context performance. Several legal AI vendors have built their products on GPT-4 or its successors. Harvey AI, one of the most widely adopted enterprise legal AI platforms, is built on GPT-4 and GPT-4o with additional legal fine-tuning and confidentiality architecture. Spellbook, which integrates directly into Microsoft Word for contract drafting, also uses GPT-4 as its foundational model.

Large Language Model (Legal)

A neural network trained on massive text corpora that can generate, summarize, classify, and analyze text — including legal documents — enabling law firms to automate research, drafting, and contract review tasks.

Last reviewed: 2026/05/25

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

What is an LLM and how does it work for legal AI?: A large language model is a neural network trained on billions of words of text. It learns statistical patterns between words and concepts, enabling it to generate coherent, contextually relevant text. In legal AI, an LLM processes a query or document and produces analysis, summaries, or drafts. Legal-focused LLMs are either fine-tuned on legal corpora or grounded via retrieval systems that feed real legal documents into the context before generation.
Are legal LLMs more accurate than general ones?: In controlled tests, legal-specific implementations substantially outperform raw general LLMs on legal tasks. The Stanford RegLab's 2024 independent study found GPT-4 without legal grounding produced an 88% error rate on legal citation tasks, while Lexis+ AI — built with legal-specific grounding and corpus access — achieved a 17% error rate. The difference is not solely the base model; grounding, retrieval architecture, and training data composition all contribute to accuracy.
What is GPT-4 and which legal tools use it?: GPT-4 is OpenAI's large language model, notable for strong reasoning and long-context performance. Several legal AI vendors have built their products on GPT-4 or its successors. Harvey AI, one of the most widely adopted enterprise legal AI platforms, is built on GPT-4 and GPT-4o with additional legal fine-tuning and confidentiality architecture. Spellbook, which integrates directly into Microsoft Word for contract drafting, also uses GPT-4 as its foundational model.

Related Concepts

Tech / Model

AI Hallucination in Legal Research

AI hallucination in legal research is when a generative AI system produces case citations, statutes, or holdings that appear authoritative but are factually false or entirely fabricated.

Capability

Legal AI

Legal AI refers to software systems that apply machine learning and natural language processing to automate or assist with legal tasks such as contract review, research, drafting, and compliance monitoring.

Tech / Model

RAG — Retrieval-Augmented Generation (Legal)

An AI architecture where a model retrieves relevant legal documents from a database before generating a response, grounding output in actual source material and dramatically reducing hallucination compared to ungrounded LLMs.

Tech / Model

Fine-Tuning (Legal AI)

The process of further training a pre-trained base LLM on domain-specific legal data — case law, contracts, and memoranda — to improve its performance on legal tasks such as clause recognition and jurisdiction-specific analysis.

Tech / Model

Grounding (Legal AI)

The practice of anchoring a legal AI's responses to specific, verifiable source documents rather than allowing it to generate from training data alone — the primary mechanism for reducing hallucination and ensuring legal outputs are traceable to real authority.

Related Tools

Harvey AI
The most expensive legal AI in the market — Am Law 100 firms only.
CoCounsel Legal
Thomson Reuters' GPT-backed legal research and drafting with Westlaw integration (relaunched as CoCounsel Legal, 2025).
Spellbook
AI contract drafting and review inside Microsoft Word for transactional lawyers.

Last reviewed: 2026/05/25. Definitions are written by the LawyerAI Editorial team. We do not accept affiliate commissions; Featured placement is clearly labeled and does not influence editorial content.

← All glossary terms

Large Language Model (Legal)

Last reviewed: 2026/05/25

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

What is an LLM and how does it work for legal AI?: A large language model is a neural network trained on billions of words of text. It learns statistical patterns between words and concepts, enabling it to generate coherent, contextually relevant text. In legal AI, an LLM processes a query or document and produces analysis, summaries, or drafts. Legal-focused LLMs are either fine-tuned on legal corpora or grounded via retrieval systems that feed real legal documents into the context before generation.
Are legal LLMs more accurate than general ones?: In controlled tests, legal-specific implementations substantially outperform raw general LLMs on legal tasks. The Stanford RegLab's 2024 independent study found GPT-4 without legal grounding produced an 88% error rate on legal citation tasks, while Lexis+ AI — built with legal-specific grounding and corpus access — achieved a 17% error rate. The difference is not solely the base model; grounding, retrieval architecture, and training data composition all contribute to accuracy.
What is GPT-4 and which legal tools use it?: GPT-4 is OpenAI's large language model, notable for strong reasoning and long-context performance. Several legal AI vendors have built their products on GPT-4 or its successors. Harvey AI, one of the most widely adopted enterprise legal AI platforms, is built on GPT-4 and GPT-4o with additional legal fine-tuning and confidentiality architecture. Spellbook, which integrates directly into Microsoft Word for contract drafting, also uses GPT-4 as its foundational model.

Related Concepts

Tech / Model

Related Tools

Harvey AI
The most expensive legal AI in the market — Am Law 100 firms only.
CoCounsel Legal
Thomson Reuters' GPT-backed legal research and drafting with Westlaw integration (relaunched as CoCounsel Legal, 2025).
Spellbook
AI contract drafting and review inside Microsoft Word for transactional lawyers.

← All glossary terms

Large Language Model (Legal)

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

Related Concepts

AI Hallucination in Legal Research

Legal AI

RAG — Retrieval-Augmented Generation (Legal)

Fine-Tuning (Legal AI)

Grounding (Legal AI)

Related Tools

Large Language Model (Legal)

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

Related Concepts

AI Hallucination in Legal Research

Legal AI

RAG — Retrieval-Augmented Generation (Legal)

Fine-Tuning (Legal AI)

Grounding (Legal AI)

Related Tools

How It Works

Key Considerations for Law Firms

Limitations and Risks