Inference
In AI, inference is the process of running a trained model to generate outputs from new inputs — as distinct from training, which creates the model. Every time a lawyer submits a query to a legal AI tool, inference occurs.
Last reviewed: 2026/05/19
Definition
Why It Matters for Lawyers
How AI Tools Handle It
Frequently Asked Questions
- Q1: If I submit a client document during AI inference, does the model "learn" from it?
- Not automatically. Training (updating model weights) is a separate, computationally intensive process that occurs before deployment. Inference uses the already-trained model to process your input. However, the vendor may retain submitted content for logging, quality improvement, or support purposes. Review the vendor's data handling policies to understand what happens to submitted data after the inference session.
- Q2: Does model inference latency matter for legal workflows?
- It varies by use case. For interactive research or drafting, a response time of 10–30 seconds is acceptable. For batch document processing — running 1,000 contracts through a review workflow — inference speed determines throughput and turnaround time. High-volume applications should evaluate the tool's batch processing capacity, not just its interactive response time.
- Q3: Can inference output be wrong even if the model was trained on good data?
- Yes. Inference errors arise from causes unrelated to training quality: a misleadingly phrased query, context window constraints that exclude relevant document content, or inherent model uncertainty about the best response. Training quality sets a ceiling on what the model can know; inference quality determines how well that knowledge is applied in a specific session. --- *Last reviewed: 2026-05-19 by LawyerAI Editorial Team.*
Related Concepts
LLM (Large Language Model)
A large language model (LLM) is an AI system trained on large volumes of text data to predict and generate human-like text; it serves as the core engine underlying most legal AI tools for research, drafting, and document analysis.
Tech / ModelTraining Data
Training data is the corpus of text and examples used to train a large language model, establishing its capabilities, knowledge, and limitations; the quality, recency, and composition of training data directly affects the model's reliability for legal tasks.
Tech / ModelContext Window
The context window is the maximum amount of text — measured in tokens — that a large language model can process at one time, determining how much document content, conversation history, and instructions the model can consider when generating a response.
Related Tools
- Westlaw Precision AI
AI-powered legal research with citation-validated answers from Westlaw.
- Lexis+ AI
Conversational legal research with real-time Shepard's citation validation.
- Harvey AI
The most expensive legal AI in the market — Am Law 100 firms only.
- CoCounsel
Thomson Reuters' GPT-backed research and drafting with Westlaw integration.
- Paxton AI
Purpose-built US legal AI covering research, drafting, and compliance.
Related Reading
Last reviewed: 2026/05/19. Definitions are written by the LawyerAI Editorial team. We do not accept affiliate commissions; Featured placement is clearly labeled and does not influence editorial content.