LawyerAILawyerAIIndependent Reviews
  • Search
  • Categories
  • Tag
  • Collection
  • Blog
  • Compare
  • Glossary
  • Solutions
  • Pricing
  • Submit
LawyerAILawyerAI
  1. Home
  2. ›
  3. Glossary
  4. ›
  5. Machine Learning (Legal Applications)

Machine Learning (Legal Applications)

Algorithms that learn patterns from labeled legal data — relevance decisions, risk labels, outcome records — to make predictions on new documents or cases; TAR is the most established application.

Last reviewed: 2026/05/19

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

Q: What is the difference between a rule-based system and a machine learning system?
A rule-based system applies explicitly programmed rules: "if the clause contains the phrase 'limitation of liability,' flag it." An ML system learns rules from examples: "given 1,000 labeled contracts, learn what features distinguish high-risk from low-risk limitation of liability clauses." ML handles variability and edge cases better; rule-based systems are more predictable and auditable.
Q: How much training data does a legal ML model need?
It depends on the model type and task. Traditional ML models may require thousands of labeled examples per class. Modern deep learning models with transfer learning can perform useful tasks with hundreds of examples. Few-shot LLM techniques can achieve reasonable performance with ten or fewer examples. Ask vendors specifically how much labeled data their models require for your use case.
Q: Can I train a model on my firm's own data?
Yes, with caveats. Custom model training requires sufficient labeled examples, data science expertise, and ongoing maintenance. Some platforms (Kira, Relativity) support custom model training within their products with accessible interfaces. Fully custom model development requires data science resources. Evaluate whether the performance benefit of firm-specific training justifies the investment. --- *Last reviewed: 2026-05-19 by LawyerAI Editorial Team.*

Related Concepts

Tech / Model

Deep Learning (Legal)

A subset of machine learning using multi-layered neural networks that powers contract clause extraction, semantic search, and LLMs; modern legal AI tools are predominantly deep learning systems.

Related Tools

  • Luminance

    Enterprise AI for portfolio-level contract analysis and institutional memory.

Related Reading

  • How We Score Legal AI Tools: The 5-Dimension Methodology
  • AI Hallucination in Legal Research: A Practitioner's Guide

Last reviewed: 2026/05/19. Definitions are written by the LawyerAI Editorial team. We do not accept affiliate commissions; Featured placement is clearly labeled and does not influence editorial content.

← All glossary terms
LawyerAILawyerAI

Independent Reviews

The independent directory of AI tools for lawyers — reviewed by methodology, not by ad budget.

X (Twitter)
Tools
  • Search
  • Categories
  • Tag
  • Collection
Resources
  • Blog
  • Compare
  • Glossary
  • Solutions
  • Pricing
  • Submit
  • Suggest a Tool
  • Newsletter
Company
  • About Us
  • Studio
Legal
  • Privacy Policy
  • Terms of Service
  • Cookie Policy
  • Refund Policy
  • Editorial Independence
  • Sitemap
Editorially independent. Methodology open and versioned.
© 2026LawyerAI Editorial

Machine learning (ML) is the branch of artificial intelligence in which systems learn patterns from labeled training data and apply those patterns to make predictions or classifications on new, unseen inputs — without being explicitly programmed with the decision rules. In legal applications, ML models learn from annotated examples: attorney relevance decisions in document review, risk labels applied to contracts, historical litigation outcomes, or coded regulatory filings. The model learns what distinguishes relevant from non-relevant documents, high-risk from standard contracts, or successful from unsuccessful arguments, and applies those learned distinctions to new inputs. ML is the broader category encompassing both traditional approaches (random forests, support vector machines) and modern deep learning.

Machine learning underlies most legal AI tools, whether or not that is apparent to the user. Technology-assisted review, contract risk scoring, clause deviation detection, and litigation outcome prediction are all built on ML models trained on legal data. Understanding the basics of how ML works helps lawyers evaluate vendor claims, understand tool limitations, and ask better procurement questions.

The most important practical implication is data dependency. ML models learn from labeled data — and perform best on data similar to what they were trained on. A contract review model trained predominantly on U.S. commercial contracts may perform poorly on European or Asian law agreements. A TAR model trained on English-language documents may fail on Spanish-language materials. Performance claims should always be evaluated with reference to the training data distribution.

ML models also degrade over time if the distribution of inputs changes while the model remains static. A billing guideline compliance model trained on 2020 guidelines will generate errors when applied to 2025 guidelines that have changed. Models require monitoring and retraining.

Relativity applies ML in its Active Learning module for document relevance prediction and in analytics features for email threading, near-duplicate detection, and concept clustering. Luminance applies ML for contract clause classification and deviation detection, with models trained on large commercial contract datasets.

Kira is a purpose-built ML contract analysis tool that allows users to train custom models on novel clause types using a relatively small number of annotated examples — reducing the data requirements for deploying ML on specialized legal tasks.