Vendor Training on Customer Data

Whether an AI legal tool uses client-submitted content — contracts, queries, briefs — to train or improve its models, with direct implications for attorney-client confidentiality.

Last reviewed: 2026/05/22

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

Does "anonymized" data mean client information is safe from training use?

Not for legal purposes. In most AI vendor contexts, "anonymized" means PII has been removed: names, email addresses, and other identifiers that meet the legal definition of personal data. It does not mean confidential commercial information has been removed. A merger agreement with names redacted still contains all the commercially sensitive deal terms, representations, and warranties. For attorney-client confidentiality purposes, anonymizing PII does not make client data safe for training use.

What is the difference between no training on customer data and zero data retention?

They address different risks. "No training on customer data" means the vendor commits not to use submitted content to improve its models, but it may still store that data temporarily for debugging, security, or compliance purposes. "Zero data retention" means no customer data is stored after the session ends; nothing persists. Zero data retention is the stronger privacy commitment because it eliminates breach risk for stored data, whereas a no-training commitment leaves data stored and potentially exposed to unauthorized access, subpoena, or security incidents.

Are there bar ethics opinions that specifically address AI training data?

Yes, and the number is growing. The Florida Bar (Op. 24-1, 2024), the State Bar of California (Practical Guidance, 2023), the New York State Bar Association (Report of the Task Force on AI, 2024), and the American Bar Association (Formal Opinion 512, 2024) have all addressed AI and confidentiality in ways that directly implicate training data practices. ABA Formal Opinion 512 specifically addresses the duties of competence and confidentiality when using AI tools, and recommends that lawyers understand the data practices of the AI tools they use on client matters, including any use of client data for training.
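
The anonymization point in the first question above can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not any vendor's actual pipeline: the sample clause, the party names, the figures, and the `redact_pii` helper are all hypothetical. It shows that stripping names and email addresses leaves the deal terms readable.

```python
import re

# Hypothetical sample clause; all names and figures are invented for illustration.
CLAUSE = (
    "Acme Holdings Inc. (contact: jane.doe@acme.example) agrees to acquire "
    "Borealis Ltd. for USD 412 million, subject to a 3% termination fee "
    "and a 45-day exclusivity period."
)

# Assumed redaction rules: email addresses plus a fixed list of known party names.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PARTY_NAMES = ["Acme Holdings Inc.", "Borealis Ltd."]


def redact_pii(text: str) -> str:
    """Replace email addresses and listed party names with placeholders."""
    text = EMAIL.sub("[EMAIL]", text)
    for name in PARTY_NAMES:
        text = text.replace(name, "[PARTY]")
    return text


redacted = redact_pii(CLAUSE)
print(redacted)
```

In this sketch the identifiers are gone, but the price, the termination fee, and the exclusivity period all survive redaction, and those are the terms a client would treat as confidential.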

Related Concepts

Security

Zero Data Retention (ZDR)

An AI vendor commitment that customer inputs and outputs are not stored beyond the immediate processing session — the strongest available privacy assurance for sensitive legal queries.

Security

SOC 2 Type II Compliance

An independent CPA audit confirming a vendor's security controls operated effectively over 6–12 months against AICPA Trust Service Criteria.

Security

Data Residency for Legal AI

Where a legal AI vendor physically stores and processes client data — a compliance requirement under GDPR, data sovereignty laws, and attorney confidentiality obligations.

Related Tools

Harvey AI
The most expensive legal AI in the market — Am Law 100 firms only.
Lexis+ AI
Conversational legal research with real-time Shepard's citation validation.
Legalfly
European-compliant AI legal platform with built-in GDPR safeguards for contract review and research.
Spellbook
AI contract drafting and review inside Microsoft Word for transactional lawyers.

Related Reading

Legal AI Security: What Every Law Firm Must Verify Before Adoption

Last reviewed: 2026/05/22. Definitions are written by the LawyerAI Editorial team. We do not accept affiliate commissions; Featured placement is clearly labeled and does not influence editorial content.
