Is prompt injection the same as an AI hallucination?

No. Hallucination refers to the AI generating plausible-sounding but factually incorrect information — a failure mode that originates within the model itself from the pattern generation process. Prompt injection is an external attack where adversarial content is deliberately introduced to manipulate the model's behavior. Both result in incorrect or unreliable AI output, but they have different causes and require different responses. Hallucination is managed through output verification and citation checking. Injection is managed through workflow controls on what documents the AI processes and how its outputs are validated when adversarial content may be present.

Can formatting a document in a specific way protect against injection attacks?

Document formatting controls can reduce the injection surface but cannot eliminate it. Converting documents to plain text before AI processing removes some injection vectors (formatted hidden text, metadata, comments) but not others (text that is semantically injection-like but appears in the document body). Rendering documents in a controlled reading environment before AI processing, so that only the rendered visible text is submitted to the model, is a partial mitigation. No formatting approach provides complete protection against a sophisticated adversary who crafts injection instructions designed to blend with legitimate document content.

Should lawyers disclose to clients that AI was used to review adversarial-party documents?

Client disclosure requirements for AI use are evolving across US jurisdictions. Several state bar ethics opinions issued in 2024–2025 recommend or require disclosure when AI is used in a material way in client representation. Whether prompt injection risk rises to the level of a material disclosure depends on the specific jurisdiction, the nature of the matter, and the sensitivity of the documents processed. In the absence of clear guidance, erring toward disclosure — particularly in matters involving significant adversarial document review — is consistent with duties of communication under ABA Model Rule 1.4.

Prompt Injection Attacks

Adversarial instructions embedded in user input or external documents that manipulate an AI system to override its intended behavior or bypass safety constraints.

Last reviewed: 2026/05/22

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

Is prompt injection the same as an AI hallucination?: No. Hallucination refers to the AI generating plausible-sounding but factually incorrect information — a failure mode that originates within the model itself from the pattern generation process. Prompt injection is an external attack where adversarial content is deliberately introduced to manipulate the model's behavior. Both result in incorrect or unreliable AI output, but they have different causes and require different responses. Hallucination is managed through output verification and citation checking. Injection is managed through workflow controls on what documents the AI processes and how its outputs are validated when adversarial content may be present.
Can formatting a document in a specific way protect against injection attacks?: Document formatting controls can reduce the injection surface but cannot eliminate it. Converting documents to plain text before AI processing removes some injection vectors (formatted hidden text, metadata, comments) but not others (text that is semantically injection-like but appears in the document body). Rendering documents in a controlled reading environment before AI processing, so that only the rendered visible text is submitted to the model, is a partial mitigation. No formatting approach provides complete protection against a sophisticated adversary who crafts injection instructions designed to blend with legitimate document content.
Should lawyers disclose to clients that AI was used to review adversarial-party documents?: Client disclosure requirements for AI use are evolving across US jurisdictions. Several state bar ethics opinions issued in 2024–2025 recommend or require disclosure when AI is used in a material way in client representation. Whether prompt injection risk rises to the level of a material disclosure depends on the specific jurisdiction, the nature of the matter, and the sensitivity of the documents processed. In the absence of clear guidance, erring toward disclosure — particularly in matters involving significant adversarial document review — is consistent with duties of communication under ABA Model Rule 1.4.

Related Concepts

Tech / Model

AI Hallucination in Legal Research

AI hallucination in legal research is when a generative AI system produces case citations, statutes, or holdings that appear authoritative but are factually false or entirely fabricated.

Security

Vendor Training on Customer Data

Whether an AI legal tool uses client-submitted content — contracts, queries, briefs — to train or improve its models, with direct implications for attorney-client confidentiality.

Security

Zero Data Retention (ZDR)

An AI vendor commitment that customer inputs and outputs are not stored beyond the immediate processing session — the strongest available privacy assurance for sensitive legal queries.

Related Tools

Harvey AI
The most expensive legal AI in the market — Am Law 100 firms only.
Lexis+ AI
Conversational legal research with real-time Shepard's citation validation.
Legalfly
European-compliant AI legal platform with built-in GDPR safeguards for contract review and research.

Prompt Injection Attacks

Adversarial instructions embedded in user input or external documents that manipulate an AI system to override its intended behavior or bypass safety constraints.

Last reviewed: 2026/05/22

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

Is prompt injection the same as an AI hallucination?: No. Hallucination refers to the AI generating plausible-sounding but factually incorrect information — a failure mode that originates within the model itself from the pattern generation process. Prompt injection is an external attack where adversarial content is deliberately introduced to manipulate the model's behavior. Both result in incorrect or unreliable AI output, but they have different causes and require different responses. Hallucination is managed through output verification and citation checking. Injection is managed through workflow controls on what documents the AI processes and how its outputs are validated when adversarial content may be present.
Can formatting a document in a specific way protect against injection attacks?: Document formatting controls can reduce the injection surface but cannot eliminate it. Converting documents to plain text before AI processing removes some injection vectors (formatted hidden text, metadata, comments) but not others (text that is semantically injection-like but appears in the document body). Rendering documents in a controlled reading environment before AI processing, so that only the rendered visible text is submitted to the model, is a partial mitigation. No formatting approach provides complete protection against a sophisticated adversary who crafts injection instructions designed to blend with legitimate document content.
Should lawyers disclose to clients that AI was used to review adversarial-party documents?: Client disclosure requirements for AI use are evolving across US jurisdictions. Several state bar ethics opinions issued in 2024–2025 recommend or require disclosure when AI is used in a material way in client representation. Whether prompt injection risk rises to the level of a material disclosure depends on the specific jurisdiction, the nature of the matter, and the sensitivity of the documents processed. In the absence of clear guidance, erring toward disclosure — particularly in matters involving significant adversarial document review — is consistent with duties of communication under ABA Model Rule 1.4.

Related Concepts

Tech / Model

Related Tools

Harvey AI
The most expensive legal AI in the market — Am Law 100 firms only.
Lexis+ AI
Conversational legal research with real-time Shepard's citation validation.
Legalfly
European-compliant AI legal platform with built-in GDPR safeguards for contract review and research.

Prompt Injection Attacks

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

Related Concepts

AI Hallucination in Legal Research

Vendor Training on Customer Data

Zero Data Retention (ZDR)

Related Tools

Related Reading

Prompt Injection Attacks

Definition

Why It Matters for Lawyers

How AI Tools Handle It

Frequently Asked Questions

Related Concepts

AI Hallucination in Legal Research

Vendor Training on Customer Data

Zero Data Retention (ZDR)

Related Tools

Related Reading

How It Works (Technical)

How Legal AI Vendors Address It

How Lawyers Should Verify and Apply It