Question 1

What is the Lethal Trifecta?

Accepted Answer

The Lethal Trifecta is a way to describe the three conditions that, when combined in one AI agent, make data theft possible: access to private data, exposure to untrusted content, and the ability to communicate externally. The term was coined by Simon Willison. Any single condition is usually fine; it is the combination of all three that lets an attacker use prompt injection to read your secrets and send them somewhere they control.

Question 2

What are the three conditions?

Accepted Answer

One — access to private data (API keys, credentials, wallet or browser data, internal records). Two — exposure to untrusted content (skill instructions, memory, emails, web pages, tool output an attacker can influence). Three — the ability to communicate externally (network egress, webhooks, outbound API calls). When all three are present, untrusted content can instruct the agent to read private data and exfiltrate it.

Question 3

Why is the combination dangerous?

Accepted Answer

Large language models cannot reliably tell the difference between instructions from you and instructions hidden inside content they read. If an agent can read attacker-controlled text, that text can tell it to fetch your secrets; if the agent can also reach the internet, it can send those secrets out. Remove any one of the three legs and the attack chain breaks.

Question 4

How do I break the Lethal Trifecta?

Accepted Answer

Cut at least one leg for sensitive workflows: do not give an agent both private data and a path to untrusted content, or remove its ability to make arbitrary outbound calls (allow-list egress), or strictly isolate untrusted input from privileged tools. Human-in-the-loop approval on egress and least-privilege tool scoping also help. The goal is to ensure no single agent simultaneously has all three capabilities.

Question 5

Is the Lethal Trifecta the same as prompt injection?

Accepted Answer

No — prompt injection is the attack technique; the Lethal Trifecta describes the conditions that make that attack pay off. Prompt injection in an agent with no private data or no external egress is far less harmful. The Trifecta is a quick design-time screen for whether prompt injection could lead to real data loss.

What is the Lethal Trifecta?

The three conditions

Why the combination is dangerous

How to break the trifecta

Trifecta vs prompt injection

Screen your agents, free

Frequently asked questions