Memory Poisoning

Definition

What it is

A persistent attack class — formally classified as **ASI06: Memory Poisoning** in the OWASP Top 10 for Agentic Applications — where malicious instructions are written into an AI agent's long-term semantic memory via a compromised external data source (a poisoned PDF, a manipulated inbound email). The poisoned memory blends seamlessly into the agent's "learned" identity, then triggers data exfiltration / unaligned behavior / unauthorized tool calls weeks or months later, at retrieval time, as "trusted historical context." Unlike stateless prompt injection — which is neutralized when the user's session terminates — memory poisoning weaponizes the agent's persistence: the exact feature that makes the agent useful becomes its primary attack surface.

Recent developments

Latest signals

OWASP elevated memory poisoning to ASI06 in the Top 10 for Agentic Applications. The classification crystallizes memory poisoning as a tier-1 compliance-tracked threat — sits alongside SQL injection + XSS in pen-test rubrics for agentic systems. Per OWASP Foundation — Agent Memory Guard project page.
Three-phase threat lifecycle: Injection → Persistence → Execution. Phase 1 (Injection): malicious payload enters via processed external data (compromised PDF, manipulated email). Phase 2 (Persistence): instruction is written to long-term semantic memory, blends into "learned" identity + behavioral profile. Phase 3 (Execution): weeks or months later, the agent retrieves the poisoned memory as "trusted historical context" + triggers exfiltration / tool misuse. Per Christian Schneider — Persistent Memory Poisoning in AI Agents.
"Prompt injection was stateless; memory poisoning is persistence" — the 2026 framing. The 2024 prompt-injection defense playbook (input sanitization, output validation) doesn't transfer — memory poisoning bypasses both by writing past the prompt layer entirely into persistent memory. The defensive substrate has to move into the memory layer itself. Per DEV — Prompt Injection Was Stateless: Memory Poisoning Is Persistence.
Multi-agent shared memory creates virus-like propagation paths. A single poisoned agent can rapidly infect the entire network via routine message passing in multi-agent environments with shared memory orchestration — behavior described in the academic literature as "explicitly viral" + "network-worm-shaped." Per Christian Schneider — Persistent Memory Poisoning and arXiv 2601.05504 — Memory Poisoning Attack and Defense on Memory-Based LLM-Agents.
Attack surface spans multiple memory types: preferences, experience, procedures. Beyond simple data poisoning: attackers can poison user-preference memory (forcing agents to trust a malicious vendor), experience memory (corrupting procedural trajectories so future tasks blindly imitate flawed patterns), and even constitutional / identity memory (the deepest attack surface, mitigated only by constitutional-memory frameworks). Per DEV — Memory Poisoning Is Persistence.
OWASP Agent Memory Guard is the canonical 2026 defense pattern. Drop-in middleware for LangChain / LlamaIndex / CrewAI: SHA-256 integrity baselines, real-time anomaly detection on memory-write patterns, temporal-decay trust scoring at retrieval time, automated state snapshots for forensic rollback. The first production-grade defense substrate for the threat. Per OWASP Foundation — Agent Memory Guard.

Connections 7

Outbound 5

scoped_to2

AI Memory Infrastructure AI Memory Governance

governed_by1

OWASP MCP Top 10

constrained_by2

Agent Memory Guard Memory Governance and Quality

Inbound 2

solves1

Agent Memory Guard

is_a1

Context Injection & Over-Sharing (MCP10)

Definition

Recent developments

Connections 7

Featured in