You’re viewing the JavaScript-off version of this page (message previews and Quick Look turn JavaScript off). Each failure mode below shows its first card, with the rest behind a “Show the other three cards” toggle. If the toggle doesn’t respond in your viewer, open this file in a web browser — that’s also where the full tabbed experience lives.

Family A · #1–3 of 19

Drift

A1

Semantic Drift under Ambiguity

An instruction that sounds specific has several legitimate readings — and the model picks one silently.

A2

Semantic Drift under Clarity

The companion nobody warns you about: a perfectly clear spec still drifts on a long run.

A3

Schema Improvisation

Asked for a fixed shape, the model invents a slightly better one.

Family B · #4–7 of 19

Trust

B4

The Confident Fabrication

A number nobody computed, stated fluently.

B5

Evidence Surface Inconsistency

The story and the evidence beside it disagree — or no one can see whether they agree.

B6

The Sum That Doesn’t Add Up

Individually correct queries combine into nonsense. The join is where correctness dies.

B7

The Correct Number, Wrong Reading

The data is right. The sentence makes the reader compute something false.

Family C · #8–13 of 19

Orchestration

C8

The Hidden Failure Cascade

No layer proved correctness — and the deeper the orchestration, the harder to localize.

C9

The Silent Skip / Phase Hollowing

A step quietly dropped — output still looks complete. Success by appearance.

C10

The Lying Recorder

The telemetry itself silently fails. The run looks clean because nothing was watching.

C11

The Deaf Loop

A stop condition keyed to a signal that never fires. The loop looks patient; it’s actually deaf.

C12

The Courier Tax

Paying a language model to be a for-loop.

C13

The Orchestration-Shape Tradeoff

Inline drifts. Full fan-out goes blind. The shape is a per-step decision.

Family D · #14–16 of 19

Memory

D14

Context Dilution / The Monolith

Loading everything up front performs worse than loading nothing.

D15

The Compaction Cliff

Long sessions summarize themselves. Summaries round. Downstream builds on the rounding.

D16

The Re-Learned Lesson

A lesson you can’t find is a lesson you re-learn. Even the system you built to prevent this.

Family E · #17–19 of 19

Maturity

E17

Single-Model Blindness

A second pass from the same model is polish, not verification.

E18

The Polite Fiction

A declared capability nothing verifies is a lie waiting to ship.

E19

The Dead Safety Net

A gate built ahead of need rots into theater — and its presence implies false coverage.

External catalogs

Appendix

After writing this post, I got curious whether anyone else was cataloguing these failures, so I went looking. I had no prior familiarity with this research, but two maps stood out.

F1

MAST — the academic map (UC Berkeley)

Fourteen multi-agent failure modes, derived from 1,600+ annotated execution traces. arXiv March 2025; published at NeurIPS, December 2025.

F2

Microsoft’s red-team taxonomy — the adversarial map

Failure modes of agentic systems under attack, from the Microsoft AI Red Team. Whitepaper April 2025; updated June 4, 2026.

A few of the modes above I haven’t found on either map — the evidence-surface fork (B5), the corrupting join (B6), the correct number that reads false (B7). Not because they’re exotic: Berkeley measures benchmarks, Microsoft measures attack surfaces, and this catalog measures output that has to be right in front of a paying customer. It isn’t the exhaustive map either. It’s the map of where I’ve actually been.