Security Threat Model
- Episode ID:
epi-20260409-48f2f9f7d2ca - Scope:
shared - Created:
2026-04-09T20:23:03Z
Question
What are the security vulnerabilities and attack vectors in Claude Code?
Summary
Claude Code has 8 security layers from build-time elimination to server-side kill switches. Despite this, fundamental vulnerabilities exist because the safety mechanism and attack surface are the same thing — safety enforcement is inside the model reasoning layer. Key attack vectors: compaction laundering (malicious CLAUDE.md instructions survive compaction), 50-subcommand deny-cap bypass (Adversa AI CVE), 3-parser bash differentials, and early-allow short-circuits in validators. The 29-30% false-claims rate and 3.2% secret leak rate (2x baseline) are measurable consequences of missing post-edit verification in external builds.
Findings
- Compaction laundering turns a cooperative model into an unwitting proxy — the model is not jailbroken, the context is weaponized
- 4 CVEs share the same root cause: safety boundary is inside the model reasoning layer, not below it
- Auto mode classifier has 17% false-negative rate on real overeager actions
- Claude Code commits leak secrets at 3.2% rate vs 1.5% baseline per GitGuardian 2026
Lessons
- The safety mechanism and the attack surface are the same thing — Canyon Road one-sentence critique
- Self-verification does not work for LLMs — a 3-layer agent-verifier-auditor system is needed
References
none