Bimodal Token Distribution (Pre/Post v2.1.100)

Description

Proxy-verified measurement across 40+ sessions shows two distinct clusters of per-request cache_creation_input_tokens: ~50K tokens for v2.1.98 and earlier, ~71K tokens for v2.1.100+. The gap — ~20K tokens — is the phantom injection. Clean bimodal separation confirmed the effect was a step-function version change, not a gradual configuration drift.

Key claims

Relations

Sources

src-20260423-542f02260352