Token Doubling Effect
- Entity ID:
ent-20260410-d6e0425480cd - Type:
concept - Scope:
shared - Status:
active
Description
The hidden cost phenomenon where extractMemories fires a separate Opus API call after every turn, transmitting the entire conversation with different tool definitions and spawning a second independent cache chain. A 20-turn session with 650K context consumes ~26M tokens instead of ~13M. Invisible to users on flat-rate Max plan; devastating on API billing.
Key claims
- none yet
Relations
- none yet
Sources
none