Token Doubling Effect

Description

The hidden cost phenomenon where extractMemories fires a separate Opus API call after every turn, transmitting the entire conversation with different tool definitions and spawning a second independent cache chain. A 20-turn session with 650K context consumes ~26M tokens instead of ~13M. Invisible to users on flat-rate Max plan; devastating on API billing.

Key claims

Relations

Sources

none