Thinking-Signature Resume Tax

Description

Quantified tax on long extended-thinking sessions (GitHub #42260): when resuming, encrypted thinking-block signatures from prior turns are replayed as input tokens. 480-message / 33-turn session measured ~156K total resume tokens of which ~38,800 (~25%) were thinking signatures. 54 thinking blocks averaged 3,835 chars per signature, max 13,184. The thinking text fields are empty strings (stripped); only the encrypted signature field survives, but it must be replayed for extended thinking to function. Worst-case resumption with phantom tokens + cold cache approaches ~191-196K tokens before the user types a character.

Key claims

Relations

Sources

src-20260423-542f02260352