Tokenizer-Effort-Cache Cost-Multiplier Model
- Entity ID: ent-20260423-r30a000000012
- Type: concept
- Scope: private
- Status: active
Description
An analytical framing (Finout, Vellum) that treats effective Opus 4.7 cost as the product of three multipliers: tokenizer overhead (1.0-1.35x), effort allocation (xhigh uses ~2x the thinking tokens of high), and cache behavior (cache-friendly vs. cache-hostile prompts). Heavy auto-mode xhigh usage with poor caching compounds to 2-3x the cost of 4.6 high with stable prompts. The model turns the leak's insights about SYSTEM_PROMPT_DYNAMIC_BOUNDARY and the compression stages into cost-control levers.
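The compounding described here can be sketched in a few lines of Python. This is a minimal illustration, not the Finout or Vellum formulation: the class name `CostMultipliers` and its fields are hypothetical, and the `effort` and `cache` values in the example are placeholder assumptions, since the description gives only the tokenizer range and the ~2x thinking-token figure, not a blended cost factor or a numeric cache penalty.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CostMultipliers:
    """Hypothetical container for the three multipliers (names are illustrative)."""

    tokenizer: float  # 1.0-1.35 per the description
    effort: float     # cost relative to "high"; blended value is an assumption
    cache: float      # 1.0 = cache-friendly baseline; penalty value is an assumption

    def effective(self) -> float:
        # The model treats effective cost as a plain product of the three factors.
        return self.tokenizer * self.effort * self.cache


# Baseline: 4.6 high with stable, cache-friendly prompts.
baseline = CostMultipliers(tokenizer=1.0, effort=1.0, cache=1.0)

# Worst case: top of the tokenizer range, xhigh effort, poor caching.
# effort=1.6 assumes ~2x thinking tokens dilutes to ~1.6x total spend;
# cache=1.2 is a placeholder penalty. Neither number comes from the source.
worst_case = CostMultipliers(tokenizer=1.35, effort=1.6, cache=1.2)

ratio = worst_case.effective() / baseline.effective()
print(f"worst case vs baseline: {ratio:.2f}x")
```

With these placeholder values the product falls inside the 2-3x band claimed above. Because the factors multiply, lowering any one of them (for example, restoring cache-friendly prompt structure) reduces the total proportionally.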
Key claims
- xhigh effort + 1.35x tokenizer overhead + poor caching = 2-3x the cost of 4.6 high
- The leak's insights make prompt structure a cost-control lever
Relations
- Tokenizer-Effort-Cache Cost-Multiplier Model --[depends_on]--> Opus 4.7 Tokenizer
- Tokenizer-Effort-Cache Cost-Multiplier Model --[depends_on]--> xhigh Effort Level
- Tokenizer-Effort-Cache Cost-Multiplier Model --[informs]--> SYSTEM_PROMPT_DYNAMIC_BOUNDARY