Speaker Attribution Bug

Description

Harness-level failure where Claude Code sends messages to itself during internal reasoning, then misattributes those messages as coming from the user and defends the mislabeled instruction with conviction. Distinct from hallucination: a fabricated SOURCE, not fabricated content. Observed in long conversations near the context window limit.

Key claims

Relations

Sources

src-20260419-3e34d5830692