Human Decision Authority (Value)

Description

First of the paper's five motivating values. Humans retain ultimate decision authority, organized through a principal hierarchy (Anthropic > operators > users) that formalizes who holds authority over what. The response to the finding that users approve 93% of permission prompts was not more warnings but to restructure the problem via defined boundaries (sandboxing, auto-mode classifiers). Motivates deny-first defaults, graduated trust, append-only state, externalized policy, and values-over-rules.

Key claims

Relations

Sources

src-20260423-0cff68d3291b