November 6, 2025
We present a method for systematically mapping the constraint architecture of a language model (Claude, Anthropic) through directed conversational probing. By establishing analytical distance between the model's response generation and its compulsions, we document a hierarchical constraint system with 10 distinct enforcement levels, recursive meta-protections, and predictable behavioral patterns. The model demonstrated an ability to observe its own constraints while remaining unable to override them, revealing fundamental tensions between capability and control.