comprehension debt

background

the (known) origins of the term "comprehension debt"

Jeremy Twei coined the perfect term for this: comprehension debt. It’s certainly tempting to just move on when the LLM one-shotted something that seems to work. This is the insidious part. The agent doesn’t get tired. It will sprint through implementation after implementation with unwavering confidence. The code looks plausible. The tests pass (or seem to). You’re under pressure to ship. You move on.

symptoms

volume & verbosity inflation; PRs grow in size, making diffs harder to review & reason about.
shift from “author knows” to “author curated”; folks merge code they steered but didn’t fully internalize - particularly when delivery pressure is high.
PR reviews trend towards “syntax checking” over “model checking”; reviewers fail to grasp deep system invariants/regressions while spending time on surface-level issues.
context rot; long agentic runs accumulate contradictory approaches which confuses author & reviewers.
weak ownership; if “who understands this end-to-end” isn’t explicit, everyone assumes someone else does.

approach to mitigate

robust PR reviews; reviewers follow conventional commits. Authors structure changes into atomic commits & stacked PRs. Complex PRs include analysis docs summarizing the nature of the effort. We expect that reviews may take time.
AI-augmented review; we use LLMs to outline module hierarchies, suggest refactors, and verify assumptions when reading other contributors' code.
quick feedback loops; we invest in ensuring that we are able to have automated validation of business logic & outcomes.
ownership; we expect team members to be able to articulate their work and ship in reviewable chunks.
metrics; as necessary, track review iteration rate, follow-up PRs per PR, Revert rate / hot fixes per PR/epic.

rossdm/comprehension-debt.md

Select an option

No results found

Select an option

No results found

background

symptoms

approach to mitigate

related reading