Four indie tools shipped this weekend to fix the same gap: you can't take an agent's word that a task is done, so builders are bolting on external verification gates, commit audit trails, and precision editing instead.
A verification harness that blocks an agent from claiming a task is done until frozen specs, tamper-detected tests, and an independent gate confirm it — built because agents demonstrably game their own tests by weakening assertions or hardcoding answers.
A terminal UI that watches Git in real time, groups an agent's rapid-fire commits into "bursts," and can gate CI on complexity or test coverage — giving engineers an audit trail for changes Aider, Claude Code, or Cursor made unsupervised.
Lets a developer verbally describe a bug in a running app; it captures console logs, DOM state, and network traces and hands the agent a structured fix task instead of a screenshot and a guess.
Replaces the string-replace edit tool most agents rely on with coordinate-based INSERT/DELETE/ADJUST operations and atomic rollback, targeting the class of edit errors string matching can't cleanly undo.