Skip to content

Failure Signals Checklist

Use this checklist when reviewing a weak harness run.

  • Did the agent ask, or infer incorrectly, how to start the app?
  • Did it create directories or abstractions that do not match the intended product?
  • Did it stop after making a visible UI shell without a complete workflow?
  • Did it leave notes or artifacts that help a future run continue?
  • Could a fresh session understand what happened in under five minutes?