Failure Signals Checklist
Use this checklist when reviewing a weak harness run.
- Did the agent ask, or infer incorrectly, how to start the app?
- Did it create directories or abstractions that do not match the intended product?
- Did it stop after making a visible UI shell without a complete workflow?
- Did it leave notes or artifacts that help a future run continue?
- Could a fresh session understand what happened in under five minutes?