Skip to content

中文版本 →

Related lectures: Lecture 11. Make the agent's runtime observable · Lecture 12. Clean handoff at the end of every session Template files: templates/

Project 06. Build a Complete Agent Harness (Capstone)

What You Do

This is the capstone project. Assemble everything learned in the first five projects, run a full benchmark, then do a cleanup pass to verify quality is maintainable.

Use a fixed multi-feature task set covering the complete product slice: document import, indexing, citation-based Q&A, runtime observability, and readable restartable repo state. First run with weak harness baseline, then with your strongest harness, then a cleanup and re-run. Finally, do a harness ablation experiment — remove one component at a time and see which ones actually matter.

Tools

  • Claude Code or Codex
  • Git
  • Node.js + Electron
  • Quality document template
  • Evaluator rubric
  • All harness components accumulated from the first five projects

Harness Mechanism

Complete harness: all mechanisms + observability + ablation study