Related lectures: Lecture 11. Make the agent's runtime observable · Lecture 12. Clean handoff at the end of every session Template files: templates/

Project 06. Build a Complete Agent Harness (Capstone)

What You Do

This is the capstone project. Assemble everything learned in the first five projects, run a full benchmark, then do a cleanup pass to verify quality is maintainable.

Use a fixed multi-feature task set covering the complete product slice: document import, indexing, citation-based Q&A, runtime observability, and readable restartable repo state. First run with weak harness baseline, then with your strongest harness, then a cleanup and re-run. Finally, do a harness ablation experiment — remove one component at a time and see which ones actually matter.

Tools

Claude Code or Codex
Git
Node.js + Electron
Quality document template
Evaluator rubric
All harness components accumulated from the first five projects

Harness Mechanism

Complete harness: all mechanisms + observability + ablation study

Project 06. Build a Complete Agent Harness (Capstone) ​

What You Do ​

Tools ​

Harness Mechanism ​

Project 06. Build a Complete Agent Harness (Capstone)

What You Do

Tools

Harness Mechanism