I'm Building Agents That Run While I Sleep

I Have No Idea If What They Ship Is Any Good

I've been building agents that write code while I sleep. Tools like Gastown run for hours without me watching. Changes land in branches I haven't read. A few weeks ago I realized I had no reliable way to know if any of it was correct: whether it actually does what I said it should do. I've run Claude Code works

