Judge-friendly proof. No vibes.
Fix failures with receipts.
Run a command in a safe sandbox, capture stdout/stderr, propose a patch, apply it, and verify the result — with every artifact visible in the UI.
Quick start
generates runscd backend && npm run dev
cd cli && npx ts-node src/index.ts start --path ../samples/demo-repo --command "node failing-test.js"
What makes this judge-proof
1Evidence is first: stdout/stderr is captured and shown verbatim.
2AI proposes patches — it doesn’t claim success.
3Verifier reruns the command to decide pass/fail.
Integrations:Cline · local code agentKestra · optional workflowsGitHub Actions · optional CI gate