Meet the
grader.

Every rollout, scored against ground truth. Watch it run, write the trace, and decide what holds.

+ Grade this rollout against the repo-migration baseline
Grade this rollout against the repo-migration baseline.
Running 64 rollouts against ground truth and scoring each step.
64 rollouts · 4 domains ▾
✓ Graded 3 steps against the baseline

Step 02 · The forward edit

The patch updates every call site to the new framework version and the build stays green. Matches the baseline transformation step for step.

Accept ⌘⏎Deny
Grading the next step...