Available nowhi@idler.ai
IdlerRL environments for coding
Issue N°03Coll. 2026Ref. RL-2026-R02
MandateTrain AI to code at an expert human level. Real engineering, graded against a working result.
The corpus / index
DebuggingFeature workRefactorsTests & review Long-horizonError recoveryTool useMigrations ReviewsIncidents
*The corpus, rotating — one generative spine
01

Method

a skill → a graded test
From real engineering work to a graded world. The same five steps every time.
01PerceiveFind where coding agents break on real engineering work.
02RepresentTurn the task into an environment with a checkable result.
03BuildStand up the repo, the tests, and the grader.
04ScaleMass-produce variants. Early environments become training data.
05MeasureScore where models fail, and aim the next environment there.
02

Environments

the engineering work

Debugging

Reproduce, localize, and fix real bugs in a live repo.

Feature work

Build features across an unfamiliar codebase.

Refactors

Restructure code without breaking what works.

Tests & review

Write tests, read diffs, and catch regressions.

03

Why Idler

real, graded, frontier

Real

Environments from real engineering work, never invented benchmarks. The skill transfers.

Graded

Every step checked against a working result. Dense reward, not just pass or fail.

Frontier

Built for the best models, aimed at the engineering they still get wrong.

Tell us where your models fail at real engineering. We build the world that trains it.

Request access