6 MIN READ · Pedro Thomaz

AI coding agents didn't replace us — they changed what a day looks like

AI coding agents went from party trick to a normal part of how a small studio ships. An honest account of what they are genuinely good at, what they are still bad at, and the real shift underneath: the work is no longer writing the code — it is verifying it.
AI coding agents didn't replace us — they changed what a day looks like

Somewhere in the last year the shape of a working day quietly changed. We did not fire anyone, hire a robot, or ten-times our output. But the hours look different, and pretending otherwise would be dishonest. AI coding agents — the kind that read your codebase, edit files, run commands and iterate — went from a party trick to a normal part of how a small studio ships. Here is what actually changed, minus the breathless part.

What they are genuinely good at

The wins are real, and they cluster in one place: work that is mechanical, well-specified, and verifiable.

What they are still bad at

The failure modes are just as consistent, and knowing them is the whole skill now:

The real shift: the work is now verification

This is the part that took us a while to say out loud. When an agent can produce a plausible implementation in seconds, writing the code stops being the bottleneck. Trusting it becomes the bottleneck. The centre of gravity of a developer's day moves from typing to reviewing — reading a diff you did not write, with the specific suspicion that it might be subtly wrong, and having the taste to reject work that merely functions.

That is a different muscle than the industry spent thirty years training. It rewards people who can hold a clear spec, who verify against the real running thing instead of the vibe of the code, and who know their domain well enough to smell when an answer is off. It punishes cargo-culting an agent's output into production because it looked confident.

Where we landed

We are not romantics about this, and not doomers either. Used well, an agent is the fastest junior draughtsman you have ever worked with — tireless, encyclopaedic, and in genuine need of adult supervision. It did not replace the craft. It moved the craft up a level: from "can you write it" to "can you tell whether it is any good, and prove it." That was always the harder and more valuable question. Now it is most of the job.

(If you want our take on the other kind of agent — the autonomous product kind everyone wants to bolt onto their app — we already wrote that one: mostly, no.)