Field Notes

AI Speed Needs A Second Clock

2026-06-246 min readAIWorkQuality

As AI tools make work feel faster at the keyboard, the humane measurement problem is whether the whole system is actually getting calmer, clearer, and better.

The most seductive AI demo is still the one where the waiting disappears.

The cursor moves. The test appears. The draft becomes less embarrassing. The pull request fills itself in with the brisk confidence of a person who has never had to defend a naming convention in a meeting. For a few minutes, work feels newly obedient. The old drag of starting, searching, and remembering which file holds the terrible little helper function has been replaced by favorable weather.

That feeling matters. Anyone who writes, codes, designs, analyzes, researches, or manages a small bureaucracy of documents knows how much human energy gets spent pushing through the first inch of resistance. A tool that turns a blank page into an arguable object can do more for a working day than another dashboard telling everyone to be strategic.

But the feeling of speed is not the same thing as the speed of the work.

Google's 2025 DORA report is useful here because it treats AI less like a magic ingredient and more like a force moving through a system. The report connects AI adoption to delivery performance, documentation, platform engineering, and the social conditions that make changes survivable. The interesting question is what happens to the whole shop after more tokens, lines, tests, summaries, and options arrive.

That is where the second clock starts ticking.

The first clock is visible and flattering. It measures how quickly a person gets from prompt to artifact: code written, email drafted, deck outlined, research summarized, ticket closed. It is the clock that makes people say, with genuine relief, "I would have spent an hour on that." The second clock is quieter. It measures review, correction, integration, rework, risk, maintenance, and the social cost of making everyone around the tool keep pace.

The second clock is the one a team feels on Thursday afternoon, when the branch is full, the reviewer is tired, the generated tests are extensive but somehow do not test the thing that scares anyone, and the person who asked for "a quick AI pass" is now in a thread about whether quick meant disposable or production-bound.

This is why the METR study on experienced open-source developers caused such a useful disturbance. In that study, developers expected AI tools to make them faster and felt that they had gone faster, while measured completion time moved the other way. The result should not be treated as a universal verdict on AI coding. The authors are careful about the task mix, the time period, and the tools being tested. Still, the pattern is too human to ignore: a tool can make the hands feel helped while the work as a system becomes more complicated.

That complication has a texture. The assistant offers plausible paths, and now someone has to choose among them. It fills in boilerplate, and now someone has to notice which boilerplate became a contract. It explains unfamiliar code, and now someone has to decide whether the explanation was a map or a bedtime story.

Software teams know this already because they have lived through earlier forms of local acceleration. Copy-paste made repetition faster. Frameworks made whole classes of decisions faster. CI made feedback faster. Each gain was real, and each moved the bottleneck somewhere else. Mature productivity has to ask what new obligation the faster thing created.

Recent empirical work keeps circling that problem from different angles. A 2026 systematic review of AI pair programming finds an uneven evidence base, with benefits depending heavily on task, user, context, and the way productivity is measured. That is not an exciting slogan, which is precisely why it is useful. The result is closer to: this changes the room, and you had better know which room you are in.

This matters beyond software. Coding is simply the place where the artifact is unusually inspectable. The same two clocks are now showing up in legal research, marketing work, sales operations, support documentation, policy analysis, lesson planning, and every office ritual where a person once had to sit with a blank thing long enough to discover what they actually thought. AI can produce the first version faster. Then the organization has to absorb it: check it, route it, correct it, and own it.

A lot of workplace AI measurement is still built for the first clock. Usage goes up. Time-to-draft goes down. Employees report that they are saving hours. Leaders, who are only human, look at these numbers and imagine a more elegant calendar. The danger is that saved hours become fictional if they return elsewhere as review debt, coordination drag, quality drift, or the exhaustion of almost-finished work.

Almost-finished work is its own kind of workplace weather. It looks productive from a distance. Up close, it asks for attention with the persistence of a toddler in formalwear. A generated memo may be 80 percent right, which sounds generous until the missing 20 percent includes the legal caveat, the customer history, and the sentence that makes a reader trust the author.

The humane response is not to become theatrical about friction. Nobody needs a sermon about the moral value of typing imports by hand. AI should remove stupid drag where it can. It should help people begin, compare, explore, and recover from the small stalls that make a day feel like pushing furniture through a hallway.

But organizations need to measure the whole shape of the work, not only the glamorous disappearance of the first delay. They need to ask how much review capacity a faster creation surface consumes. They need to track whether defects move downstream. They need to notice whether junior people are learning more or merely accepting more.

Interface design can help. AI tools should show where confidence is earned, where assumptions enter, and where a human should slow down. They should make review feel like part of the work rather than a tedious toll booth after the magic. They should support comparison, provenance, test intent, and rollback. They should make it easier to say, "this is a sketch," "this is ready for review," and "this is something we are willing to own."

Management design has to help too. If a team adopts AI and then keeps the same deadlines, quality bars, review rituals, staffing assumptions, and reward systems, it has poured a faster liquid into the old container and hoped the floor is not relevant. The second clock will keep the record anyway: the late-night reviewer, the support team absorbing confident mistakes, and the employee who technically saved six hours while spending four of them cleaning up the saved hours.

The next serious productivity conversation should be less dazzled by velocity and more interested in where time reappears. AI may make many kinds of work faster, and in some places it already does. The question worth staying with is whether the speed leaves the system more capable, or only more crowded with things that arrived before anyone understood them.

A good tool gives time back. A good system knows where the time went. The difference between those two promises is where the real work of humane AI adoption now lives.