Field Notes

Agent-Readable Software Needs Caretakers

2026-05-196 min readAIInterfacesMaintenance

As software gets redesigned for agents as well as humans, the hidden work is not making everything autonomous. It is keeping systems legible enough to trust.

The next interface shift may not look like an interface at all.

It may look like an API spec that is finally accurate. A permission model that does not require folk knowledge. A sandbox that lets an agent work without letting it wander through the whole machine. A contextual panel that follows the issue, the pull request, and the repository instead of asking the developer to paste the world into a chat box.

That is less glamorous than a new model demo.

It may also be closer to the actual future of AI work.

Anthropic's acquisition of Stainless is a useful signal because it sounds boring in exactly the right way. Stainless generates SDKs, CLIs, and MCP servers from API specifications. In other words, it helps turn a product's declared shape into something developers and agents can actually use. Anthropic framed the move around a simple premise: agents are only as useful as the systems they can reach.

Microsoft is making a similar argument from another direction. If agents become major users of enterprise software, the old assumption that software is designed only for humans starts to break. The system still needs a human interface, but it also needs business logic and prepared data that agents can operate against without reinventing the company every time they run.

GitHub's recent Copilot updates point to the same direction in miniature. Copilot chat now opens in context on the GitHub page a developer is viewing, automatically attaching pull requests, issues, repositories, and other references as the user moves. The assistant is less a separate destination and more a layer of attention riding along the work surface.

The pattern is not hard to see.

AI tools are leaving the blank box.

They are moving into the structure of software itself: APIs, sandboxes, repositories, permissions, source control, data models, workflow rules, and the weird institutional sediment that determines what can actually happen.

This changes the work.

For a long time, a lot of digital products were allowed to survive on human adaptability. The labels were confusing, but employees learned them. The approval flow was undocumented, but someone knew whom to ask. The dashboard was noisy, but the senior person could squint at it and extract the signal. The API had edge cases, but the integration engineer carried them around like scar tissue.

Human users are strangely forgiving because they can build private maps.

Agents are different.

They can be resilient in impressive ways, but they also expose every place where the system was never truly legible. A messy permission boundary becomes an unsafe action surface. A stale API spec becomes a broken workflow. A field named one thing but used as another becomes quiet corruption. A process that only exists in someone's memory becomes inaccessible labor.

This is why "agent-ready" should not be treated as a sparkle layer.

It is maintenance.

Not maintenance in the tired sense of keeping old machinery alive while the interesting people build new things. Maintenance as the discipline of making a system honest enough that another actor can use it without absorbing years of institutional ambiguity.

That is a high standard.

It asks for cleaner contracts between systems. It asks for documentation that reflects reality. It asks for audit trails, recoverable actions, explicit approvals, and useful failure states. It asks teams to decide which operations should be easy, which should be slow, and which should remain inconvenient because the inconvenience is part of the safety.

The OpenAI post on building a Codex sandbox for Windows makes this concrete. Without a useful sandbox, users had to choose between approving almost everything or giving the agent too much freedom. That is a design failure masquerading as a security setting. Good boundaries are not anti-autonomy. They are what make autonomy survivable.

The same is true inside organizations.

An agent that can operate the CRM, the spreadsheet, the ticket queue, the deployment pipeline, or the finance system does not remove the need for human judgment. It moves judgment into a different place. Someone has to decide what the agent is allowed to see, what it can change, when it needs approval, how its work is reviewed, and what evidence remains after the task is complete.

That someone is not just a security team.

It is product design. It is operations. It is engineering leadership. It is the person who knows why the "temporary" spreadsheet from 2019 still runs a real process. It is the manager who understands that a faster workflow can still make the work worse if nobody knows what good looks like.

Agent-readable software will reward a kind of care that many organizations have historically underfunded.

The clean API. The named workflow. The well-tended knowledge base. The boring migration away from hidden manual steps. The product surface that tells the truth about system state. The data model that does not require an oral tradition to interpret it.

These were already signs of good systems.

Agents make them harder to ignore.

There is a temptation to describe this as a future where software disappears. The agent handles the interface. The human states the goal. The machinery performs the rest.

Maybe some interactions will feel that way.

But disappearance is the wrong dream. When systems disappear too completely, accountability disappears with them. The better future is not invisible software. It is software whose important boundaries become more visible: what happened, why it happened, who authorized it, what can be undone, and where the human must still stand close.

That is less magical than full automation.

It is also more humane.

The strange gift of agents may be that they force software to become more truthful. They will not do that automatically. They will happily run through confusion at machine speed if we let them. But if teams take the moment seriously, agent-readable software could become a pressure toward systems that are cleaner, calmer, and easier for humans to understand too.

The machine user is arriving.

The question is whether its arrival makes our systems more obscure, or finally makes us care for the parts we taught people to work around.