Field Notes

Sandboxes Are The Agent Interface

2026-05-265 min readAIInterfacesTrust

As agents move from chat into managed work environments, the most important interface may be the room we let them work inside.

The most revealing AI product detail right now may not be the model name.

It may be the room.

Google's new Managed Agents in the Gemini API arrive as more than a smarter endpoint. They come with isolated Linux environments, persistent state, resumable interactions, tools, instructions, skills, and data. OpenAI's recent Codex sandbox work tells the same story from a lower level: a coding agent is only useful when it can touch a real machine, and only tolerable when that touch has boundaries. GitHub's Copilot app preview also leans into focused sessions, isolation, steering, validation, and pull request review.

This is a shift in the shape of AI work.

The old assistant lived in a text box. You asked. It answered. Even when the answer was wrong, the mistake usually arrived as language. The next assistant lives inside an environment. It reads files, runs commands, edits state, calls tools, resumes later, and leaves artifacts behind. The question is no longer only what the model knows or how fluently it explains itself.

The question is where it is allowed to happen.

That sounds like a security concern, and it is. But treating sandboxes as only security plumbing misses the more interesting interface problem. A sandbox is not just a cage. It is a designed workplace. It decides what counts as nearby, what is out of reach, what requires permission, what can be undone, what persists, what disappears, and what evidence remains after the work is finished.

For human users, those details become the felt experience of trust.

If an agent can read everything but write only inside the current workspace, that is an interface decision. If network access is blocked unless the user grants it, that is an interface decision. If a managed environment can be resumed with its files and state intact, that is an interface decision. If every risky action requires another approval, that is also an interface decision, and sometimes a hostile one, because constant consent prompts can turn oversight into fatigue.

Good containment is not the same as maximum friction.

This is why OpenAI's Windows sandbox write-up is more culturally interesting than its technical specificity might suggest. The hard part was not merely making Codex weaker. The hard part was making it constrained enough to be safe and capable enough to be useful. Too much access turns the agent into a liability. Too little access turns it into a clerk that asks permission to pick up every pencil.

That tension is going to define a lot of AI product design.

Organizations want agents because they promise motion without constant human handling. The tool can inspect, plan, modify, test, and report while a person does something else. But the moment the agent acts inside real systems, the fantasy of frictionless delegation meets the older reality of responsibility. Someone has to decide which files are writable, which tools are callable, which secrets are unavailable, which logs are durable, which requests need approval, and which actions should simply be impossible.

Those are not secondary settings.

They are the product.

In a chat-first world, interface design often meant shaping the conversation: the prompt box, the answer format, the citations, the regenerate button, the model picker. In an agent-first world, the interface expands into the operating conditions. The environment becomes a kind of contract between human intention and machine action. It says: this is the workbench, these are the tools, these are the walls, this is the door, and here is what happens when the agent tries to leave.

That is a more honest metaphor than intelligence.

Intelligence talk makes us stare at the model as if capability alone will settle the matter. Environment talk makes us ask how work is arranged. A brilliant agent in a careless environment can still create damage, confusion, or meaningless output. A less glamorous agent in a well-shaped environment may produce work that is easier to review, safer to trust, and calmer to live with.

This matters beyond software development.

Every workplace agent will need a room. A finance agent needs access to some numbers and not others. A support agent needs customer context but not casual authority to invent policy. A design agent needs brand materials, component rules, and a place where exploration can be messy without pretending every draft is approved. A research agent needs source boundaries and a memory that can be inspected. A health-adjacent agent needs strong walls around advice, privacy, escalation, and uncertainty.

The room is where organizational values become mechanical.

Many companies will try to skip this part. They will buy the agent, connect the systems, celebrate adoption, and call the rest governance. But governance that lives only in a policy document is theater. The real policy is the action surface: what the agent can see, what it can change, what it must ask, what it records, and what the human can understand without becoming a full-time auditor.

There is a humane version of this future.

It does not ask people to supervise everything forever. It also does not ask them to trust invisible work because the demo looked competent. It gives agents enough room to be useful and enough structure to remain answerable. It reduces meaningless interruptions while preserving the moments where human judgment actually matters. It treats boundaries as care, not as bureaucratic drag.

The uncanny thing about agents is that they make software feel less like a tool and more like a coworker. But maybe the better question is not whether the agent feels human.

Maybe the better question is whether we have built it a decent place to work.

Because the future will not be made only by smarter models. It will be made by the rooms we give them, the doors we lock, the windows we leave open, and the traces we require before anyone is allowed to say the work is done.