top of page

OpenAI, Anthropic, and the Race Toward Controlled Autonomy

  • Writer: Felix-Sebastian Cosma
    Felix-Sebastian Cosma
  • 3 days ago
  • 6 min read

Everyone is talking about autonomous AI agents.


The more important question is controlled autonomy.


That may sound like a contradiction. It is not. Autonomy without control is not a product category. It is a liability. Control without autonomy is not much better. It is just another manual workflow with a more expensive interface.


The real race between OpenAI, Anthropic, and the rest of the AI industry is not simply about who builds the most intelligent model. Intelligence matters, of course. Speed matters. Tool use matters. Context windows matter. But the market is moving toward a more practical question: who can give AI systems enough authority to be useful without giving them so much authority that they become dangerous, unpredictable, or impossible to govern?


This is the race toward controlled autonomy.


From chat to action


For the first wave of AI adoption, the interface was chat. A user typed a question, the model produced an answer, and the risk was mostly contained inside the quality of that answer. The model could be wrong, lazy, overconfident, or misleading, but in most cases it could not directly change the world. It could advise. It could draft. It could summarize. It could produce code that a human still had to review and run.


That boundary is disappearing.


OpenAI has been moving ChatGPT from a conversational assistant toward an agent that can use a virtual computer, browse websites, run code, work with files, and ask for permission before consequential actions. Anthropic has been moving Claude in a similar direction from another angle, especially through Claude Code and computer-use capabilities that let the model edit files, run commands, and interact with development tools.


This is a serious shift. The product is no longer merely the answer. The product is the action.

Once an AI system can take actions, the central risk changes. The question is no longer only whether the model said something wrong. The question is what the model was allowed to do after it said it.


Think about it. A bad answer in a chat window is one problem. A bad answer that sends an email, changes a file, deletes a database record, commits code, exposes private data, or triggers a payment is a different problem entirely.


That is why the leading AI companies are not just competing on capability. They are competing on permission systems, sandboxes, user confirmations, tool boundaries, isolation, monitoring, and refusal behavior. These are not secondary features. They are the product.


OpenAI: broad utility with explicit control points


OpenAI appears to be pushing toward broad consumer and business utility. The direction is clear: an agent that can move across research, browsing, code execution, document creation, connectors, and web interaction inside a unified experience. The promise is convenience. Ask for a task, and the system figures out the path.


That is powerful because many real tasks do not fit neatly into one tool. Planning a meeting may require calendar context, email context, news research, document synthesis, and perhaps a final message. Competitive analysis may require browsing, spreadsheet work, and presentation output. The more the agent can move between tools, the more useful it becomes.


But broad utility creates broad risk. A general agent that can access many tools also has many possible ways to fail. It can misunderstand the user. It can be manipulated by hostile content on the web. It can expose sensitive information from connectors. It can perform an action that the user would not have approved if the risk had been clear.


OpenAI’s answer, at least publicly, has been to combine capability with explicit control points: permission before consequential actions, the ability to interrupt or take over, active supervision for certain critical tasks, refusals for high-risk actions, and privacy controls around browsing sessions and logged-in websites.


This is the right direction, because a general-purpose agent needs more than intelligence. It needs discipline. It needs to know when it is allowed to continue and when it must stop. The hard problem is that permissions cannot become theater. If the user is asked to approve too much, approval becomes noise. If the user is asked too little, autonomy becomes blind trust.


The winning system will not simply ask for permission more often. It will ask for permission at the right moments.


Anthropic: developer autonomy inside stronger boundaries


Anthropic’s visible strategy has a different flavor. Claude Code is not trying to be a general assistant for every consumer task. It is focused heavily on software work, where the environment is dangerous in a very concrete way. A coding agent can modify files, run shell commands, install packages, touch secrets, create vulnerabilities, or break a production workflow if the boundaries are weak.


That makes software development one of the clearest test beds for controlled autonomy. Developers want agents to move fast. They do not want to approve every harmless command. But they also do not want an agent with unlimited access to the filesystem, network, credentials, and deployment pipelines.


Anthropic’s sandboxing work is important because it recognizes a basic truth: per-action approval does not scale. If the agent has to ask for everything, the user becomes the bottleneck. Worse, the user starts clicking yes without thinking. Approval fatigue turns human oversight into a ritual.

Sandboxing changes the model. Instead of asking permission for every action, the system defines a safe boundary upfront. Inside that boundary, the agent can move more freely. Outside that boundary, it cannot. That is closer to how real delegation works. You do not tell a junior engineer to ask before every keystroke. You give him access to a branch, a test environment, and a clear scope of work.


This approach is more serious than pretending that every risk can be handled through prompts. A prompt can tell an agent not to touch sensitive files. A sandbox can make those files unreachable. A prompt can tell an agent not to exfiltrate data. Network isolation can make exfiltration harder. The first is instruction. The second is control.


The autonomy problem is a governance problem


The industry often talks about agents as if autonomy were a single slider. Less autonomy means safer. More autonomy means more useful. Reality is not that simple.


Autonomy is not one thing. It has scope, duration, authority, memory, access, reversibility, and consequence. An agent that can summarize documents autonomously is not the same as an agent that can send messages autonomously. An agent that can edit a local draft is not the same as an agent that can merge code into production. An agent that can browse public web pages is not the same as an agent that can use private connectors.


The useful question is not whether an AI system should be autonomous. The useful question is where autonomy should stop.


That is why controlled autonomy is becoming the real battleground. OpenAI and Anthropic are approaching it from different product surfaces, but both are being pulled toward the same architectural reality. Agents need tools. Tools create consequences. Consequences require governance.


This is also why the language of “human in the loop” is not enough. It sounds responsible, but it can mean almost anything. Is the human approving every action? Reviewing a plan before execution? Watching only critical steps? Auditing results after the fact? Setting policies that the system enforces automatically? These are very different models of control.


A serious agent system must answer practical questions. What can the agent access? What can it modify? What must it ask before doing? What can it never do? What gets logged? Who owns the result? How does the system recover when something goes wrong?


Those are governance questions.


Why this race matters


The companies that solve controlled autonomy will not merely make better assistants. They will define the operating model for AI in organizations.


Enterprises do not adopt technology only because it is impressive. They adopt it when it can be made reliable, auditable, manageable, and politically survivable. A demo can tolerate magic. A business process cannot. The moment an agent touches customer data, legal obligations, financial records, production systems, or external communications, somebody has to own the decision.


This is where the next phase of AI competition becomes more mature. Capability is still necessary, but capability alone is not enough. The market will reward systems that can explain their boundaries, enforce authority, reduce approval fatigue, and keep humans responsible without making them manually supervise every step.


OpenAI has the advantage of distribution and broad task coverage. Anthropic has a strong position in developer workflows and safety framing. Both are moving toward the same truth: an AI agent becomes valuable when it can act, but it becomes acceptable only when it can be controlled.

That is the tension. Everyone wants agents that can do more. Nobody wants to be surprised by what they did.


The principle


The next AI race will not be won by the company that simply gives agents the most freedom.

It will be won by the company that understands where freedom must end.


Autonomy scales action. Governance scales trust.


Recent Posts

See All

Comments


bottom of page