Google Launches Managed Agents in the Gemini API

Google Launches Managed Agents in the Gemini API

Google just made it significantly easier to build and ship AI agents without babysitting infrastructure. On May 19, 2026, Google announced managed agents in the Gemini API — a feature that lets developers define agents as configuration files and run them inside secure, fully managed cloud sandboxes. No container wrangling, no custom orchestration layer, no “wait, why is the agent leaking memory again at 2am” incidents. Just define, deploy, run.

This is a meaningful step forward. And depending on how tightly you follow the agentic AI space, it might be more significant than it first appears.

Why This Matters Now

The past 18 months have seen every major AI lab race to build “agentic” capabilities into their platforms. OpenAI has Codex and its operator framework. Anthropic has Claude’s tool-use APIs. Meta’s Llama models have been bolted into various agent frameworks by the open-source community. But the dirty secret of the agentic AI boom is that building agents is still genuinely hard — not because the models aren’t capable, but because the infrastructure around them is a mess.

Most developers building agents today are duct-taping together LLM calls, tool definitions, memory layers, retry logic, sandboxed execution environments, and some form of state management. The model is often the easiest part. The plumbing kills you.

That’s the gap Google is targeting here. Managed agents in the Gemini API abstract away most of that plumbing. You define your agent’s behavior in a structured file — think of it like a declarative spec for what your agent does, what tools it can use, what permissions it has — and Google’s infrastructure handles the rest: spinning up the execution environment, managing state, enforcing security boundaries, and scaling as needed.

The timing makes sense too. Google has been aggressively expanding its developer ecosystem around Gemini throughout 2026. We covered Google’s Gemini Startup Forum earlier this year, where over 100 companies were already building production applications on Gemini. Managed agents feels like Google’s move to lock those developers in deeper — give them infrastructure they’d otherwise have to build themselves, and make switching to a competitor much more painful.

What Managed Agents Actually Do

Here’s the core idea broken down clearly. With managed agents, you’re working with three main concepts:

  • Agent Definition Files: You define your agent as a structured configuration — its instructions, the tools it can call (web search, code execution, external APIs, etc.), its memory behavior, and its operational constraints. This is declarative, meaning you describe what the agent should do, not how to orchestrate every step.
  • Secure Cloud Sandboxes: When an agent runs, it executes inside an isolated environment managed entirely by Google’s cloud infrastructure. The sandbox enforces security boundaries — your agent can’t go rogue and start doing things outside its defined permissions. This is critical for enterprise use cases where compliance and auditability matter.
  • Managed State and Lifecycle: Google handles the agent’s runtime state, meaning you don’t have to build your own persistence layer for multi-step tasks. The platform tracks what the agent has done, what it’s waiting on, and manages handoffs between steps.
  • Built-in Tool Access: Agents get native access to Gemini’s existing tool ecosystem — Google Search, code execution, and presumably Google Workspace integrations. Third-party tool connections are also supported through the API.
  • Monitoring and Logging: Because the execution environment is managed, you get observability out of the box. Logs, traces, and usage metrics surface through Google Cloud’s standard tooling.

This is conceptually similar to what OpenAI has built with its Assistants API, which also provides managed threads, file storage, and tool execution. But Google is going further on the infrastructure side — the sandbox model feels closer to running a microservice than just managing a conversation thread.

There’s also a parallel worth drawing to how serverless functions evolved. AWS Lambda and Google Cloud Functions let developers define units of compute in simple files and offload all the server management to the cloud. Managed agents feel like the same conceptual move applied to AI — define your agent’s behavior declaratively, let the platform handle execution.

Who This Is Actually Built For

Let’s be honest about the target audience here. This isn’t for hobbyists asking Gemini to write poems. This is squarely aimed at professional developers and enterprises that want to deploy AI agents in production.

The “secure cloud sandbox” framing is doing a lot of work in that sentence. Security and isolation are not features developers care about when they’re prototyping. They’re features procurement teams, CISOs, and compliance officers demand before anything touches production. By making isolation a platform-level feature rather than something developers bolt on, Google is making it easier for enterprise buyers to say yes.

I’d expect the typical early adopter here to be engineering teams at mid-to-large companies — the kind building internal automation agents for data pipelines, customer support workflows, document processing, or code review. Teams that have the Gemini API budget already and need to move agents from prototype to production without spinning up dedicated DevOps resources.

Interestingly, this also has implications for smaller teams. If you’re a startup building an AI-native product on top of Gemini, managed agents removes a significant chunk of infrastructure work that would otherwise eat engineering time. We’ve seen how transformative that kind of infrastructure abstraction can be in other contexts — it’s essentially the same reason Stripe killed a category of payment infrastructure engineers at early-stage startups.

The competitive pressure on OpenAI is real here too. OpenAI has invested heavily in developer adoption through Codex and its various API products. We’ve written about how business ops teams are using Codex and the operational patterns emerging there. Google is now offering a competing managed execution story that may be more appealing to teams already invested in Google Cloud.

The Security Question Nobody’s Answering Yet

Here’s the thing: “secure cloud sandbox” is a marketing claim until it’s been stress-tested. What exactly are the isolation guarantees? Can agents access external networks by default, or is outbound traffic restricted? How are credentials for third-party tools stored and injected into the sandbox? What happens when an agent is in the middle of a multi-step task and crashes — does state recover cleanly?

These are not hypothetical concerns. The history of cloud execution environments is littered with sandbox escapes, credential leakage, and surprising shared-tenancy behaviors. Google has the engineering depth to get this right — they run some of the most sophisticated container infrastructure on the planet — but the details of the security model will matter enormously to enterprise buyers.

The Vertex AI documentation ecosystem is where those details will likely live, and that’s where I’d look for the fine print before committing anything sensitive to managed agents.

Key Takeaways

  • Google’s managed agents let developers define AI agents as files and run them in fully managed, isolated cloud environments — removing most of the infrastructure burden from teams building agentic applications.
  • The secure sandbox model is specifically designed for production and enterprise use cases where security boundaries and auditability matter.
  • This competes directly with OpenAI’s Assistants API but pushes further on the infrastructure side, closer to a serverless execution model for agents.
  • Teams already on Google Cloud will find the deepest integration benefits; the monitoring and tooling story ties into existing GCP infrastructure.
  • The critical unknowns are the specific security guarantees, pricing at scale, and how well the platform handles complex multi-step agent failures.

What is the Gemini API managed agents feature?

It’s a new capability in Google’s Gemini API that lets developers define AI agents as configuration files and deploy them in secure, Google-managed cloud sandboxes. The platform handles execution, state management, security isolation, and scaling so developers don’t have to build that infrastructure themselves.

Who should consider using managed agents?

Professional developers and enterprise engineering teams building production AI agents — particularly those already using Google Cloud. It’s best suited for automation workflows, document processing, data pipelines, and any multi-step agentic task that needs to run reliably at scale.

How does this compare to OpenAI’s Assistants API?

Both provide managed execution environments for AI agents, but Google’s approach leans harder into infrastructure isolation with its sandbox model, making it feel closer to serverless compute than a managed conversation framework. OpenAI’s offering has a larger existing developer base and deeper integration with Codex and other tools.

When and how can developers access managed agents?

The feature was announced May 19, 2026 through the official Gemini API. Developers can access it through the Gemini API, with full documentation expected through Google’s developer and Vertex AI portals. Pricing details are likely tied to existing Gemini API usage tiers.

Google is betting that the harder it is to leave its infrastructure, the longer developers stay. Managed agents is genuinely useful, but it’s also a stickiness play — once your agent definitions, state, and tooling are baked into Google’s execution environment, migrating becomes a real project. I wouldn’t be surprised if we see rapid adoption among Google Cloud shops in H2 2026, followed by a longer conversation about what vendor lock-in actually costs when the platform is doing this much work for you.