DevelopmentNew Release13 min readPublished May 19, 2026

Two products named Managed Agents, opposite architectures, shipped on the same morning — Google's runs inside Google's sandbox, Anthropic's runs inside yours.

Google Managed Agents API: Agents in a Single Call

Google launched the Managed Agents API on May 19, 2026 — a single POST to the Interactions API spins up a Gemini 3.5 Flash agent in an isolated Linux sandbox with code execution, web search, and URL fetch. The Antigravity harness drives the runtime; AGENTS.md and SKILL.md replace orchestration code. And on the exact same morning, Anthropic shipped a product with the identical name.

DA
Digital Applied Team
Senior strategists · Published May 19, 2026
PublishedMay 19, 2026
Read time13 min
Sources8
To spin up an agent
1
API call — POST /v1beta/interactions
preview today
Default agent ID
antigravity
antigravity-preview-05-2026
Same-name products
2
Google + Anthropic, same morning
May 19, 2026
Gemini 3.5 Flash pricing
$1.50/Mtok
$9.00 output · $0.15 cached · compute free

Google's Managed Agents API launched on May 19, 2026 as a public preview, giving any developer the ability to spin up a Gemini 3.5 Flash agent with code execution, web search, and URL fetch in a single Interactions API call — no orchestration framework, no sandbox provisioning, no agent loop to maintain.

The timing produced the week's sharpest narrative collision: on the same morning, at Code with Claude London, Anthropic added self-hosted sandboxes and MCP tunnels to its own product — also called "Managed Agents." Two frontier labs, one product name, opposite runtime philosophies. Google's version runs exclusively inside Google's infrastructure; Anthropic's runs in yours.

This guide covers what Google actually shipped: the Interactions API surface and its previous_interaction_id state model, the Antigravity harness that powers the runtime, which tools are available in preview and which are gated, how preview pricing works, and how the three managed-agent primitives from Google, Anthropic, and OpenAI now sit side by side. For the full Google I/O 2026 announcement roundup, see our hub post.

Key takeaways
  1. 01
    One POST call is the entire agent primitive.POST /v1beta/interactions with an agent field (instead of model) spins up an Antigravity-powered Gemini 3.5 Flash agent in an isolated Linux sandbox. Google handles the runtime; you handle the prompt and the tools list.
  2. 02
    Antigravity is the harness; AGENTS.md + SKILL.md replace code.The default managed agent is antigravity-preview-05-2026. Custom agents are configured via versionable markdown files — AGENTS.md for the agent definition, SKILL.md for invocable skills — the same format Antigravity 2.0 uses.
  3. 03
    State persists across calls via previous_interaction_id.Each interaction returns an environment ID. Pass previous_interaction_id to resume the session with all files and state intact. Custom environments have a 7-day TTL. Sessions can also be forked by branching the same parent ID.
  4. 04
    Three tools available in preview; five are gated.Available: code_execution, google_search, url_context. Unavailable during preview: file_search, computer_use, google_maps, function_calling, mcp_server. Structured output is also unsupported in managed-agent preview runs.
  5. 05
    Same name, opposite architectures — the category just split open.Google's Managed Agents runs only inside Google's sandbox. Anthropic's Claude Managed Agents runs hosted or inside your perimeter via Cloudflare, Daytona, Modal, or Vercel. OpenAI's Agents SDK is the third pole — provider-agnostic and local-first.

01Name CollisionTwo products, one name, opposite architectures.

The cleanest story of the week is a naming accident that happened in real time. Google I/O 2026 went live on May 19 with "Managed Agents in the Gemini API" — a hosted agent runtime accessed via the Interactions API. Hours later, at the Code with Claude London event, Anthropic announced self-hosted sandboxes and MCP tunnels for its own product: "Claude Managed Agents." Both products share every word of the name. Neither team knew the other was shipping.

The architectural difference is the story. Google's Managed Agents runs exclusively inside Google's infrastructure — the antigravity-preview-05-2026agent ID is a Google-hosted runtime you cannot move or inspect at the infra level. Anthropic's Managed Agents is explicitly multi-tenant: the session can run on Anthropic's cloud or inside your own perimeter via self-hosted sandbox providers (Cloudflare, Daytona, Modal, Vercel). Same product category, opposite trust models. This is not a minor implementation detail — it maps directly to compliance, data sovereignty, and vendor lock-in decisions.

OpenAI's position is the third pole: the OpenAI Agents SDK (openai-agents, v0.17.3 as of May 2026) runs in your process against the Responses API and is provider-agnostic — supporting 100+ LLMs including non-OpenAI models. The "Sandbox Agents" pattern in that SDK (using UnixLocalSandboxClient) is the local-process equivalent, not a hosted runtime.

The broader implication: the "managed agent" category has split into at least three distinct runtime models in the span of one week. The table in section 07 maps all three against each other. For a detailed look at Anthropic's same-day Code with Claude London launch, see that companion post.

02Single-Call AgentOne POST — isolated Linux sandbox, Gemini 3.5 Flash, live.

Google's pitch is in one sentence from the announcement: "With a single call, you can now spin up an agent that reasons, uses tools and executes code in an isolated, ephemeral Linux environment." The mechanics behind that sentence are worth unpacking.

The consumer endpoint is POST https://generativelanguage.googleapis.com/v1beta/interactions. The enterprise / Vertex path is POST https://aiplatform.googleapis.com/v1beta1/projects/{PROJECT_ID}/locations/{LOCATION}/interactions. In both cases, the request body replaces the standard model field with an agent field pointing to a managed agent ID. The response is an interaction object with a status, an output, and an environment ID for follow-up calls.

Each interaction runs in an isolated, ephemeral Linux sandbox hosted by Google. The sandbox has no implicit outbound network — network access must be explicitly configured per agent. Google's docs describe it as "secure Google-hosted remote environments" without specifying the isolation technology (virtualization layer, syscall filtering, or similar). Do not assume a specific isolation primitive; the spec says "isolated ephemeral Linux environment."

The interaction status field reports six states: in_progress, requires_action, completed, failed, cancelled, incomplete, and budget_exceeded. For long agent loops, set background=True to offload the loop server-side and poll with the interaction ID, or set stream=True to receive Server-Sent Events as the agent runs.

One call, one agent
The Managed Agents API strips the orchestration layer entirely. You don't provision a sandbox, configure a tool executor, or write an agent loop. POST /v1beta/interactions with an agent field and Google runs the loop. "With Gemini Managed Agents, we're abstracting away the complexity, so you can focus on your product experience and agent behavior" — Ali Cevik & Philipp Schmid, Google DeepMind, May 19, 2026.

03Interactions APIHow the Interactions API extends generateContent.

The Interactions API is a new unified RESTful surface for stateful and agentic workloads. It extends generateContentwith server-side state, agent execution, and tool orchestration — but it does not replace it. Google's migration guide is explicit: "generateContent remains the primary path for standard production workloads; we recommend the Interactions API for all new development" involving agents or stateful sessions. Teams running existing generateContent integrations have no forced migration.

The key new body field is previous_interaction_id. Pass the ID returned from a prior interaction to resume the session with all files and state intact inside the same sandbox environment. Google's own description: "Each interaction creates or receives an environment, which you can use in follow-up calls to resume the session with all files and state intact." Custom environments have a 7-day standard TTL; you can also pre-load Cloud Storage data into the sandbox file system at creation time.

Sessions can be forked. Passing the same previous_interaction_id from two different new requests creates two divergent branches off the same parent — useful for A/B exploring agent paths without re-paying setup cost. The thinking_level parameter from Gemini 3.5 Flash is also available via generation_config.thinking_level (string enum: minimal | low | medium | high, default medium). See our Gemini 3.5 Flash benchmarks and API guide for a full treatment of the thinking parameter surface.

Agentic API surfaces compared · state model and capability scope

Source: Google Interactions API docs, OpenAI Responses API reference, Anthropic Managed Agents docs — May 2026
Interactions API (Google)POST /v1beta/interactions · previous_interaction_id · 7-day env TTL
State + agent + tools
Responses API (OpenAI)POST /v1/responses · previous_response_id · store: true
State + tools · no hosted sandbox
Claude Managed Agents (Anthropic)REST + SSE · managed-agents-2026-04-01 header · cloud or self-hosted
Session + env + events
generateContent (Google — stateless)POST /v1beta/models/{model}:generateContent · no session state
Stateless completion

The architectural parallel between the Interactions API and OpenAI's Responses API is deliberate and observable in the specs. Both use a single unified endpoint, both model state via a previous_*_idpattern, and both expose a tool catalog as first-class fields on the request body. Google's version adds the hosted-sandbox layer — the agent runs inside Google's infrastructure rather than calling out to external tools you host. That is the meaningful architectural delta.

04Antigravity HarnessAntigravity is the harness; markdown is the new orchestration code.

The default managed agent ID is antigravity-preview-05-2026. This is Google's Antigravity harness running Gemini 3.5 Flash — the same infrastructure that powers Google's own coding products. Google's announcement states: "You get access to the same technology and infrastructure that powers our own agents." The Interactions API also supports gemini-2.5-pro and gemini-2.5-flash for non-managed (direct model) calls on the same endpoint.

Custom agents replace orchestration code with versionable markdown files. AGENTS.md defines the agent — its purpose, constraints, and the list of skills it can invoke. SKILL.md defines individual invocable skills, using YAML frontmatter for metadata and a markdown body for the skill instructions. When a session opens, the agent reads the list of skill names and descriptions; when a task matches, the agent loads the full SKILL.md body. The result: agent behavior is version-controlled, reviewable, and deployable without a code change. For a detailed look at how Antigravity 2.0 fits the broader desktop-agent picture, see our Antigravity 2 deep dive.

This converges on a pattern emerging across all three frontier labs: markdown as agent runtime DSL. Google uses AGENTS.md / SKILL.md; Anthropic's tooling uses CLAUDE.md; the OpenAI Agents SDK uses instructions strings that are, in practice, markdown documents. The convergence on human-readable instruction files as the primary agent configuration surface is itself a signal about where the agent development model is heading.

The managed agent runtime is a bet that the future of agent development is a REST call and a markdown file — not a framework, not a loop, not a provisioned sandbox.Digital Applied synthesis, May 19, 2026

05Built-In ToolsThree tools available; five gated during preview.

The Interactions API schema declares a comprehensive tool catalog, but what's actually available for managed-agent runs during preview is a strict subset. The gap between the schema and the preview reality is the most practically important detail for any team evaluating this for production use.

Available
code_execution
Runs Python in the sandbox

The core tool. Executes code in the isolated Linux environment, reads and writes files, surfaces stdout/stderr and file artifacts in the response. This is the primary reason to choose Managed Agents over a plain Interactions API call.

Available in preview
Available
google_search
Live web search via Grounding API

Gives the agent real-time web access. Results are retrieved and summarized within the agent loop — you do not need to wire up a search tool separately. Uses Grounding API under the hood.

Available in preview
Available
url_context
Fetch and read a URL

The agent can retrieve and read a specific URL as part of its reasoning loop. Useful for documentation lookup, page summarization, or multi-page content extraction workflows.

Available in preview
Gated
file_search
Search across uploaded files

Declared in the Interactions API schema, not available for managed-agent runs during preview. Expected to open as the preview matures. Verify current status against the live API reference before assuming availability.

Unavailable — preview
Gated
computer_use
Desktop and browser control

Computer-use tool type exists in the schema. Unavailable for managed-agent preview runs. No announced timeline for managed-agent enablement — available separately via the Responses API for direct model calls.

Unavailable — preview
Gated
mcp_server
Remote MCP tool override

MCP server blocks are first-class in the Interactions API schema — you can specify a URL, name, and headers to bring your own MCP tools into the Google-hosted sandbox. Currently unavailable for managed-agent preview runs; available for direct model calls on the same endpoint.

Unavailable — preview

Additional tool types declared in the schema but unavailable during managed-agent preview: google_maps, function_calling (custom functions), and structured output. These limits are reported by secondary coverage and should be treated as subject to change — the preview matrix may shift daily as Google opens capability flags. Always verify against the live Interactions API reference before shipping production workloads. If your use case requires MCP or custom functions, the current recommendation is to use the Interactions API in direct model mode (with model rather than agent) until those tool types are enabled for managed agents.

06Preview PricingCompute is free during preview; tokens bill at standard rates.

Google's pricing model during the preview period has two components. Sandbox compute — CPU time, memory, and execution duration inside the Linux environment — is not separately metered. Model usage is billed at standard Gemini 3.5 Flash rates, which apply regardless of whether the call is a standard completion or a managed-agent interaction.

Input tokens
Gemini 3.5 Flash input
$1.50/Mtok

Standard rate during preview. All tokens sent to the model — prompt, system instructions, tool schemas, session context — bill at this rate. The 1M-token context window makes long agent loops viable at this price point.

Cached: $0.15/Mtok
Output tokens
Gemini 3.5 Flash output
$9.00/Mtok

Reasoning traces, code artifacts, tool call responses, and final output all count as output tokens. Multi-step agent loops with extensive reasoning can accumulate output tokens quickly — monitor with thinking_level=minimal for cost-sensitive workloads.

6× input rate
Sandbox compute
Compute during preview
$0

CPU time, memory, and execution duration inside the isolated Linux environment are not separately metered during preview. This rate is explicitly a preview benefit — compute pricing will likely surface at GA. Model token costs apply regardless.

Preview benefit · subject to change

The practical cost implication: an agent loop that generates 100K output tokens (reasoning traces + code + responses) costs approximately $0.90 in model usage at current rates, with sandbox compute free. At GA, compute charges will change that arithmetic. For teams evaluating managed-agent economics relative to self-hosted alternatives, the preview window is the right time to benchmark actual token consumption — the compute-free period gives you clean signal on the model-cost component before the full pricing materializes.

There are two access paths for the Managed Agents API: public preview on the Gemini API (available to any developer account) and private preview on the Gemini Enterprise Agent Platform (qualified enterprise customers on Vertex AI). The enterprise path adds the Vertex API endpoint, IAM-based access controls, and enterprise SLAs. Our AI transformation practice can help you evaluate which path fits your compliance and scale requirements.

07Platform ComparisonGoogle vs Anthropic vs OpenAI — the canonical managed-agent matrix.

Three managed-agent / agentic API primitives now exist from the major frontier labs. All three share a common architectural template — single endpoint, state via a previous_*_idpattern, built-in tool catalog — but diverge sharply on runtime location, sandbox model, and provider lock-in. The table below is the canonical cross-vendor reference for Q2 2026.

Google
Managed Agents API (Interactions API)

Runtime: Google-hosted only. Endpoint: POST /v1beta/interactions. State: previous_interaction_id + 7-day env TTL. Default agent: antigravity-preview-05-2026 (Gemini 3.5 Flash). MCP: schema-declared, preview-gated. GA status: public preview May 19, 2026. Pricing: model tokens at standard Gemini 3.5 Flash rates; compute free during preview.

Best for: Google-native workloads, code execution, search-augmented agents on the Gemini platform.
Anthropic
Claude Managed Agents (REST + SSE)

Runtime: Anthropic cloud or self-hosted sandbox (Cloudflare, Daytona, Modal, Vercel). Endpoint: Managed Agents REST API + SDK helpers, beta header managed-agents-2026-04-01. State: session + environment + SSE event stream. MCP: first-class; MCP tunnels in research preview. Rate limits: 300 create rpm / 600 read rpm per org. Compute: customer-owned on self-hosted path.

Best for: sovereignty or compliance requirements, bring-your-own-runtime, MCP-heavy workflows.
OpenAI
Agents SDK + Responses API

Runtime: your process (agent loop runs locally; Responses API is the stateful completion surface). Package: openai-agents v0.17.3 (Python 3.10+). State: previous_response_id + store: true. MCP: first-class. Provider-agnostic: 100+ LLMs. Sandbox Agents pattern (UnixLocalSandboxClient) for local container execution. No hosted sandbox — you provision and pay for the sandbox compute separately.

Best for: provider-agnostic agent builds, local-first control, teams that prefer to own the runtime layer.

The choice axis is not primarily about model capability — all three can route to competitive frontier models. The real decision is runtime location and trust model. Google's hosted sandbox removes operational burden at the cost of full vendor dependency. Anthropic's self-hosted option adds operational responsibility in exchange for data residency control. OpenAI's local-first SDK gives maximum flexibility and provider optionality at the cost of running and paying for the execution environment yourself.

For teams building on multiple model providers, our OpenAI Agents SDK vs LangGraph vs CrewAI matrix covers the code-first framework tier, where ADK 2.0, LangGraph, and CrewAI sit. For the model-under-the-hood comparison on agentic coding benchmarks, see our Gemini 3.5 Flash vs GPT-5.5 vs Opus 4.7 head-to-head.

08Google Agent StackGoogle's four-tier agent stack — where each layer fits.

Google I/O 2026 didn't ship one agent product — it shipped a complete stack. The four tiers share the same Antigravity harness underneath but address fundamentally different developer personas and operational trade-offs. Most I/O coverage lists these products as a feature list. The more useful framing is a coordinate system: one axis is hosted vs. self-hosted runtime; the other is low-code vs. code-first orchestration.

Low-code
Agent Studio
Studio

Visual, low-code agent builder for product teams and non-engineers. Drag-and-drop workflow composition over the Antigravity harness. Target audience: operators and product owners who need to deploy agents without writing orchestration code.

Hosted · Low-code
One-call REST
Managed Agents API
API

The Interactions API surface covered in this post. Single POST call, Google-hosted sandbox, Antigravity harness, Gemini 3.5 Flash default. Target audience: developers who want a managed runtime without the operational overhead of running an agent framework.

Hosted · REST · Preview May 19, 2026
IDE + CLI
Antigravity 2.0
ATV2

The developer IDE and CLI for building and testing agents locally against the Antigravity harness. AGENTS.md + SKILL.md are authored and debugged here before deployment. The same harness powers Managed Agents API in production.

Local dev · GA May 19, 2026
Code-first
ADK 2.0
ADK

Agent Development Kit 2.0 reached GA on May 19. Code-first multi-agent framework with a unified graph-based execution engine, Python / TypeScript / Go / Java / Kotlin (beta) support, and Agent-to-Agent (A2A) protocol for interop between layers. You write the orchestration; ADK executes it.

Self-hosted · Code-first · GA May 19, 2026

The coordinate-system framing resolves most "which Google product should I use" questions: if you want Google to run the agent loop, choose Agent Studio (no-code) or Managed Agents API (REST). If you want to run the loop yourself, choose Antigravity for local development and ADK 2.0 for production multi-agent orchestration. All four tiers use the same Antigravity harness, which means agents authored for one tier can be promoted to another with minimal rework.

The ADK 2.0 GA is the most significant independent milestone in the stack. It adds Go, Java, and Kotlin (beta) alongside the existing Python and TypeScript support, and introduces the Agent-to-Agent (A2A) protocol for cross-layer and cross-vendor agent interop. ADK 2.0 is the code-first sibling to Managed Agents — you build the orchestration yourself instead of paying Google to host it, giving you full control over execution, cost, and observability at the trade-off of operational overhead.

09OutlookGA timeline, tool expansion, and the Gemini CLI sunset.

Three near-term developments are worth tracking for any team evaluating the Managed Agents API for production roadmaps.

Preview-to-GA tool expansion. The most immediately consequential gap is the preview lock on file_search, computer_use, google_maps, function_calling, and mcp_server for managed-agent runs. Google has not announced a specific GA timeline for the Managed Agents API, but the declared tool schema suggests these capabilities are engineered — the preview limit is a feature-flag gate, not an architectural constraint. The mcp_server gap in particular is the most significant for teams building on the MCP ecosystem, since every other frontier managed runtime treats MCP as first-class.

Vertex AI enterprise path.The private preview on the Gemini Enterprise Agent Platform brings IAM-based access controls, enterprise SLAs, and the Vertex AI compliance posture to the Managed Agents runtime. As this path opens to broader enterprise access, regulated industries — financial services, healthcare, public sector — gain a viable hosted-agent option that inherits Vertex's existing compliance certifications. Watch for a GA announcement on the enterprise path to signal when production-scale regulated deployments become feasible without additional certification work.

Gemini CLI sunset — June 18, 2026. Google announced that the Gemini CLI will be sunset on June 18, 2026. Pro and Ultra customers using the CLI today must migrate to the Antigravity + Managed Agents stack before that date. This is the most immediate practical pressure for existing Gemini developer customers — not a theoretical future consideration, but a five-week migration window from the date of this post. The migration path leads directly into the Antigravity IDE and the Managed Agents API as the replacement developer surface.

The broader trend these three developments point to is a compression of the "agent deployment" surface area. Compute pricing, tool parity with self-hosted alternatives, and enterprise compliance are the three unlock conditions for wide production adoption. The preview window is the right time to benchmark agent loop economics and tool coverage against your specific workloads — before GA pricing changes the arithmetic. For teams building production-grade agentic workflows, our AI transformation engagements start with exactly this kind of comparative evaluation across managed and self-hosted runtimes.

Conclusion

One REST call, a Linux sandbox, and a naming collision that defines the week.

The Managed Agents API ships Google's bet that the future agent surface is a single REST call. POST /v1beta/interactions spins up an isolated Linux sandbox running the Antigravity harness with Gemini 3.5 Flash as the default agent. The state pattern — previous_interaction_id— mirrors OpenAI's Responses API; the AGENTS.md + SKILL.md model replaces orchestration code with version-controlled markdown. Compute is free during preview; model usage bills at standard Gemini 3.5 Flash rates ($1.50/$9.00 per Mtok input/output).

The name collision with Anthropic's Claude Managed Agents — shipped the same morning at Code with Claude London — is the cleanest framing of the week. Google's version is hosted-only; Anthropic's runs in your perimeter via Cloudflare, Daytona, Modal, or Vercel. Same product category, opposite trust models. OpenAI's Agents SDK plus Responses API is the third pole — provider-agnostic and local-first. The "managed agent" category has split into three distinct runtime philosophies in a single morning.

What to watch: the preview-tool gaps (file_search, computer_use, mcp_server) closing toward GA; compute pricing surfacing as the runtime matures; the Vertex AI enterprise path widening for regulated industries. And most immediately: the Gemini CLI sunset on June 18 forces Pro and Ultra customers onto the Antigravity + Managed Agents stack — that migration is the practical pressure for any team already building on Google's developer tooling.

Navigate the managed-agent landscape

One REST call to an agent runtime — we help you choose the right one.

Our team helps engineering and product organizations evaluate managed-agent runtimes — Google Managed Agents, Anthropic Claude Managed Agents, OpenAI Agents SDK — and design the right runtime architecture for compliance, cost, and scale requirements.

Free consultationExpert guidanceTailored solutions
What we work on

Agentic runtime engagements

  • Managed-agent runtime selection — Google vs Anthropic vs OpenAI
  • Preview-to-production migration planning for Managed Agents API
  • Gemini CLI sunset migration — Antigravity + Managed Agents stack
  • Agent cost modeling — compute + token economics at scale
  • Compliance evaluation for hosted vs self-hosted sandbox runtimes
FAQ

The questions teams ask about managed-agent runtimes.

Google's Managed Agents API is a hosted agent runtime launched May 19, 2026 as a public preview. A single POST to /v1beta/interactions spins up a Gemini 3.5 Flash agent in an isolated Linux sandbox with code execution, web search, and URL fetch built in. You specify an agent field (pointing to a managed agent ID like antigravity-preview-05-2026) rather than a model field, and Google runs the agent loop inside its own infrastructure. No orchestration framework, no sandbox provisioning, no agent loop to maintain on your side.