Google I/O 2026 opens in 48 hours — and for the first time in the conference's history, the headline launch is expected to be a Flash-tier model rather than a Pro. Gemini 3.5 Flash is rumored to hit the developer keynote on Tuesday May 19 with a stable gemini-3.5-flash model ID, leaked pricing of $1.50/$9.00 per million tokens, and a new thinking_level API surface replacing the old integer budget.
The inversion matters. Google's usual cadence has always been Pro first, Flash as the trimmed-down follow-up. Shipping 3.5 Flash before 3.5 Pro is a deliberate “small models win developer mindshare” narrative — and it lands in the same week that Anthropic opens Code with Claude in London. The May 19-20 window is the single most compressed AI-news period of the quarter.
This guide separates confirmed agenda items from pre-keynote leaks from pure analyst speculation, deep-dives the rumored Gemini 3.5 Flash API surface and Antigravity 2.0 expectations, and tells you exactly what to pre-bookmark before the stream goes live Monday morning.
- 01Gemini 3.5 Flash is the headline — not Pro.Industry watchers expect Google to ship the stable gemini-3.5-flash model ID at the developer keynote, with Gemini 3.5 Pro reportedly held back for June. Leaked pricing: $1.50 in / $9.00 out per Mtok, 1.05M input context window.
- 02The thinking_level API replaces the integer budget.Pre-keynote leaks from AI Studio changelog feeds suggest the new Flash introduces a string-enum thinking_level prop (minimal, low, medium, high) in place of the numeric thinking_budget. Bookmark gemini-3.5-flash as the stable model ID to swap in the day it lands.
- 03Antigravity 2.0 adds CLI, SDK, and scheduled tasks.The successor to the Nov 18, 2025 Antigravity 1.0 (76.2% SWE-Bench on Gemini 3 Pro) is expected to ship on Gemini 3.5 Flash with a native desktop app, CLI surface, SDK, and slash-command support — setting a new SWE-Bench baseline.
- 04Gemini Spark is Google's 24/7 personal AI agent play.Reportedly rolling out to trusted testers on launch day and entering Beta for Google AI Ultra subscribers the following week, Spark is expected to integrate Gmail, Calendar, Drive, YouTube, Maps, plus third-party apps including Canva, OpenTable, and Instacart.
- 05Gemini 4 is not expected — Polymarket odds favor 3.x.Polymarket odds have sat below 25% for a Gemini 4 announcement throughout Q1-Q2 2026. Expect a 3.5 generation bump with Deep Think and Code Assist updates, not a new major version number.
01 — Keynote ScheduleShoreline Amphitheatre, Tuesday May 19 — 10:00 AM PT livestream.
Google I/O 2026 runs Tuesday-Wednesday May 19-20, with a Monday May 18 community pre-event. The opening keynote is delivered by Sundar Pichai at 10:00 AM PT from the Shoreline Amphitheatre in Mountain View — livestreamed free on YouTube. The developer keynote follows immediately, led by Josh Woodward (VP, Google Labs and Gemini app) and Hema Budaraju (VP, AI Mode in Search). Both are on the confirmed agenda per io.google/2026.
The breakout session library opens to logged-in Google accounts immediately after the opening keynote. Android 17 developer sessions — covering adaptive layouts, predictive back, and the new Foreground Services API — are confirmed on the breakout track. So are Gemini-in-Chrome sessions and the AI Mode in Search GA expansion track.
02 — Triage TableWhat is confirmed, what is leaked, what is speculation.
Most pre-I/O coverage mashes confirmed agenda items together with leaks and analyst conjecture. The table below applies a three-tier taxonomy — each row is tagged by evidence quality. CONFIRMED means it appears on the official io.google/2026 schedule. LEAKED means a named publication with direct sourcing has reported it as pre-keynote intelligence. SPECULATION means analyst or market-watcher inference with no named source.
Opening keynote + developer keynote
Both keynotes on the official io.google/2026 schedule. Sundar Pichai leads the opening; Josh Woodward + Hema Budaraju lead developer track. Free YouTube livestream.
AI Mode in Search breakout track
AI Mode in Search GA expansion and Gemini-in-Chrome sessions confirmed in the breakout library. Hema Budaraju named speaker for both.
Gemini 3.5 Flash stable launch
Stable model ID gemini-3.5-flash expected at developer keynote. Leaked pricing at $1.50/$9.00 per Mtok, 1.05M input / 65,536 output tokens. New thinking_level enum API surface replacing integer budget.
Antigravity 2.0 — CLI, SDK, desktop app
Successor to the Nov 18, 2025 Antigravity 1.0 (76.2% SWE-Bench). Rumored adds: native desktop app, CLI surface, SDK, scheduled tasks, slash-command support. Expected on Gemini 3.5 Flash.
Gemini Spark — personal AI agent (Beta)
24/7 personal AI agent built on Gemini 3.5 Flash. Trusted-tester rollout launch day; Google AI Ultra Beta the following week. Third-party integrations include Canva, OpenTable, Instacart.
Universal Cart — cross-merchant checkout
Speculated as the eCommerce-track headline. A cross-merchant shopping primitive spanning multiple Google surfaces with deal tracking and restock alerts. No named sourcing as of May 17.
Managed Agents API — single-call agent creation
Expected as a developer-keynote announcement: a single-API-call agent creation primitive with built-in reasoning, tool use, and isolated Linux execution environments. Paywalled sourcing only.
Gemini Omni — multimodal video generation
Speculated debut on tiered subscriber plans with SynthID watermarking and YouTube Shorts integration. Analyst inference only — no named leak source.
The takeaway from the triage: Gemini 3.5 Flash and Antigravity 2.0 have multi-outlet named sourcing and are the closest to certainty among unconfirmed items. Universal Cart and Gemini Omni are plausible given Google's public product roadmap but lack primary source corroboration as of today. Track the official blog.google “Everything announced at I/O” post — it typically goes live within an hour of the opening keynote closing.
03 — Gemini 3.5 FlashThe new API surface: thinking_level, model ID, and what the leaks say.
Pre-keynote leaks from AI Studio changelog feeds point to a new API surface for Gemini 3.5 Flash — the most developer-relevant change is the replacement of the thinking_budget integer with a string enum called thinking_level. The reported values are minimal, low, medium, and high, with mediumas the default. This mirrors the design philosophy already present in the Claude API's thinking block, which Anthropic shipped earlier this year.
The existing Gemini model IDs context: the current Flash tier sits at Gemini 3 Flash (launched Dec 17, 2025) and Gemini 3.1 Flash-Lite (GA May 7-8, 2026). Gemini 3.5 Flash is rumored to leapfrog both without replacing 3.1 Flash-Lite in the cost tier — they would coexist. The Gemini 3.1 Flash-Lite cost-tier breakdown documents that existing hierarchy before 3.5 changes the picture.
gemini-3.5-flash
Rumored stable model ID replacing gemini-3-flash-preview in the default tier. Bookmark this string — it is the one to swap into your API calls on Tuesday.
Input token window
Leaked context window of 1,050,000 input tokens — slightly larger than the current 1M cap in Gemini 3 Flash. Output reportedly capped at 65,536 tokens.
thinking_level enum
Leaked string-enum replacing the integer thinking_budget: minimal, low, medium (default), high. Each maps to a reasoning depth bracket — high is the frontier-quality mode.
On pricing, the leaked $1.50 input / $9.00 output per million tokens positions Gemini 3.5 Flash between the current 3 Flash ($0.30 in / $2.50 out) and Gemini 3.1 Pro ($4.50 in / $18.00 out). That gap is where agentic workloads — multiple sequential tool calls, long context chains, structured reasoning loops — currently sit without a well-priced option. If the leak holds, 3.5 Flash fills it directly. The verified Gemini 3.5 Flash benchmark guide will publish once confirmed specs are public after Tuesday's keynote.
04 — Antigravity 2.0From Gemini 3 Pro to Gemini 3.5 Flash — what changes.
Antigravity 1.0 launched November 18, 2025 as a VS Code fork with Manager View running five parallel agents, Chrome integration, and Gemini 3 Pro as the native model — hitting 76.2% on SWE-Bench at launch as a free preview. Our Antigravity 1.0 deep-dive in the AI-IDE landscape documented that baseline in detail.
Antigravity 2.0 is expected to ship on Gemini 3.5 Flash rather than a Pro-tier model — which is itself part of the Flash-first narrative. The leak profile from The Verge and TechCrunch points to four additions over the 1.0 baseline: a native desktop app (not a VS Code fork), a CLI surface, a first-party SDK, and scheduled tasks with slash-command support.
VS Code fork · Gemini 3 Pro
Launched Nov 18, 2025. Manager View runs 5 parallel agents. Chrome integration built in. 76.2% SWE-Bench at launch on free preview tier. Model: Gemini 3 Pro native.
Native desktop app · Gemini 3.5 Flash
Expected May 19, 2026 at developer keynote. Rumored additions: native desktop app (not a fork), CLI surface, first-party SDK, scheduled tasks, slash-command support. SWE-Bench target unknown until launch.
External AI IDEs
Cursor Composer 2.5 is launching imminently (May 18) and Windsurf continues to update rapidly. The competitive landscape for the IDE-native agentic coding market is at peak intensity this week. See the full comparison.
1.0 vs 2.0 — do not conflate
Most coverage will blur the two releases. Antigravity 1.0 is the Nov 18, 2025 product on Gemini 3 Pro. Antigravity 2.0 has not shipped as of May 17 — these are pre-keynote leaks, not announcements.
The Flash-first cadence at I/O 2026 signals something structural: small models are where Google competes for developer loyalty, not headline benchmark rankings.Digital Applied analysis, May 17, 2026
05 — AI Mode in SearchOne billion users — and a Search overhaul.
Hema Budaraju's session is the one that SEOs and search strategists should watch most carefully. Pre-event analyst notes from Search Engine Land expect the session to frame AI Mode as now serving over a billion people monthly — a milestone that would make it one of the fastest growth curves in Google Search's history. That figure remains a pre-keynote expectation, not a self-published Google number as of this writing.
What the confirmed session content tells us: AI Mode in Search GA expansion is on the breakout track. That means moving beyond the limited rollout into broader availability, likely U.S.-wide or global, depending on regulatory status. The practical implication for digital marketing teams is that the organic click distribution patterns measured in 2025 may shift again — AI Mode surfaces structured summaries rather than ten blue links, rewarding content that answers intent directly rather than optimizing for position. Our agentic SEO practice is structured around exactly this shift.
Beyond GA expansion, analyst notes speculate about intent anticipation — a feature that surfaces results or actions before the user completes a query, modeled on behavioral patterns across Search, Gmail, and Assistant. That remains in the speculation tier as of May 17; watch Budaraju's session for any mention of “proactive Search” framing.
06 — New SurfacesGemini Spark and the Managed Agents API.
Two surfaces are in the leaked-but-expected tier and are worth watching separately. Gemini Spark is the consumer story; the Managed Agents API is the developer infrastructure story.
Gemini Spark
A 24/7 personal AI agent built on Gemini 3.5 Flash. Trusted-tester rollout reportedly on launch day; Google AI Ultra subscriber Beta the following week. First-party integrations: Gmail, Calendar, Drive, YouTube, Maps. Third-party: Canva, OpenTable, Instacart. Source: 9to5Google pre-event.
Managed Agents API
A developer-facing API for creating agents in a single call — with built-in reasoning, tool use, and isolated Linux execution environments. Comparable to the Anthropic agent runtime announced earlier in 2026. Paywalled leak sourcing from The Information.
Google AI Ultra
The expected gating tier for Spark Beta, Information Agents, and highest Antigravity 2.0 quotas. Google AI Ultra was introduced in 2025 — this I/O is expected to stack more product value onto that tier to justify the price point.
The Managed Agents API, if it ships as speculated, would place Google in direct competition with Anthropic's agent runtime and OpenAI's Responses API at the infrastructure layer. The three labs are converging on a similar primitive: a managed execution environment where reasoning, tool calls, and code execution are bundled together rather than left to developer scaffolding. For teams already building on the agentic coding agent landscape, this represents a shift from DIY orchestration to platform-managed agent infrastructure.
07 — Strategic AnalysisThe Flash-first inversion — what it means for the next 12 months.
Google's historical cadence has been consistent: ship the Pro tier first, establishing the quality ceiling, then release Flash as the efficient derivative 8-16 weeks later. Every Gemini generation from 1.0 through 3.1 followed this pattern. Gemini 3.5 breaks it — Flash ships first, Pro is reportedly held for June 2026.
This is not an accident. It is a deliberate signal that the Flash-Pro intelligence gap has narrowed to the point where Flash can lead the narrative. The Flash-first move makes sense only if Google believes 3.5 Flash is meaningfully better on agentic and coding benchmarks than 3.1 Pro — because that is the audience the developer keynote is playing to. Ship Flash first, own developer mindshare with the affordable, fast model, then ship Pro to the enterprise and research audience next month.
The forward projection: if 3.5 establishes Flash-first as the new pattern, expect every subsequent Google model generation to lead with the Flash tier. That reframes how developer teams should track Google's model roadmap — the Flash launch date becomes the primary planning horizon, not the Pro launch date. Teams currently building on where Gemini 3.1 Pro sits today should pre-budget for a 3.5 Flash evaluation cycle starting Tuesday.
Gemini Pro vs Flash release order — every generation
Source: ai.google.dev model timeline + Digital Applied analysisThe horizontal bars above show every generation in the same order — Pro then Flash — until 3.5. The inversion is not subtle. Pre-event analysis from Stratechery has framed this as Google shifting developer-acquisition strategy: instead of winning with the most powerful model, win with the most affordable model that is still capable enough for production agentic workloads. Whether that framing lands depends on what the actual 3.5 Flash benchmark numbers show — and those won't be available until the keynote on Tuesday.
08 — ContextThree events, one window.
The May 19-20 window is unusual even by the standards of the compressed 2026 AI calendar. Three major events run in parallel — and the developer audience has to choose where to point its attention.
Google I/O 2026
The primary event. Consumer + developer keynotes, 300+ sessions, Android 17, AI Mode, Gemini 3.5 Flash expected. Free YouTube livestream. Developer breakouts require a Google account.
Code with Claude
Anthropic's in-person developer conference opens the same two days. Claude 4 API deep-dives, agentic patterns, and the Anthropic SDK roadmap. Directly counter-programmed against I/O. See the Code with Claude preview for what Anthropic is expected to announce.
SAP Sapphire 2026
SAP's flagship enterprise conference starts the same Monday. Enterprise AI-in-ERP announcements, Joule AI agent updates, and SAP Business AI partner ecosystem sessions. Relevant for teams evaluating AI transformation at the infrastructure layer.
The counter-programming collision is not coincidental at the industry level — major conferences cluster around the same windows because enterprise calendar constraints make any other scheduling harder. But for developers tracking all three spaces, the practical move is to watch the Google I/O opening keynote live on Tuesday morning, catch the Code with Claude London sessions in the afternoon archive (Anthropic typically releases session recordings within hours), and scan the SAP Sapphire enterprise AI track for any partner announcements that affect your stack.
The Code with Claude London preview covers what Anthropic is expected to ship in that window. The two previews are designed to be read together as complementary context for the same 48-hour period.
09 — Developer PrepWhat to pre-do before Monday's stream.
The developer keynote moves fast — announced model IDs, pricing tables, and API surfaces go from slide to live documentation in minutes. Developers who are already oriented land in a much better position to evaluate quickly. Here is what to set up before Tuesday.
gemini-3.5-flash in your config
Swap gemini-3-flash-preview for gemini-3.5-flash in your model config files before the keynote. The stable ID is the leaked target — you can update the surrounding params (thinking_level, context window) on Tuesday once confirmed.
AI-IDE comparison baseline
Re-read the Antigravity 1.0 baseline in the AI-IDE comparison before Antigravity 2.0 ships. Understanding the Manager View / Chrome / SWE-Bench baseline is required to evaluate the 2.0 delta claims accurately.
Developer keynote · not just the opening
The opening keynote (Sundar Pichai) is the consumer story. The developer keynote (Josh Woodward + Hema Budaraju) is where the API surfaces, model IDs, and SDK changes land. If you only watch one session, watch the developer keynote.
thinking_budget usage in your codebase
If you use the Gemini API with the thinking_budget integer parameter, audit those call sites before Tuesday. If the thinking_level enum is confirmed, you will want to migrate quickly to avoid relying on deprecated integer behavior.
Flash first is a strategy, not just a release order.
The Google I/O 2026 keynote is worth watching live for three things: the confirmed pricing and benchmark numbers for Gemini 3.5 Flash (to verify or refute the leaks), the Antigravity 2.0 demo (to see how the CLI and SDK surface actually work), and the AI Mode in Search milestone number (to understand how aggressively Google is framing the shift away from the ten-blue-links model). Everything else — Universal Cart, Gemini Omni, Deep Think upgrades — is secondary context that can be read in the post-keynote recap.
What is likely to actually ship on Tuesday: Gemini 3.5 Flash (high confidence from named leak sourcing), Antigravity 2.0 (high confidence), AI Mode GA expansion (confirmed in breakout schedule), Gemini Spark Beta for Ultra subscribers (medium confidence). What is less certain: the Managed Agents API, Universal Cart, Gemini Omni. Set your priors accordingly and avoid treating pre-keynote leaks as confirmed product launches.
The Flash-first move, if it holds, reshapes how developer teams should track Google's model roadmap for the next 12 months. Flash stops being a trimmed-down derivative and becomes the primary developer surface. Pro becomes the specialized high-compute option shipped later to a smaller audience. That is a meaningful shift in how to think about the economics of building on Gemini — and it is worth revisiting once the actual 3.5 Flash benchmark numbers are public on Tuesday.