AI DevelopmentForecast13 min readPublished May 17, 2026

Confirmed agenda items · leaked specs · pure speculation — triaged in one place

Google I/O 2026: What Gemini 3.5 Actually Means

Google I/O 2026 opens Tuesday May 19. Gemini 3.5 Flash is expected as the headline launch — rumored at $1.50/$9.00 per Mtok and 1.05M input context — with Gemini 3.5 Pro held back for June. Antigravity 2.0, Gemini Spark, and AI Mode at 1B users round out the surface. Here is what developers should watch and what to pre-bookmark before Monday.

DA
Digital Applied Team
Senior strategists · Published May 17, 2026
PublishedMay 17, 2026
Read time13 min
Sources12 primary
Keynote start
10 AM PT
Tuesday May 19
Shoreline Amphitheatre
Expected Flash pricing
$1.50/Mtok in
$9.00/Mtok out (leak)
rumored
Expected Flash context
1.05M
input tokens (leak)
pre-keynote leak
Antigravity 1.0 SWE-Bench
76.2%
current baseline
Nov 18, 2025

Google I/O 2026 opens in 48 hours — and for the first time in the conference's history, the headline launch is expected to be a Flash-tier model rather than a Pro. Gemini 3.5 Flash is rumored to hit the developer keynote on Tuesday May 19 with a stable gemini-3.5-flash model ID, leaked pricing of $1.50/$9.00 per million tokens, and a new thinking_level API surface replacing the old integer budget.

The inversion matters. Google's usual cadence has always been Pro first, Flash as the trimmed-down follow-up. Shipping 3.5 Flash before 3.5 Pro is a deliberate “small models win developer mindshare” narrative — and it lands in the same week that Anthropic opens Code with Claude in London. The May 19-20 window is the single most compressed AI-news period of the quarter.

This guide separates confirmed agenda items from pre-keynote leaks from pure analyst speculation, deep-dives the rumored Gemini 3.5 Flash API surface and Antigravity 2.0 expectations, and tells you exactly what to pre-bookmark before the stream goes live Monday morning.

Key takeaways
  1. 01
    Gemini 3.5 Flash is the headline — not Pro.Industry watchers expect Google to ship the stable gemini-3.5-flash model ID at the developer keynote, with Gemini 3.5 Pro reportedly held back for June. Leaked pricing: $1.50 in / $9.00 out per Mtok, 1.05M input context window.
  2. 02
    The thinking_level API replaces the integer budget.Pre-keynote leaks from AI Studio changelog feeds suggest the new Flash introduces a string-enum thinking_level prop (minimal, low, medium, high) in place of the numeric thinking_budget. Bookmark gemini-3.5-flash as the stable model ID to swap in the day it lands.
  3. 03
    Antigravity 2.0 adds CLI, SDK, and scheduled tasks.The successor to the Nov 18, 2025 Antigravity 1.0 (76.2% SWE-Bench on Gemini 3 Pro) is expected to ship on Gemini 3.5 Flash with a native desktop app, CLI surface, SDK, and slash-command support — setting a new SWE-Bench baseline.
  4. 04
    Gemini Spark is Google's 24/7 personal AI agent play.Reportedly rolling out to trusted testers on launch day and entering Beta for Google AI Ultra subscribers the following week, Spark is expected to integrate Gmail, Calendar, Drive, YouTube, Maps, plus third-party apps including Canva, OpenTable, and Instacart.
  5. 05
    Gemini 4 is not expected — Polymarket odds favor 3.x.Polymarket odds have sat below 25% for a Gemini 4 announcement throughout Q1-Q2 2026. Expect a 3.5 generation bump with Deep Think and Code Assist updates, not a new major version number.

01Keynote ScheduleShoreline Amphitheatre, Tuesday May 19 — 10:00 AM PT livestream.

Google I/O 2026 runs Tuesday-Wednesday May 19-20, with a Monday May 18 community pre-event. The opening keynote is delivered by Sundar Pichai at 10:00 AM PT from the Shoreline Amphitheatre in Mountain View — livestreamed free on YouTube. The developer keynote follows immediately, led by Josh Woodward (VP, Google Labs and Gemini app) and Hema Budaraju (VP, AI Mode in Search). Both are on the confirmed agenda per io.google/2026.

The breakout session library opens to logged-in Google accounts immediately after the opening keynote. Android 17 developer sessions — covering adaptive layouts, predictive back, and the new Foreground Services API — are confirmed on the breakout track. So are Gemini-in-Chrome sessions and the AI Mode in Search GA expansion track.

Watch these two sessions
The developer keynote (Josh Woodward + Hema Budaraju) is the session where Gemini 3.5 Flash and Antigravity 2.0 are most likely to land. The AI Mode in Search breakout (Hema Budaraju) is the venue for the expected 1 billion users milestone reveal. Both stream live and are archived immediately — watch the developer keynote first if you can only pick one.

02Triage TableWhat is confirmed, what is leaked, what is speculation.

Most pre-I/O coverage mashes confirmed agenda items together with leaks and analyst conjecture. The table below applies a three-tier taxonomy — each row is tagged by evidence quality. CONFIRMED means it appears on the official io.google/2026 schedule. LEAKED means a named publication with direct sourcing has reported it as pre-keynote intelligence. SPECULATION means analyst or market-watcher inference with no named source.

CONFIRMED
Opening keynote + developer keynote
May 19, 10:00 AM PT · Shoreline

Both keynotes on the official io.google/2026 schedule. Sundar Pichai leads the opening; Josh Woodward + Hema Budaraju lead developer track. Free YouTube livestream.

Source: io.google/2026
CONFIRMED
AI Mode in Search breakout track
Developer sessions — GA expansion

AI Mode in Search GA expansion and Gemini-in-Chrome sessions confirmed in the breakout library. Hema Budaraju named speaker for both.

Source: io.google/2026 schedule
LEAKED
Gemini 3.5 Flash stable launch
$1.50 in / $9.00 out per Mtok · 1.05M context

Stable model ID gemini-3.5-flash expected at developer keynote. Leaked pricing at $1.50/$9.00 per Mtok, 1.05M input / 65,536 output tokens. New thinking_level enum API surface replacing integer budget.

Source: Bloomberg, The Information, 9to5Google
LEAKED
Antigravity 2.0 — CLI, SDK, desktop app
Gemini 3.5 Flash native · built on Antigravity 1.0

Successor to the Nov 18, 2025 Antigravity 1.0 (76.2% SWE-Bench). Rumored adds: native desktop app, CLI surface, SDK, scheduled tasks, slash-command support. Expected on Gemini 3.5 Flash.

Source: The Verge, TechCrunch pre-event
LEAKED
Gemini Spark — personal AI agent (Beta)
Gmail · Calendar · Drive · Canva · OpenTable

24/7 personal AI agent built on Gemini 3.5 Flash. Trusted-tester rollout launch day; Google AI Ultra Beta the following week. Third-party integrations include Canva, OpenTable, Instacart.

Source: 9to5Google pre-event
SPECULATION
Universal Cart — cross-merchant checkout
Search · Gemini · YouTube · Gmail

Speculated as the eCommerce-track headline. A cross-merchant shopping primitive spanning multiple Google surfaces with deal tracking and restock alerts. No named sourcing as of May 17.

Source: Retail Dive analyst inference
SPECULATION
Managed Agents API — single-call agent creation
Reasoning + tools + isolated Linux environments

Expected as a developer-keynote announcement: a single-API-call agent creation primitive with built-in reasoning, tool use, and isolated Linux execution environments. Paywalled sourcing only.

Source: The Information (paywalled)
SPECULATION
Gemini Omni — multimodal video generation
AI Plus / Pro / Ultra tiers · SynthID watermarking

Speculated debut on tiered subscriber plans with SynthID watermarking and YouTube Shorts integration. Analyst inference only — no named leak source.

Source: The Verge analyst inference

The takeaway from the triage: Gemini 3.5 Flash and Antigravity 2.0 have multi-outlet named sourcing and are the closest to certainty among unconfirmed items. Universal Cart and Gemini Omni are plausible given Google's public product roadmap but lack primary source corroboration as of today. Track the official blog.google “Everything announced at I/O” post — it typically goes live within an hour of the opening keynote closing.

03Gemini 3.5 FlashThe new API surface: thinking_level, model ID, and what the leaks say.

Pre-keynote leaks from AI Studio changelog feeds point to a new API surface for Gemini 3.5 Flash — the most developer-relevant change is the replacement of the thinking_budget integer with a string enum called thinking_level. The reported values are minimal, low, medium, and high, with mediumas the default. This mirrors the design philosophy already present in the Claude API's thinking block, which Anthropic shipped earlier this year.

The existing Gemini model IDs context: the current Flash tier sits at Gemini 3 Flash (launched Dec 17, 2025) and Gemini 3.1 Flash-Lite (GA May 7-8, 2026). Gemini 3.5 Flash is rumored to leapfrog both without replacing 3.1 Flash-Lite in the cost tier — they would coexist. The Gemini 3.1 Flash-Lite cost-tier breakdown documents that existing hierarchy before 3.5 changes the picture.

Model ID (stable)
gemini-3.5-flash
3.5Flash

Rumored stable model ID replacing gemini-3-flash-preview in the default tier. Bookmark this string — it is the one to swap into your API calls on Tuesday.

Leaked: ai.google.dev pre-event feeds
Input context (leaked)
Input token window
1.05M

Leaked context window of 1,050,000 input tokens — slightly larger than the current 1M cap in Gemini 3 Flash. Output reportedly capped at 65,536 tokens.

Output: 65,536 tokens
Thinking API surface
thinking_level enum
4levels

Leaked string-enum replacing the integer thinking_budget: minimal, low, medium (default), high. Each maps to a reasoning depth bracket — high is the frontier-quality mode.

Default: medium

On pricing, the leaked $1.50 input / $9.00 output per million tokens positions Gemini 3.5 Flash between the current 3 Flash ($0.30 in / $2.50 out) and Gemini 3.1 Pro ($4.50 in / $18.00 out). That gap is where agentic workloads — multiple sequential tool calls, long context chains, structured reasoning loops — currently sit without a well-priced option. If the leak holds, 3.5 Flash fills it directly. The verified Gemini 3.5 Flash benchmark guide will publish once confirmed specs are public after Tuesday's keynote.

04Antigravity 2.0From Gemini 3 Pro to Gemini 3.5 Flash — what changes.

Antigravity 1.0 launched November 18, 2025 as a VS Code fork with Manager View running five parallel agents, Chrome integration, and Gemini 3 Pro as the native model — hitting 76.2% on SWE-Bench at launch as a free preview. Our Antigravity 1.0 deep-dive in the AI-IDE landscape documented that baseline in detail.

Antigravity 2.0 is expected to ship on Gemini 3.5 Flash rather than a Pro-tier model — which is itself part of the Flash-first narrative. The leak profile from The Verge and TechCrunch points to four additions over the 1.0 baseline: a native desktop app (not a VS Code fork), a CLI surface, a first-party SDK, and scheduled tasks with slash-command support.

Antigravity 1.0
VS Code fork · Gemini 3 Pro

Launched Nov 18, 2025. Manager View runs 5 parallel agents. Chrome integration built in. 76.2% SWE-Bench at launch on free preview tier. Model: Gemini 3 Pro native.

Current baseline (Nov 2025)
Antigravity 2.0
Native desktop app · Gemini 3.5 Flash

Expected May 19, 2026 at developer keynote. Rumored additions: native desktop app (not a fork), CLI surface, first-party SDK, scheduled tasks, slash-command support. SWE-Bench target unknown until launch.

Expected at I/O Day 1
Cursor / Windsurf
External AI IDEs

Cursor Composer 2.5 is launching imminently (May 18) and Windsurf continues to update rapidly. The competitive landscape for the IDE-native agentic coding market is at peak intensity this week. See the full comparison.

Competitor context
Date discipline
1.0 vs 2.0 — do not conflate

Most coverage will blur the two releases. Antigravity 1.0 is the Nov 18, 2025 product on Gemini 3 Pro. Antigravity 2.0 has not shipped as of May 17 — these are pre-keynote leaks, not announcements.

Clarification note
The Flash-first cadence at I/O 2026 signals something structural: small models are where Google competes for developer loyalty, not headline benchmark rankings.Digital Applied analysis, May 17, 2026

Hema Budaraju's session is the one that SEOs and search strategists should watch most carefully. Pre-event analyst notes from Search Engine Land expect the session to frame AI Mode as now serving over a billion people monthly — a milestone that would make it one of the fastest growth curves in Google Search's history. That figure remains a pre-keynote expectation, not a self-published Google number as of this writing.

What the confirmed session content tells us: AI Mode in Search GA expansion is on the breakout track. That means moving beyond the limited rollout into broader availability, likely U.S.-wide or global, depending on regulatory status. The practical implication for digital marketing teams is that the organic click distribution patterns measured in 2025 may shift again — AI Mode surfaces structured summaries rather than ten blue links, rewarding content that answers intent directly rather than optimizing for position. Our agentic SEO practice is structured around exactly this shift.

Beyond GA expansion, analyst notes speculate about intent anticipation — a feature that surfaces results or actions before the user completes a query, modeled on behavioral patterns across Search, Gmail, and Assistant. That remains in the speculation tier as of May 17; watch Budaraju's session for any mention of “proactive Search” framing.

06New SurfacesGemini Spark and the Managed Agents API.

Two surfaces are in the leaked-but-expected tier and are worth watching separately. Gemini Spark is the consumer story; the Managed Agents API is the developer infrastructure story.

Consumer
Gemini Spark
24/7 personal agent · Google AI Ultra Beta

A 24/7 personal AI agent built on Gemini 3.5 Flash. Trusted-tester rollout reportedly on launch day; Google AI Ultra subscriber Beta the following week. First-party integrations: Gmail, Calendar, Drive, YouTube, Maps. Third-party: Canva, OpenTable, Instacart. Source: 9to5Google pre-event.

Leaked: 9to5Google
Developer
Managed Agents API
Single call · reasoning + tools + Linux exec

A developer-facing API for creating agents in a single call — with built-in reasoning, tool use, and isolated Linux execution environments. Comparable to the Anthropic agent runtime announced earlier in 2026. Paywalled leak sourcing from The Information.

Leaked: The Information (paywalled)
Subscriber tier
Google AI Ultra
$200+/mo · gates Spark, Antigravity 2.0 quotas

The expected gating tier for Spark Beta, Information Agents, and highest Antigravity 2.0 quotas. Google AI Ultra was introduced in 2025 — this I/O is expected to stack more product value onto that tier to justify the price point.

Speculation: blog.google AI Ultra page

The Managed Agents API, if it ships as speculated, would place Google in direct competition with Anthropic's agent runtime and OpenAI's Responses API at the infrastructure layer. The three labs are converging on a similar primitive: a managed execution environment where reasoning, tool calls, and code execution are bundled together rather than left to developer scaffolding. For teams already building on the agentic coding agent landscape, this represents a shift from DIY orchestration to platform-managed agent infrastructure.

07Strategic AnalysisThe Flash-first inversion — what it means for the next 12 months.

Google's historical cadence has been consistent: ship the Pro tier first, establishing the quality ceiling, then release Flash as the efficient derivative 8-16 weeks later. Every Gemini generation from 1.0 through 3.1 followed this pattern. Gemini 3.5 breaks it — Flash ships first, Pro is reportedly held for June 2026.

This is not an accident. It is a deliberate signal that the Flash-Pro intelligence gap has narrowed to the point where Flash can lead the narrative. The Flash-first move makes sense only if Google believes 3.5 Flash is meaningfully better on agentic and coding benchmarks than 3.1 Pro — because that is the audience the developer keynote is playing to. Ship Flash first, own developer mindshare with the affordable, fast model, then ship Pro to the enterprise and research audience next month.

The forward projection: if 3.5 establishes Flash-first as the new pattern, expect every subsequent Google model generation to lead with the Flash tier. That reframes how developer teams should track Google's model roadmap — the Flash launch date becomes the primary planning horizon, not the Pro launch date. Teams currently building on where Gemini 3.1 Pro sits today should pre-budget for a 3.5 Flash evaluation cycle starting Tuesday.

Gemini Pro vs Flash release order — every generation

Source: ai.google.dev model timeline + Digital Applied analysis
Gemini 1.0 Pro → FlashPro first · Flash released ~8 weeks later
Pro → Flash
Gemini 1.5 Pro → FlashPro first · Flash at Google I/O 2024
Pro → Flash
Gemini 2.0 Pro → FlashPro first · Flash-Thinking, Flash-Lite followed
Pro → Flash
Gemini 3.0 Pro → FlashPro first (Gemini 3 Pro) · Gemini 3 Flash Dec 17, 2025
Pro → Flash
Gemini 3.5 Flash → ProFlash FIRST (I/O May 19) · Pro expected June 2026
INVERTED

The horizontal bars above show every generation in the same order — Pro then Flash — until 3.5. The inversion is not subtle. Pre-event analysis from Stratechery has framed this as Google shifting developer-acquisition strategy: instead of winning with the most powerful model, win with the most affordable model that is still capable enough for production agentic workloads. Whether that framing lands depends on what the actual 3.5 Flash benchmark numbers show — and those won't be available until the keynote on Tuesday.

08ContextThree events, one window.

The May 19-20 window is unusual even by the standards of the compressed 2026 AI calendar. Three major events run in parallel — and the developer audience has to choose where to point its attention.

Mountain View
Google I/O 2026
May 19-20 · Shoreline Amphitheatre

The primary event. Consumer + developer keynotes, 300+ sessions, Android 17, AI Mode, Gemini 3.5 Flash expected. Free YouTube livestream. Developer breakouts require a Google account.

Free livestream: io.google/2026
London
Code with Claude
May 19-20 · Anthropic developer event

Anthropic's in-person developer conference opens the same two days. Claude 4 API deep-dives, agentic patterns, and the Anthropic SDK roadmap. Directly counter-programmed against I/O. See the Code with Claude preview for what Anthropic is expected to announce.

anthropic.com/events
Madrid
SAP Sapphire 2026
Opens May 19 · Enterprise ERP + AI track

SAP's flagship enterprise conference starts the same Monday. Enterprise AI-in-ERP announcements, Joule AI agent updates, and SAP Business AI partner ecosystem sessions. Relevant for teams evaluating AI transformation at the infrastructure layer.

sap.com/sapphire

The counter-programming collision is not coincidental at the industry level — major conferences cluster around the same windows because enterprise calendar constraints make any other scheduling harder. But for developers tracking all three spaces, the practical move is to watch the Google I/O opening keynote live on Tuesday morning, catch the Code with Claude London sessions in the afternoon archive (Anthropic typically releases session recordings within hours), and scan the SAP Sapphire enterprise AI track for any partner announcements that affect your stack.

The Code with Claude London preview covers what Anthropic is expected to ship in that window. The two previews are designed to be read together as complementary context for the same 48-hour period.

09Developer PrepWhat to pre-do before Monday's stream.

The developer keynote moves fast — announced model IDs, pricing tables, and API surfaces go from slide to live documentation in minutes. Developers who are already oriented land in a much better position to evaluate quickly. Here is what to set up before Tuesday.

Bookmark
gemini-3.5-flash in your config

Swap gemini-3-flash-preview for gemini-3.5-flash in your model config files before the keynote. The stable ID is the leaked target — you can update the surrounding params (thinking_level, context window) on Tuesday once confirmed.

Do before Monday
Review
AI-IDE comparison baseline

Re-read the Antigravity 1.0 baseline in the AI-IDE comparison before Antigravity 2.0 ships. Understanding the Manager View / Chrome / SWE-Bench baseline is required to evaluate the 2.0 delta claims accurately.

Do before Monday
Watch live
Developer keynote · not just the opening

The opening keynote (Sundar Pichai) is the consumer story. The developer keynote (Josh Woodward + Hema Budaraju) is where the API surfaces, model IDs, and SDK changes land. If you only watch one session, watch the developer keynote.

Tuesday 10 AM PT
Audit
thinking_budget usage in your codebase

If you use the Gemini API with the thinking_budget integer parameter, audit those call sites before Tuesday. If the thinking_level enum is confirmed, you will want to migrate quickly to avoid relying on deprecated integer behavior.

Do before Monday
Agentic SEO implication
AI Mode in Search GA expansion at I/O 2026 could be the single most consequential announcement for digital marketing teams in Q2 2026. Broader rollout accelerates the shift from position-1 CTR optimization to intent-match content strategy. If you have not audited your content architecture against AI Mode's structured summary format, Tuesday is a good forcing function. Our agentic SEO service is built for exactly this transition.
What to watch at I/O 2026

Flash first is a strategy, not just a release order.

The Google I/O 2026 keynote is worth watching live for three things: the confirmed pricing and benchmark numbers for Gemini 3.5 Flash (to verify or refute the leaks), the Antigravity 2.0 demo (to see how the CLI and SDK surface actually work), and the AI Mode in Search milestone number (to understand how aggressively Google is framing the shift away from the ten-blue-links model). Everything else — Universal Cart, Gemini Omni, Deep Think upgrades — is secondary context that can be read in the post-keynote recap.

What is likely to actually ship on Tuesday: Gemini 3.5 Flash (high confidence from named leak sourcing), Antigravity 2.0 (high confidence), AI Mode GA expansion (confirmed in breakout schedule), Gemini Spark Beta for Ultra subscribers (medium confidence). What is less certain: the Managed Agents API, Universal Cart, Gemini Omni. Set your priors accordingly and avoid treating pre-keynote leaks as confirmed product launches.

The Flash-first move, if it holds, reshapes how developer teams should track Google's model roadmap for the next 12 months. Flash stops being a trimmed-down derivative and becomes the primary developer surface. Pro becomes the specialized high-compute option shipped later to a smaller audience. That is a meaningful shift in how to think about the economics of building on Gemini — and it is worth revisiting once the actual 3.5 Flash benchmark numbers are public on Tuesday.

Stay ahead of every Google I/O launch

From I/O keynote to production in weeks, not quarters.

Our team tracks every major model launch, API surface change, and developer tooling shift — and helps businesses evaluate, implement, and operate agentic AI systems before the hype cycle catches up with the reality.

Free consultationExpert guidanceTailored solutions
What we work on

Post-I/O implementation

  • Gemini 3.5 Flash API migration and benchmark evaluation
  • Antigravity 2.0 and Cursor / Windsurf IDE evaluation programs
  • AI Mode in Search content strategy audits
  • Agentic workflow design on the Managed Agents API
  • Cost modeling across Gemini Flash, Pro, and Ultra tiers
FAQ · Google I/O 2026

Questions we're getting before Monday.

The opening keynote is Tuesday May 19, 2026 at 10:00 AM PT at the Shoreline Amphitheatre, Mountain View, California. It streams free on YouTube. The developer keynote — led by Josh Woodward (VP, Google Labs and Gemini app) and Hema Budaraju (VP, AI Mode in Search) — follows immediately after. Google I/O 2026 runs as a full two-day conference through Wednesday May 20, with a Monday May 18 community pre-event. Breakout sessions open to logged-in Google accounts immediately after the opening keynote closes.