The MCP Dev Summit 2026 was the moment the Model Context Protocol crossed from adoption story to infrastructure question. The North America edition ran April 2–3, 2026 at the New York Marriott Marquis and drew roughly 1,200 attendees — about double the prior summit. The headline number, 110 million monthly SDK downloads, was the easy takeaway. The harder, more durable one was governance.
For two years the standard MCP pitch was a download chart. That era ended on stage in New York. The keynotes, the 23 security-focused sessions, and the incoming spec all pointed the same direction: MCP is now load-bearing inside large organisations, and the open problems are operational — sessions, sprawl, identity, and audit trails — not whether the protocol works.
This readout treats the summit as an inflection rather than a news dump. It unpacks the incoming July 28 spec through a governance lens, translates the spec into a team-action matrix, makes the case that Shadow MCP is the new Shadow IT, and compares what each hyperscaler now brings to enterprise MCP. For the underlying download and server counts this builds on, see our MCP adoption statistics and the running MCP server ecosystem tracker.
- 01The summit reframed MCP from adoption metric to infrastructure.The North America edition (April 2–3, 2026) drew ~1,200 attendees and devoted 23 of its sessions to security. The story stopped being downloads and became governance, identity, and audit.
- 02110M monthly downloads is the April 2026 keynote figure.MCP co-creator David Soria Parra cited 110M monthly SDK downloads — reached in 16 months, a milestone React took three years to hit. The December 2025 baseline was 97M; both are vendor-stated, so timestamp whichever you cite.
- 03The July 28 spec makes the protocol stateless.The incoming release removes the initialize handshake and the session header so any request can route to any server instance. That unlocks vanilla round-robin load balancers — the change that makes MCP feasible at scale, not just possible.
- 04Shadow MCP is the new Shadow IT.Organisations typically discover several times more MCP deployments than IT expected. Servers bind to localhost, listen on random high ports, or hide behind reverse proxies — and surface in IDE plugins as production dependencies with no approval.
- 05A centralised gateway plus registry is the emerging consensus.Amazon, Uber, AWS, Docker, Kong, Solo.io, Bloomberg, and Cloudflare are converging on the same control-plane pattern. The procurement question for engineering leads is now which governance stack, not whether to govern.
01 — The InflectionWhat the summit actually signalled.
MCP Dev Summit North America 2026 was held April 2–3 at the New York Marriott Marquis, drawing approximately 1,200 attendees — roughly double the prior summit. The programme spanned 17 keynotes and more than 95 sessions across four tracks, with security the single most represented theme at 23 dedicated sessions. That ratio is the tell: a protocol still proving itself spends its stage time on capability demos, while a protocol entering production spends it on threat models and identity.
David Soria Parra, MCP's co-creator at Anthropic, opened Day 1 with the figure that travelled furthest — 110 million monthly SDK downloads, a level MCP reached in 16 months. For context, React took roughly three years to hit the same milestone. That number is vendor-stated and timestamped to the April keynote; an earlier December 2025 Anthropic announcement put the figure at 97 million. Both are real points on a steep curve, which is exactly why the date matters more than the headline.
The institutional signals were just as loud as the metrics. The Agentic AI Foundation — the body the protocol was donated to in December 2025 — reported reaching 170 member organisations in under four months, well past the 100-member milestone cited from the stage. Jim Zemlin stepped down as interim Executive Director, with Mazin Gilbert appointed to lead the foundation. The x402 Foundation launched at the summit, folding a payment protocol into the agent ecosystem. None of this is the behaviour of an experiment; it is the scaffolding of an institution.
"MCP is the Linux of agents. We're going to keep going."— Jim Zemlin, Linux Foundation CEO, MCP Dev Summit North America 2026
Our reading: the "Linux of agents" framing is more than a soundbite. It signals a deliberate move toward a neutral, foundation- governed standard with a predictable lifecycle — and the spec changes below back that up. The summit's most underrated message was not that MCP is popular. It was that MCP is now being run like infrastructure, with a deprecation policy, an authorization roadmap, and an enterprise governance track to match.
02 — ArchitectureThe spec shift everyone underreported.
The current stable specification is dated 2025-11-25, with stdio and Streamable HTTP transports over JSON-RPC 2.0. The incoming July 28, 2026 release — whose release candidate was locked May 21, 2026 by lead maintainers David Soria Parra and Den Delimarsky, with a validation window running to July 28 — is the most consequential revision since the protocol's creation. Most coverage summarised it as "sessions are gone." That undersells what changes.
Concretely, the July 28 spec removes the initialize/initialized handshake and the Mcp-Session-Id header, making the protocol fully stateless. The operational consequence is the part that matters: any MCP request can now route to any server instance with no sticky sessions and no shared session store. You can place a server behind a vanilla AWS Application Load Balancer or a Cloudflare load balancer with zero custom routing logic. That is the difference between MCP being technically possible at scale and being genuinely feasible to operate at the scale of an Uber or a Pinterest.
The release is additive as well as subtractive. It introduces two official extensions — MCP Apps (server-rendered HTML UIs in sandboxed iframes with JSON-RPC channels) and a Tasks Extension (a stateless lifecycle via tasks/get, tasks/update, and tasks/cancel after a tool handles a request). Tool inputSchema and outputSchema gain full JSON Schema 2020-12 support, so oneOf, anyOf, allOf, conditionals, and $ref with $defs all become valid. New Mcp-Method and Mcp-Name headers let a load balancer route without inspecting the request body, and ttlMs / cacheScope fields enable client-side caching of tools/list results.
Three features are formally deprecated: Roots (replaced by tool parameters, resource URIs, or server config), Sampling (replaced by direct LLM provider API calls), and Logging (replaced by stderr for stdio transports and OpenTelemetry for structured observability). Crucially, these deprecations are annotation-only — removal requires a separate proposal after at least 12 months. That formal deprecation-to-removal window is itself the news: it is the first concrete evidence that the foundation is treating MCP as long-lived infrastructure rather than a perpetually experimental protocol.
03 — Team ActionThe July 28 spec as a team-action matrix.
Coverage of the release candidate tends to list its SEPs without answering the question engineering leads actually have: who on my team has to do what, and by when. The table below maps each significant change to its impact class, the role affected, and the required action. Every cell traces to the official MCP release notes; the timeline column reflects the announced July 28 window.
| Change | Impact | Who acts | Required action |
|---|---|---|---|
| Breaking — stateless core | |||
| Sessions removed | Breaking | Both | Drop the handshake and session header; move any session state to request payloads or external stores. Re-test behind a round-robin load balancer. |
| Error code change | Breaking | Client devs | Handle missing-resource errors as JSON-RPC standard -32602 instead of the MCP-custom -32002. |
| Additive — new capability | |||
| MCP Apps + Tasks | Additive | Server devs | Adopt server-rendered iframe UIs and the stateless task lifecycle where long-running tools need progress and cancellation. No action if unused. |
| JSON Schema 2020-12 | Additive | Server devs | Tighten tool input/output schemas with oneOf/anyOf/$ref once Tier 1 SDKs ship support. |
| Routing + caching headers | Additive | Both | Use Mcp-Method/Mcp-Name for body-free routing; cache tools/list via ttlMs/cacheScope; propagate W3C Trace Context. |
| Hardening + deprecation | |||
| Auth alignment (OAuth/OIDC) | Hardening | Client devs | Validate iss per RFC 9207, declare OpenID Connect application_type at Dynamic Client Registration, bind credentials to the issuing server. |
| Roots / Sampling / Logging | Deprecation | Both | Migrate off the three deprecated features; annotation-only now, with at least a 12-month window before any removal. |
04 — Governance RiskShadow MCP is the new Shadow IT.
The most important governance idea to come out of the summit period is also the least technical: Shadow MCP. The analogy to Shadow IT is exact because the mechanism is identical. A low-friction tooling layer spreads faster than IT can track it, and then quietly becomes a compliance liability. With MCP, the spread is faster still, because a server is often just a dependency a developer adds to an IDE plugin.
Security researchers and vendors analysing the problem report a consistent pattern: organisations typically discover several times more MCP deployments than IT expected — commonly framed as a 3–10x gap. The reason these servers evade traditional visibility is structural. They bind to localhost, listen on random high ports, or sit behind reverse proxies. They appear inside IDE plugins as routine production dependencies with no formal approval step. None of that trips a conventional asset inventory.
"Behind corporate firewalls, we're quietly wiring MCPs to Salesforce, Jira, internal wikis, Snowflake daily."— David Soria Parra, MCP co-creator / Anthropic, MCP Dev Summit North America 2026
That quote is the whole problem in one sentence. Every one of those connections is a credentialed bridge between an autonomous agent and a system of record, created without a change ticket. The security picture around it is genuinely uncomfortable. Independent analyses describe a meaningful share of public MCP servers relying on static secrets for authentication, and a non-trivial fraction being lookalikes that impersonate legitimate products. We treat the specific percentages in those reports as secondary, vendor-sourced estimates rather than audited fact — but the direction is not in dispute: the authentication posture of the public ecosystem is weak, and the impersonation surface is real.
Aggregator coverage of the summit also points to dozens of CVEs filed against MCP implementations in early 2026 and a slice of public servers affected by tool poisoning. Because those figures trace to a single secondary source rather than primary CVE databases, we cite them only as a qualitative signal — there were enough vulnerabilities disclosed to make security the summit's dominant track, and the tool-poisoning class is real, regardless of the precise count.
More deployments than IT expects
The recurring finding across Shadow MCP analyses: organisations consistently uncover several times more MCP servers than their IT teams approved, creating immediate compliance gaps.
Evasion mechanisms
MCP servers bind to localhost, listen on random high ports, or sit behind reverse proxies — and surface inside IDE plugins as production dependencies. Conventional asset inventories miss all three.
Rebinding on local servers
Jonathan Leitschuh demonstrated DNS rebinding vulnerabilities affecting local MCP servers at the summit, with origin-header validation emphasised as the critical safeguard for any localhost-bound server.
Here is the operational implication worth coining: MCP sprawl audits are now a recurring IT hygiene task, not a one-time cleanup. The same way endpoint and SaaS inventories became quarterly rituals once Shadow IT matured, MCP discovery belongs on a standing cadence. That governance work is now formal enough that a session titled "From Shadow MCP to Sanctioned MCP" appeared on the schedule at the Mumbai edition of the summit — shadow governance is a named track, not a hallway conversation.
05 — Consensus PatternThe enterprise control plane everyone converged on.
If the summit had a single architectural consensus, it was this: the enterprise MCP control plane is a centralised gateway plus a registry. Organisations converging on the pattern span the full spectrum — Amazon, Uber, AWS, Docker, Kong, Solo.io, Bloomberg, and Cloudflare all described variants of the same shape. A gateway terminates and governs every agent-to-server call; a registry is the single source of truth for which servers exist and who may use them. This is the same logical role an LLM gateway architecture plays for model traffic, applied to tool traffic.
Uber gave the most concrete production picture. The company disclosed an MCP Gateway and Registry exposing thousands of Thrift, Protobuf, and HTTP endpoints, fronted by a Go-based GenAI Gateway that handles PII redaction. The scale figures it shared — more than 5,000 engineers, over 10,000 internal services, 1,500+ monthly active agents, and 60,000+ agent executions weekly, with its Minions Agent making around 1,800 code changes a week and used by 95% of engineering — are vendor-stated and not independently audited. Even discounted for that, they describe MCP operating as core production infrastructure, not a pilot.
The gateway
Terminates every MCP call, enforces identity and policy, redacts PII, and emits the audit trail. The natural place to detect Shadow MCP via hostname scanning, URI patterns, and JSON-RPC body inspection.
The registry
Catalogues which MCP servers exist, who owns them, and who may use them. Turns Shadow MCP discovery into a closed loop: anything not in the registry is, by definition, unsanctioned.
Token efficiency
Cloudflare reported its Code Mode cutting token usage sharply — roughly 9,400 tokens for four servers and 52 tools down to about 600 — by letting the model write code against tool APIs rather than loading every definition into context.
Cloudflare published a full enterprise MCP reference architecture in mid-April, combining remote MCP servers, Access for SSO and MFA, MCP Server Portals, AI Gateway, and Cloudflare Gateway for shadow-MCP detection. Its claimed Code Mode token reduction — from roughly 9,400 tokens to about 600 — is vendor-stated, but the underlying technique matters more than the exact figure: the same progressive, code-execution approach to tool discovery is how Claude Code defers tool definitions that exceed a context threshold, with Anthropic reporting substantial token savings. Token efficiency, in other words, is becoming a property of the gateway layer, not just the model.
06 — ProcurementWhat each hyperscaler actually ships.
For an engineering lead making a procurement decision, the useful artefact is a side-by-side of what each vendor now brings to enterprise MCP. The table below compares the major offerings on maturity, core capability, auth model, and audit posture. Every cell traces to the vendor's own announcement or independent analyst coverage from the April–June 2026 window.
| Offering | Maturity | Core capability | Auth model | Audit trail |
|---|---|---|---|---|
| AWS MCP Server | GA — May 6, 2026 | One call_aws tool across 15,000+ AWS API operations, plus doc-search and sandboxed script execution. | Existing IAM credentials; MCP Proxy bridges to OAuth 2.1; cross-account and cross-role access. | CloudWatch metrics (AWS-MCP namespace) + CloudTrail for compliance. |
| Cloudflare Enterprise MCP | Reference arch — Apr 14, 2026 | Remote MCP, Server Portals, AI Gateway, and Code Mode token efficiency at the edge. | Access for SSO/MFA; Cloudflare Gateway for shadow-MCP detection. | Gateway logging; hostname, URI, and JSON-RPC body inspection. |
| Claude MCP Tunnels | Research preview — May 19, 2026 | Outbound-only encrypted reach to private MCP servers with no inbound firewall rules or public endpoints. | End-to-end encrypted; regulated-industry rollout first. | Runs inside the customer perimeter; integrates with existing controls. |
| Claude self-hosted sandboxes | Public beta — May 19, 2026 | Isolated execution via Cloudflare, Daytona, Modal, or Vercel — plus a bring-your-own option. | Per-partner isolation; zero-trust secrets injection (Cloudflare) and VPC peering (Vercel). | Inherits the sandbox provider's controls and logging. |
07 — PerimeterSandboxes and tunnels: running inside the perimeter.
The governance theme had a concrete product moment a few weeks after the summit. At Code with Claude London on May 19, 2026, Anthropic launched self-hosted sandboxes in public beta and MCP tunnels in research preview. Four sandbox launch partners shipped on day one — Cloudflare (microVMs with zero-trust secrets injection), Daytona (composable, long-running, stateful computers), Modal (sub-second startup for AI workloads), and Vercel (VM security plus VPC peering, millisecond startup) — with a bring-your-own-sandbox option alongside. We cover the launch in depth in our companion piece on self-hosted sandboxes and MCP tunnels.
MCP tunnels are the more architecturally interesting of the two. A tunnel establishes an outbound-only, end-to-end encrypted connection so an agent can reach a private MCP server without inbound firewall rules, public endpoints, or IP allowlisting. Supported in Managed Agents and the Messages API and still in research preview, Anthropic is rolling access out to regulated industries first. The reason that ordering matters is the reason the whole governance theme exists: the bottleneck on production agents is rarely the model. It is the security review.
The day-one customer references underline how production-minded the launch was: Amplitude built a Design Agent on Cloudflare, Clay ran its Sculptor GTM engineering agent on Daytona, DoorDash evaluated Modal for agentic commerce, and Rogo ran AI for institutional finance on Vercel. These are not demos. They are early-adopter teams putting agents against real systems, which is exactly the threshold the stateless spec and the gateway pattern are built to support at scale.
08 — PlaybookA governance playbook for the next two quarters.
The summit's lessons collapse into a small number of decisions most engineering organisations can act on now, ahead of the July 28 release. The matrix below is our reading of the priority order — what to do first, and why.
Run a Shadow MCP discovery sweep
Assume IT undercounts MCP deployments by several times. Scan for localhost-bound servers, high-port listeners, and IDE-plugin dependencies. Make this a quarterly hygiene task, not a one-off.
Plan the stateless migration
If any server holds per-conversation state, the July 28 stateless core is a breaking change. Move state out of sessions now and re-test behind a round-robin load balancer before the release lands.
Stand up a gateway plus registry
Adopt the consensus pattern: one gateway for identity, policy, PII redaction, and audit; one registry as the source of truth. Anything not in the registry is unsanctioned by definition.
Kill static secrets
The weakest part of the public ecosystem is authentication. Align with the spec's OAuth 2.0 / OpenID Connect hardening, validate issuers, and replace static secrets with short-lived, scoped credentials.
For most organisations the sequence matters more than any single item. You cannot govern what you cannot see, so discovery comes first; the stateless migration is the highest-leverage engineering work because it is both breaking and date-bound; and the gateway plus registry is the durable investment that turns discovery into a closed loop. If your team is standing up that control plane from scratch — or weighing AWS, Cloudflare, and Anthropic's offerings against your own constraints — our AI transformation engagements start with exactly this kind of architecture and governance work, and our custom development team builds the gateway and registry layer to fit your stack.
09 — ConclusionThe moment the protocol grew up.
MCP's open question stopped being 'does this work' and became 'can a CISO sign off on it'.
The 2026 Dev Summit will be remembered less for the 110 million downloads and more for the shift it confirmed. MCP is now infrastructure, and infrastructure is judged on different terms — scalability, identity, audit, and a predictable lifecycle. The incoming July 28 spec answers the first of those by deleting sessions, and the 12-month deprecation window answers the last by committing to a stable lifecycle.
The governance story is the one to internalise. Shadow MCP is real, it spreads exactly the way Shadow IT did, and the same antidote applies — visibility first, then a control plane that turns discovery into a closed loop. The consensus on a centralised gateway plus registry, and the arrival of perimeter-friendly tools like self-hosted sandboxes and MCP tunnels, mean the building blocks now exist. The work is operational, not speculative.
Our advice for engineering leads is unglamorous and effective: run a discovery sweep this quarter, put the stateless migration on the roadmap before late July, stand up a gateway and registry, and close the authentication gap. The protocol grew up at this summit. The organisations that treat it as infrastructure — rather than a clever developer toy — are the ones that will ship agents their security teams actually approve.