AWS Summit New York 2026, held June 17 at the Javits Center, was the moment Amazon Web Services stopped selling AI building blocks and started shipping end-to-end agentic infrastructure. The keynote, led by Swami Sivasubramanian, AWS’s VP of Agentic AI, threaded a single argument through every announcement: builders should no longer assemble agent scaffolding by hand.

Three product families carried that thesis. AgentCore became a free managed harness for running production agents. AWS Continuum arrived as an AI-native security platform with a deliberate trust model. And AWS Context was announced as a coming-soon knowledge graph that any agent in an organization can query at runtime. Underneath all of them sits the same design question: how do you let an agent do more without writing it a blank check?

This guide covers what actually shipped versus what was merely announced, the autonomy-escalation pattern that ties the portfolio together, how AWS Context compares to the context-layer products from Microsoft, Google, and Salesforce, and what builders and engineering teams should do about it. Every figure below is sourced to AWS’s own blogs or named third-party summit coverage; vendor-stated claims are labelled as such.

Key takeaways

01
The era of assemble-it-yourself scaffolding is closing.AWS reframed its agent story from building blocks to managed infrastructure. AgentCore, Continuum, and the coming-soon Context service all reduce how much glue code teams write to ship and govern agents.
02
AgentCore Harness is free — and that is the moat.There is no charge for the harness, the AgentCore CLI, or the coding skills; you pay only for the AWS resources consumed. Building a production-grade agent takes three API calls with no orchestration code.
03
Continuum's learn-to-enforce model is the real story.Continuum runs human-in-the-loop in learn mode by default, then lets orgs advance category-by-category to enforce mode for automated remediation. It validates vulnerabilities by building working exploits in a sandbox before proposing fixes.
04
Deterministic controls, not the model, hold the limits.AgentCore Payments enforces spend caps at the infrastructure layer; Bedrock Guardrails live in the gateway outside the agent's view; AgentCore Policy uses formal verification inside isolated microVMs. Governance does not depend on the model behaving.
05
Context is announced, not live — treat it as a signal.AWS Context (coming soon) auto-builds an identity-aware knowledge graph from databases, documents, and chat history. It makes AWS the fourth major vendor to ship a context layer, but it was not available at the summit.

01 — The ShiftFrom building blocks to managed infrastructure.

For two years the cloud-vendor agent pitch was a parts catalog: models here, vector stores there, an orchestration framework you wire up yourself, and governance bolted on afterward. The New York summit was AWS arguing that phase is over. Matt Wood, AWS’s Chief AI and Technology Officer, framed the obstacle plainly during the keynote, putting trust rather than raw capability at the center of why enterprises stall on agents.

That framing matters because it reorders the roadmap. If trust is the bottleneck, then the most valuable thing a cloud provider can ship is not a smarter model but a way to grant autonomy incrementally and prove the limits hold. Almost every announcement at the summit — the free harness, Continuum’s staged enforcement, deterministic spend caps, granular autonomy levels in Amazon Quick — is AWS operationalizing one answer to that single question.

"Trust is the single biggest barrier to adoption for artificial intelligence systems inside most organizations."— Matt Wood, Chief AI and Technology Officer, AWS (Summit NY keynote)

Wood paired that with an economic observation: while the cost of a token at the frontier keeps rising, the cost normalized for a fixed point of intelligence keeps falling year over year. Read together, the two ideas are the business case for the whole portfolio — intelligence is getting cheaper per unit, so the constraint shifts to whether organizations trust agents enough to actually deploy them at scale. That is what AWS spent the summit trying to fix. This positions AgentCore within the same broader pattern as an enterprise agent infrastructure reference architecture rather than a standalone tool.

02 — AgentCoreAgentCore: a free managed harness for production agents.

The headline builder announcement was the Amazon Bedrock AgentCore Managed Harness reaching general availability. The pitch is blunt: a production-grade agent in three API calls with zero orchestration code. Each harness session runs in its own microVM with filesystem and shell access, and persistent filesystem state lets an agent suspend mid-task and resume exactly where it stopped — the kind of durable execution most teams previously hand-built.

The commercial structure is the part worth dwelling on. There is no additional charge for the harness, the AgentCore CLI, or the coding skills; customers pay only for the underlying AWS resources consumed. The harness is available across 14 AWS regions via the CLI, with the Managed Harness in preview across four regions: US East (N. Virginia), US West (Oregon), Europe (Frankfurt), and Asia Pacific (Sydney).

Knowledge

Managed Knowledge Base

S3 · SharePoint · Confluence · Drive

Native connectors plus Smart Parsing for multi-format data prep and an Agentic Retriever for complex multi-step queries. The retrieval layer is now managed rather than assembled.

Bedrock AgentCore

Web search

Managed Web Search

zero data egress

A fully managed tool that lets agents retrieve current web knowledge without data leaving the customer's secured AWS environment — current-events grounding without a side channel.

Bedrock AgentCore

Frameworks

LangGraph · CrewAI · Strands

+ LlamaIndex

AgentCore integrates with the major open frameworks plus AWS's own Strands Agents, which gained a Strands Shell isolated execution environment and chaos-testing capabilities at the summit.

open framework support

Why the price is the strategy

Charging $0 for the harness while charging for the underlying compute is the same playbook AWS ran with free Kubernetes control planes on EKS. Once teams build on AgentCore primitives, AWS wins on runtime and compute, not on the tool layer. Amazon reports AgentCore task performance grew roughly 15× in the six months before the summit — a vendor-stated benchmark with no independent verification, so weigh it as a direction of travel, not a guarantee.

One forward-looking piece, AgentCore Payments, is in preview. Built with Coinbase and Stripe, it is designed to let agents pay for APIs, MCP servers, web content, and other agents via the x402 protocol, with the Coinbase x402 Bazaar MCP server exposing more than 10,000 payable endpoints. Because it is preview-only, treat it as a glimpse of agent commerce rather than a system actively transacting in production — though the design choice underneath it is telling, and it maps directly onto the emerging agent marketplace ecosystem.

03 — Trust LadderThe summit’s implicit trust escalation matrix.

No single AWS slide laid it out, but the announcements collectively define a trust ladder — a map of how each product grants more autonomy and what holds the limit. The table below makes it explicit. Note the recurring pattern in the right-hand columns: the control that enforces an autonomy limit is almost always deterministic infrastructure, not the model deciding to behave.

AWS Summit NY 2026 agent trust escalation matrix: default autonomy, escalation mechanism, human override, deterministic control, and availability status by product.
Product	Default autonomy	Deterministic control	Status
AgentCore Harness	Builder-defined per agent	Per-session microVM isolation; filesystem + shell sandboxed	GA (Managed Harness preview, 4 regions)
AgentCore Policy	Constrained to declared capabilities	Automated reasoning (formal verification) inside isolated microVMs	Bedrock AgentCore
AgentCore Payments	Spend up to a session limit	Session-level spend caps enforced at the infra layer, not by the model	Preview (4 regions)
AWS Continuum	Learn mode (human-in-the-loop)	Category-by-category opt-in to enforce mode; sandboxed exploit proof before fixes	Gated preview
Kiro for iOS	Chat mode; opt-in autonomy mode	Compute runs in AWS cloud; user delegates per task across three modes	Gated preview

The throughline is the original analysis worth taking away: AWS is not asking customers to trust a model. It is asking them to trust infrastructure that constrains the model — microVM isolation, formal verification, gateway-layer guardrails outside the agent’s visibility, and spend caps enforced below the reasoning layer. Bedrock Guardrails sit in the gateway specifically so that a prompt-injection attack against the agent cannot circumvent governance. That is a meaningfully different trust story than “the model was trained to refuse.”

04 — ContinuumContinuum: security at machine speed, with a learn mode governor.

AWS Continuum was announced as an AI-native security platform in gated preview. It operates across four continuous phases — Discovery, Prioritization, Validation, and Mitigation/Remediation — and the validation phase is its genuinely novel move. Rather than scanning and suggesting, Continuum constructs a working exploit in a sandboxed environment to prove a vulnerability is real before proposing a fix. Concrete reproducible proof, not a probabilistic severity score.

The governance design is the second differentiator. Continuum runs in learn mode by default — supervised, human-in-the-loop — and organizations advance category-by-category to enforce mode for automated remediation only as they build confidence. It is, in effect, the first enterprise security agent with a deterministic autonomy escalation path baked into the product, which is exactly why Continuum’s staged model maps so cleanly onto a staged framework for deploying agents into enterprise workflows.

Continuum's four continuous phases

Source: AWS Security Blog — Introducing AWS Continuum

DiscoveryContinuously surface vulnerabilities across the supply chain

Phase 1

PrioritizationRank by real-world exploitability, not raw CVE count

Phase 2

ValidationBuild a working exploit in a sandbox to prove the risk

Phase 3

Mitigation / RemediationPropose or (in enforce mode) apply the fix

Phase 4

Continuum is model-agnostic and was built using data and code samples drawn from AWS and Amazon. It integrates with Git platforms for pull-request code scanning and supports IDE access via Kiro Power, a Claude Code plugin, and MCP. A separate preview, Continuum Threat Modeling, uses the STRIDE framework to auto-generate threat models from design documents or application source code. Worth keeping straight: Continuum for code vulnerabilities is the GA-ready piece, while code scanning and threat modeling are separate previews — the platform as a whole is not generally available.

On why AWS moved when it did, Chet Kapoor, AWS’s VP of security services and observability, attributed the acceleration to Anthropic ’s Claude Mythos model, describing its ability to find and chain software vulnerabilities faster than any human team as a catalyst for the Continuum timeline. He called it a turning point that significantly advanced AWS’s plans. Two cautions: that is a vendor-stated motivation rather than an independently audited claim, and Mythos is offered to select organizations — not a publicly available model builders can call directly.

Why machine-speed defense, now

AWS framed Continuum against a rising-attack-volume backdrop — security research cited by AWS partners points to a sharp year-over-year climb in weekly attack attempts per enterprise, and a meaningful share of enterprise applications shipping with at least one known critical vulnerability. Those specific figures trace to partner commentary rather than a verified primary source, so we treat them as directional. The uncontested point stands on its own: the volume of machine-speed attacks now outpaces human-speed defense, which is the gap Continuum targets.

05 — AWS ContextAWS Context: the data lake for AI agents.

AWS Context was announced as a coming-soon service that automatically builds a knowledge graph from an organization’s existing data. It reads databases, documents, Slack history, and email, infers how everything connects, and makes that map available to every agent in the organization at runtime. Mai-Lan Tomsen Bukovec, an AWS Technology Vice President, framed it as the data lake for AI agents — the shared substrate agents query rather than each rediscovering the schema on their own.

The architecture has two features that separate it from a generic knowledge base. First, governance is identity-aware: each query inherits the calling user’s IAM and Lake Formation permissions, so an agent acting for one user cannot see what that user cannot. Second, metadata is stored in Apache Iceberg format in Amazon S3 Tables, queryable through Amazon Athena and Amazon Redshift — open table format, not a proprietary index. AWS says Context is built on the same technology that powers Amazon Quick, which processes millions of requests daily in production.

Governance

Permission-inheriting queries

IAM-aware

Every Context query inherits the calling user's IAM and Lake Formation permissions. Governance is enforced per-identity at query time, not bolted on as a separate access layer.

Lake Formation

Storage

Open table format on S3 Tables

Iceberg

Metadata lands in Apache Iceberg format in Amazon S3 Tables, queryable via Athena and Redshift. The graph is not locked behind a proprietary store.

Athena · Redshift

Differentiator

Learns from agent usage

Auto-learns

AWS states Context observes which sources produce correct results and which join paths agents rely on, refining the graph without manual re-curation. This is a vendor-stated claim, not yet independently tested.

coming soon

Status check

AWS Context is coming soon, not generally available. It was announced at the summit but had not launched — so plan around it as a roadmap signal, not a service you can wire into production today. The auto-learning-from-usage behavior, the most interesting claim, is vendor-stated and awaits independent testing.

06 — Category ViewThe fourth entrant in an emerging category.

Most coverage treats AWS Context in isolation, which misses the strategic shape of it. AWS is the fourth major cloud or AI vendor to ship a context-layer product, after Microsoft’s GraphRAG in Azure AI Search, Google’s Vertex Grounding, and Salesforce’s Data Cloud semantic layer. The table below frames the four head-to-head. The distinctive AWS claim sits in the auto-learning column — and it is exactly the claim still awaiting independent verification.

Context-layer products compared: AWS Context, Microsoft GraphRAG, Google Vertex Grounding, and Salesforce Data Cloud, across auto-learning, governance model, storage, and availability.
Product	Auto-learns from agent usage?	Governance model	Availability
AWS Context	Yes (vendor-stated; unverified)	IAM + Lake Formation, identity-aware per query	Coming soon
Microsoft GraphRAG	No — graph built from an indexing pipeline	Azure AI Search / Entra ID access controls	Available (Azure AI Search)
Google Vertex Grounding	No — grounding to configured sources	Google Cloud IAM	Available (Vertex AI)
Salesforce Data Cloud	No — modeled semantic layer	Salesforce platform permissions	Available (Data Cloud)

The architectural distinction AWS is selling is real but unproven: the other three build their graph from a pipeline you configure or a model you maintain, while Context claims to learn graph topology from how agents actually query. If that holds up, it is a genuinely different operating model — less ETL, more observation. If it does not, Context is a well-governed entrant in a category that already has three shipping products. The honest read today, given that Context is coming-soon and the differentiator is vendor-stated, is to evaluate it on availability rather than on the marketing claim.

07 — Kiro for iOSAgents that keep running after the screen goes dark.

Kiro for iOS, announced in gated preview, is a true native app rather than a web wrapper, with three interaction modes: chat, spec (continuing a specification workflow), and autonomy (delegating a full task). The signals story is the architecture underneath: compute runs in the AWS cloud, so a session keeps running after the phone screen goes dark. It reads less like a code editor on a phone and more like a remote control for long-running agents. For the full desktop story, our guide to Kiro, AWS’s agentic IDE covers the spec-driven workflow the iOS companion extends.

That untethering reframes what an agentic IDE is. If a task can run for hours unattended while you walk away from the desk, the IDE is infrastructure, not an editor — and supervision becomes the primary human job. AWS leaned into the case-study angle here: it cited Southwest Airlines deploying Kiro to more than 2,700 developers as part of a transition to a cloud-based, AI-enabled architecture, a customer figure that is vendor-stated.

A naming caution that trips up most coverage: do not conflate Amazon Quick with the developer tooling. Amazon Quick is the enterprise productivity assistant — it added 16 new built-in integrations (including Adobe, Moody’s, and Snowflake), a redesigned activity feed, and no-code autonomous agents. Kiro is the developer IDE. They are separate products with separate autonomy models, and Context is built on the same technology that powers Quick, not Kiro.

08 — ImplicationsWhat it means for agencies and engineering teams.

Looking forward, the summit reads as AWS standardizing the agent stack the way it standardized container orchestration: own the managed layer, give the tooling away, and monetize the runtime. For teams, that turns a build-versus-buy decision that was open six months ago into a more nuanced one. The decision tree below is how we would scope it today.

Greenfield agents

Starting a new agent project

The free AgentCore Harness plus managed knowledge base and web search removes most of the scaffolding teams used to hand-build. Prototype on it before committing to a custom orchestration layer — the lock-in is real, but so is the time saved.

Start on AgentCore

Security automation

Staged remediation

Continuum's learn-to-enforce model is the right shape for security teams nervous about autonomous fixes. Run learn mode first, validate the sandbox-exploit proofs, then graduate categories to enforce. It is gated preview, so request access early.

Pilot Continuum in learn mode

Knowledge layer

Org-wide agent context

AWS Context is coming soon, not shippable. If you need a context layer now, the available options are Microsoft GraphRAG, Google Vertex Grounding, or Salesforce Data Cloud. Track Context's GA date; do not architect around it yet.

Wait on Context; use a shipping layer

Vendor strategy

Avoiding single-cloud lock-in

The free-harness moat is designed to anchor you to AWS runtime. Keep agent logic portable where you can — open frameworks like LangGraph and CrewAI run on AgentCore but are not AWS-only — so the tooling convenience does not silently become a migration tax.

Build on portable frameworks

For most agencies and engineering teams, the practical move is to treat the GA pieces and the preview pieces very differently. Build pilots on the free AgentCore Harness today; request gated access to Continuum and evaluate its learn mode against a real codebase; and hold AWS Context as a roadmap item, not a dependency. If you are weighing how this fits a broader agentic strategy, that comparative scoping is exactly where our AI and digital transformation engagements start, and where our CRM automation work grounds agent pilots in a workflow that pays for itself.

09 — ConclusionThe summit where assembly-yourself ended.

The shape of agentic infrastructure, June 2026

AWS stopped selling parts and started selling a trust model.

AWS Summit New York 2026 was less a feature dump than a thesis statement. The free AgentCore Harness, Continuum’s staged enforcement, deterministic spend caps, gateway-layer guardrails, and a coming-soon identity-aware Context graph all answer the same question: how do you let agents do more without handing them a blank check?

The honest framing keeps the status distinctions straight. AgentCore Harness is generally available and free; Continuum, Kiro for iOS, and Context are preview, gated preview, or coming soon. AgentCore Payments is preview, not a production payment rail. The 15× performance and auto-learning claims are vendor-stated and await independent checks. Treat the GA pieces as buildable and the rest as a credible map of where AWS is heading.

The broader signal is the one that matters most: the competitive edge in agents is moving from the model to the infrastructure that governs it. When trust, not capability, is the bottleneck, the vendor that ships the most convincing deterministic guardrails wins the deployment — and the runtime bill that comes with it. That is the move AWS made in New York, and it is the move every other cloud will now have to answer.

AWS Summit NY 2026: AgentCore, Continuum and Context