eCommerceNew Release12 min readPublished June 9, 2026

AWS packages Amazon's shopping AI · ~60 days to deploy · catalog readiness is the gate

Amazon Will Sell You Its Shopping Agent: Should You Buy?

On May 27, 2026, AWS started selling the engine behind Amazon's own shopping assistant to outside retailers — a packaged agent AWS says can ship in roughly 60 days. Kate Spade's Gift Concierge, built on Anthropic's Haiku 4.5, is the first production deployment. The real decision isn't the agent. It's whether your catalog is machine-readable enough to use it.

DA
Digital Applied Team
Senior strategists · Published Jun 9, 2026
PublishedJun 9, 2026
Read time12 min
Sources11 cited
ASA deploy target
~60days
AWS-stated vs build from scratch
First deployment model
Haiku 4.5
Kate Spade Gift Concierge
Retailers not catalog-ready
73%
still prepping or not started
Mirakl survey
Conversational conversion lift
3.5×
Amazon-stated, on Amazon.com
vendor figure

The AWS Agentic Shopping Assistant takes the conversational AI that powers Amazon's own storefront and packages it — architecture, starter code, and AWS professional services — for any retailer to deploy. AWS announced it on May 27, 2026, framing the build as "weeks rather than years." For eCommerce operators, that promise is real, and so is the catch underneath it.

The headline coverage fixated on two vendor figures: a 3.5× conversion lift for conversational shopping and a roughly 60-day deployment window. Both are Amazon's numbers, and both deserve scrutiny. The more useful question for a merchant is the one almost nobody asked — what has to be true about your product data before an agent can deliver any of that lift on your site.

This guide treats merchant readiness as the actual decision. We break down what shipped, the Kate Spade reference build on Anthropic's Haiku 4.5, the difference between the packaged ASA and the AgentCore platform underneath it, the unit economics that make consumer-scale agents affordable, and the strategic cost of running your conversational commerce on the infrastructure of the company you compete with. Two proprietary tools — a build-vs-buy matrix by retailer archetype and a catalog readiness scoring card — turn the abstract debate into something you can run against your own feed.

Key takeaways
  1. 01
    AWS packaged Amazon's shopping AI for outside retailers.Announced May 27, 2026, the Agentic Shopping Assistant bundles the architecture, starter code, and AWS guidance drawn from Alexa for Shopping (formerly Rufus). AWS targets a roughly 60-day deployment versus building from scratch.
  2. 02
    Kate Spade is the first production deployment.Tapestry built Kate Spade's AI Gift Concierge on Amazon Bedrock AgentCore using Anthropic's Haiku 4.5, going live April 13, 2026 after roughly 2.5 months of testing. It is the reference customer cited in the launch.
  3. 03
    Your catalog is the real gate, not the agent.Per a Mirakl partner survey, 40% of eCommerce businesses were still standardizing product pages for agentic AI in early 2026 and 33% had not started. An agent can only reason as well as the structured data you feed it.
  4. 04
    The model choice signals the unit economics.Amazon used Anthropic's least expensive flagship — Haiku 4.5 — not Sonnet or Opus. A multi-turn shopping conversation costs fractions of a cent in inference, which is why consumer-scale agents are economically viable.
  5. 05
    Adopting it means depending on a competitor.Every retailer who runs ASA on AWS pays the #1 US retailer for the infrastructure behind their conversational storefront. Weigh the time savings against Amazon's data signals and platform dependency before committing.

01What ShippedAmazon is now renting out its shopping brain.

On May 27, 2026, AWS announced the Agentic Shopping Assistant (ASA), a packaged offering that gives retailers the architecture, starter code, and expert guidance drawn from Amazon's own shopping assistant. Two weeks earlier, on May 13, Amazon had renamed that assistant from Rufus to Alexa for Shopping — the same underlying technology AWS is now licensing to outsiders. According to Amazon, Rufus reached more than 300 million customers during its beta.

AWS's pitch is speed. Rather than assembling a conversational commerce stack from raw components, a retailer adopts a blueprint that AWS says compresses the work to roughly 60 days — "weeks rather than years." That timeline is vendor-stated, and it assumes the hardest precondition is already met: a product catalog structured enough for a language model to reason over. More on that gate in section 04.

AWS-built
AWS does the build
Fastest · most managed

The AWS Generative AI Innovation Center delivers the agent end-to-end, including safety guardrails, brand-voice tuning, and analytics monitoring. The closest path to the ~60-day target.

Engagement model 1
Advisory-led
AWS guides, you build
Shared delivery

AWS provides architecture, starter code, and expert guidance while your team owns implementation. More control over customization at the cost of more internal engineering.

Engagement model 2
Partner-supported
Systems integrator delivers
Outsourced build

A certified AWS systems-integrator partner stands up the deployment. Useful when you lack in-house agent engineering but want a single accountable vendor.

Engagement model 3
Launch snapshot
The Agentic Shopping Assistant was announced May 27, 2026, built on three AWS services — Amazon Bedrock (foundation-model access), Amazon Bedrock AgentCore (agent orchestration), and Amazon OpenSearch (product retrieval). Pricing for the packaged ASA is custom "private offer" only; there is no public list price, and retailers must contact an AWS account representative. Only the underlying AgentCore infrastructure has published rates.

Treat the launch claims carefully. Amazon states its shopping assistant drove on the order of $12 billion in incremental sales and that conversational sessions convert at 3.5× the rate of traditional keyword search — but both are Amazon's own figures, observed on Amazon.com, with no independent audit found. They are a reason to pay attention, not a forecast for your storefront. A separate, independent McKinsey figure puts AI-generated product recommendations at 4.4× higher conversion than traditional search; it is a different measurement and should not be conflated with Amazon's 3.5× claim.

02First CustomerKate Spade's Gift Concierge, the reference build.

The launch's anchor case is the Kate Spade AI Gift Concierge, built by Tapestry — Kate Spade's parent company — on Amazon Bedrock AgentCore. It went live on April 13, 2026, making it the first production deployment of this stack outside Amazon itself. (Kate Spade New York is the brand; Tapestry, Inc. is the parent, and Digital Commerce 360 ranks Tapestry #40 in its Top 2000 — with Amazon at #1.)

The agent was purpose-built around a specific pain point: Amazon cites research that 53% of shoppers report stress during gift purchases. The Gift Concierge turns that into a guided conversation — occasion, recipient, budget, style — rather than a keyword search. Under the hood it runs on Anthropic's Claude Haiku 4.5 (API ID claude-haiku-4-5-20251001) via Amazon Bedrock, and Tapestry completed roughly 2.5 months of testing before the customer-facing launch.

"AWS brought the recipe, but together we built the customization our consumers needed."— Yang Lu, Chief Information and Digital Officer, Tapestry

That phrasing — "AWS brought the recipe" — is the honest shape of the offering. ASA is a blueprint plus services, not a turnkey product you flip on. The customization Tapestry references (brand voice, the gift-finding flow, the curation logic) is where the real work lives, and it is work only a retailer with a clean, queryable catalog can do well. The roughly 2.5-month Kate Spade testing window is also a useful reality check against the headline 60-day figure: a recognized brand with deep resources still spent the better part of a quarter getting it right.

03The LayersASA is the package; AgentCore is the platform.

The single most common confusion in launch coverage is treating the ASA and Amazon Bedrock AgentCore as the same thing. They are layered. AgentCore is the platform — a modular agent runtime. ASA is the packaged offeringbuilt on top of it, with Amazon's shopping-specific scaffolding and AWS professional services wrapped around. You can adopt AgentCore without buying ASA; you cannot buy ASA without running on AgentCore.

AgentCore ships as a set of modular components — Runtime, Memory, Gateway, Identity, Code Interpreter, Browser, Observability, Payments, Evaluations, and Policy among them — and it is deliberately framework-agnostic. It supports LangChain, LangGraph, LlamaIndex, CrewAI, the OpenAI Agents SDK, the Claude Agent SDK, Google ADK, and Strands Agents, and it works with models from Anthropic, OpenAI, Google, Amazon (Nova), Meta (Llama), and Mistral. That breadth matters for the build-vs-buy decision: the same runtime can host a fully self-built agent, not just Amazon's packaged one.

Foundation models
Amazon Bedrock
6+providers

Model access layer. Anthropic Claude, OpenAI, Google Gemini, Amazon Nova, Meta Llama, and Mistral are all available, so model choice stays open even inside the Amazon stack.

Layer 1
Orchestration
Bedrock AgentCore
11modules

The agent runtime: Runtime, Memory, Gateway, Identity, Code Interpreter, Browser, Observability, Payments, Evaluations (13 built-in evaluators), and Policy. Framework-agnostic.

Layer 2
Retrieval
Amazon OpenSearch
1:1

Product retrieval. The agent's answers are only as good as what OpenSearch can surface — which loops directly back to how structured your catalog is.

Layer 3
Governance worth noting
AgentCore Policy uses Cedar, AWS's open-source policy language, for real-time, deterministic access control that runs outside the agent's own code. It can block an unauthorized action — say, a refund over a set threshold — in milliseconds, regardless of what the model decides. That separation of policy from model behavior is the kind of guardrail any production commerce agent needs.

04The Real GateThe agent is easy. Your catalog is the project.

Here is the part the launch coverage skipped. A conversational agent reasons over your product data, so its ceiling is set by how structured and complete that data is. If your feed is full of thin titles, missing specs, stale availability, and no fit or compatibility fields, no amount of model quality rescues it — the agent simply has nothing reliable to reason about.

The readiness gap is wide. Per a Mirakl partner survey, 40% of eCommerce businesses were still standardizing product pages for agentic AI in early 2026, and 33% had not started at all — roughly three in four retailers not yet ready to capture the lift an agent promises. The investment decision a merchant actually faces is not "buy the agent or not." It is "fund the catalog remediation that lets any agent work." That work — building a machine-readable product catalog — is the real line item.

Why the data structure matters

Agents discover and act on products through structured fields and protocols, not marketing prose. If you want a deeper read on what an agent-readable catalog actually requires, see our guide to the Agent Commerce Protocol (ACP) and what agent-readable catalogs require. The same conversational layer is what surfaces tailored results, which is where AI-powered personalization and product recommendations meet catalog quality.

Independent data suggests the upside is real for those who close the gap. Adobe reported that AI-referred visitors to US retail sites converted 42% better in March 2026 than in March 2025, reversing a prior deficit, and that those visitors spent 48% longer on site. A widely circulated industry figure holds that catalogs near full required-field completeness see several times higher visibility in AI recommendation surfaces than sparse ones; we treat that as directional, because its primary research source is not a recognized independent firm. The direction of travel is consistent even where the exact multiple is not verifiable: complete, structured data wins in agent-mediated discovery.

"If product research, discovery, understanding, and purchase happen on OpenAI's platform rather than a retailer's website, retailers risk becoming like fulfillment companies."— Kartik Hosanagar, Wharton School, University of Pennsylvania

05Unit EconomicsWhy Haiku 4.5 is the tell.

Amazon's model choice for the Kate Spade reference build is a quiet but important signal. It used Anthropic's least expensive flagship — Haiku 4.5 — not Sonnet or Opus. That is a deliberate decision about unit economics. At consumer scale, a shopping conversation might run five to ten turns at a couple thousand tokens each. On a model priced like Haiku 4.5, the model-inference cost of an entire conversation lands in fractions of a cent. That is the reason a 300-million-customer assistant can exist without the inference bill swallowing the margin.

The infrastructure underneath it follows the same logic. AgentCore uses consumption-based pricing with no upfront commitment. Per AWS's published rates, Runtime is billed at $0.0895 per vCPU-hour plus $0.00945 per GB-hour — and critically, only while the agent is active, so the idle time spent waiting on a model response is not billed. Memory runs at $0.25 per 1,000 short-term events and $0.75 per 1,000 long-term records per month. None of those line items is what determines whether the project pays off.

Where the money actually goes · relative effort & cost

Sources: AWS AgentCore pricing; Anthropic Haiku 4.5 pricing
AgentCore Runtime (CPU)$0.0895 / vCPU-hour · billed only while active
low
AgentCore Runtime (memory)$0.00945 / GB-hour
low
Long-term memory$0.75 / 1,000 records / month
low
Model inference (Haiku 4.5)Fractions of a cent per 5–10 turn conversation
low
Catalog remediationThe dominant, one-time line item for most retailers
high

The chart is the whole argument. The running costs of a conversational commerce agent on this stack are genuinely small. The expensive part — for the roughly three in four retailers not yet ready — is the catalog work that has to happen first. Any honest cost model for "should I buy this agent" puts data remediation, not inference, at the top of the ledger. One independent estimate pegged a moderate-traffic support agent (10,000 conversations a month, five turns each) at roughly $50–$200/month in AgentCore infrastructure plus $200–$800 in model inference; that is a third-party figure, not AWS-stated, but it underscores how modest the operating side is relative to the data project.

06The Trade-offYou'd be paying your competitor to compete with it.

Strip away the technology and a strategic tension is left in plain view. Amazon is the #1 US retailer. Every retailer that adopts ASA on AWS pays Amazon for the infrastructure behind their own conversational storefront — and runs that storefront on a platform operated by the company they compete with for the same customers. The Next Web put the paradox crisply: the same company that competes with Kate Spade for handbag sales is now providing the AI engine behind Kate Spade's shopping experience.

The ledger is concrete. On one side: the time savings AWS advertises — weeks rather than years — and a managed, battle-tested stack. On the other: recurring AWS infrastructure revenue flowing to a competitor, and the harder-to-price question of what operational and behavioral signals a retailer's agents generate on Amazon's platform over time. None of that makes ASA the wrong choice. It makes it a choice that should be made with the trade-off named, not ignored.

"Whoever controls the agents now has the power. Retailers will increasingly interact with end customers less and more with their AI agent representatives."— Kartik Hosanagar, Wharton School, University of Pennsylvania

Hosanagar's warning is the forward-looking version of the paradox. As more discovery and purchase shift into agent-mediated channels, the layer that controls the agent accrues the leverage in the relationship — and the retailer that outsources that layer wholesale risks ceding the customer relationship itself. The defensive move is not to refuse agents; it is to keep ownership of the asset agents depend on — your structured catalog and your brand voice — even when the runtime is rented. That is the difference between using Amazon's infrastructure and becoming a line in Amazon's.

07Decision MatrixBuild vs. buy, mapped by retailer archetype.

Most coverage frames this as a binary: buy the Amazon agent or don't. The honest version has three tiers and depends on who you are. Tier A is buying ASA on AWS — fastest, most managed, most Amazon-dependent. Tier B is self-building on AgentCore with your own Bedrock models — more control, still on AWS. Tier C is self-building on open frameworks (LangGraph and the like) — maximum control and the least platform lock-in, at the cost of the most engineering. The matrix below maps the sensible default for each archetype. For a deeper comparison of how AWS compares to building on open-source agent frameworks, see our Q2 2026 platform breakdown.

AWS Agentic Shopping Assistant build-vs-buy decision matrix by retailer archetype, comparing deployment speed, catalog readiness requirement, Amazon data-exposure risk, and recommended path.
Retailer archetypeDeploy speed needCatalog readiness req.Amazon data exposure (1–5)Recommended path
Enterprise retailer (>$1B GMV)ModerateHigh — large SKU base to remediate3Build on AgentCore (Tier B)
Mid-market ($100M–$1B)HighMedium3Buy ASA on AWS (Tier A)
Emerging DTC (<$100M)HighLow — small, controllable catalog2Self-build, open frameworks (Tier C)
Marketplace sellerLowMedium — depends on host feed4Improve catalog first; defer agent
Amazon-competing categoryModerateHigh5Self-build, open frameworks (Tier C)

The pattern: the more directly you compete with Amazon and the more sensitive your demand signals, the harder you should lean toward self-building off Amazon's proprietary scaffolding. The more your constraint is speed-to-market with a manageable competitive exposure, the more ASA earns its keep. And for sellers whose catalog lives on someone else's feed, the agent is premature — fix the data first.

08Readiness AuditScore your own feed against the agent.

"Catalog readiness" is too abstract to act on, so here it is as a scoreable audit. Run each attribute category against your own product feed before you scope any conversational agent. The columns tell you the target completeness, what the agent does when the field is missing, the remediation effort, and how much that category moves conversational discovery.

Catalog readiness scoring card for conversational commerce, listing seven product-data attribute categories with target completeness, agent behavior when missing, remediation effort, and impact on conversational discovery.
Attribute categoryTarget completenessWhen missingEffortDiscovery impact
Product title & descriptionNear-completeAgent can't match intent to productMediumHigh
Technical specificationsNear-completeAgent guesses or omits key factsHighHigh
Compatibility / fit dataHighWrong recommendations, returns riseHighHigh
Availability / delivery accuracyReal-timeAgent promises out-of-stock itemsMediumHigh
Review quality / recencyMediumWeaker social-proof reasoningLowMedium
Price history / promotionsReal-timeStale pricing erodes trustMediumMedium
Visual asset alt-textHighAgent can't reason over imageryLowMedium
Score mostly High completeness
Catalog-ready

Specs, fit, and availability are clean and structured. You can scope an agent now and realistically chase the conversational conversion lift. Decide Tier A vs B by competitive exposure.

Scope the agent
Gaps in specs or fit
Remediate first

The fields with High discovery impact are exactly where you're thin. Fund the catalog work before the agent — that's the line item that determines whether any agent pays off.

Fix data, then revisit
Feed lives on a host platform
Limited control

Marketplace sellers depend on a host feed's structure. Tighten what you can influence and treat a self-owned conversational layer as a later-stage move once your own data is yours.

Defer; influence host feed
Direct Amazon competitor
Mind the dependency

If your category competes head-on with Amazon, weight the data-exposure risk heavily. A self-built agent on open frameworks keeps your demand signals off a competitor's proprietary stack.

Self-build, open frameworks

09ConclusionThe agent is the easy yes. The catalog is the hard one.

Merchant readiness, June 2026

Don't buy the agent. Decide whether your data deserves one yet.

The AWS Agentic Shopping Assistant is a genuinely significant release: Amazon packaging the technology behind a 300-million-customer assistant, with a real reference customer in Kate Spade and a framework-agnostic platform underneath. The roughly 60-day timeline and the 3.5× conversion claim are worth taking seriously, though both are Amazon's figures and should be tested against your own numbers, not adopted as forecasts.

The decision a merchant actually faces is upstream of the agent. Three in four retailers, per the Mirakl survey, are not yet catalog-ready — and an agent can only ever be as good as the structured data it reasons over. The running costs are small; the Haiku 4.5 choice proves the inference math works at scale. The expensive, decisive work is the catalog remediation that lets any agent — Amazon's or your own — perform. Fund that first, and the build-vs-buy question gets easier on its own.

And do not skip the strategic line in the ledger. Running your conversational storefront on the infrastructure of the company you compete with is a defensible choice — when you make it deliberately, keep ownership of your structured catalog and brand voice, and size the data-exposure risk against your category. The retailers who win the agentic-commerce shift will be the ones who treated their product data as the asset and the agent as the interface, not the other way around.

Make your catalog agent-ready

Your product data is the asset agents depend on — make it genuinely ready.

We help eCommerce teams audit catalog readiness, structure product data for agent-mediated discovery, and choose the right build-vs-buy path for conversational commerce — delivered in weeks, not quarters.

Free consultationSenior strategistsTailored roadmap
What we work on

Agentic commerce engagements

  • Catalog readiness audit against the scoring card
  • Product-data structuring for agent discovery
  • Build-vs-buy: ASA, AgentCore, or open frameworks
  • Conversational commerce strategy & brand-voice tuning
  • Cost & dependency modeling before you commit
FAQ · AWS shopping agent

The questions merchants ask first.

Announced on May 27, 2026, the AWS Agentic Shopping Assistant is a packaged offering that lets outside retailers deploy the conversational AI technology behind Amazon's own storefront. AWS provides the architecture, starter code, and expert guidance drawn from Amazon's shopping assistant — renamed from Rufus to Alexa for Shopping on May 13, 2026. It is built on three AWS services: Amazon Bedrock for foundation-model access, Amazon Bedrock AgentCore for agent orchestration, and Amazon OpenSearch for product retrieval. AWS targets a roughly 60-day deployment versus building from scratch, a timeline it frames as 'weeks rather than years.' That target is vendor-stated and assumes your product catalog is already structured enough for a model to reason over.