AI crawler and bot traffic statistics crossed a structural threshold in 2026: on June 3, Cloudflare CEO Matthew Prince shared Radar data showing that automated requests now make up 57.5% of HTML traffic to web content, against 42.5% from humans — the first time machines have held the majority in internet history. This page is the sourced reference behind that headline and the economics underneath it.

What matters for publishers and SEOs is not the single 57.5% figure; it is the asymmetry behind it. Most AI crawling exists to harvest training data, and training crawlers return almost no referral traffic. ClaudeBot, by Cloudflare’s own measurement, crawled tens of thousands of pages for every visit it sent back. That crawl-without-credit gap is the through-line of every number below.

This is the data page — a citable reference with every number traced to the publisher’s own page and a date. For the mechanics of which crawler to allow or block and how, read the companion AI crawler access-control decision matrix. For the rights and news angle on where this training data comes from, see our breakdown of the Common Crawl training-data showdown. Definitions and methodologies differ by source — we flag the measurement basket on every figure that needs one.

Key takeaways

01
Bots passed humans on the open web in 2026.Cloudflare Radar put automated requests at 57.5% of HTML web traffic versus 42.5% human as of June 3, 2026 — the first machine majority. The figure covers HTML requests, not video, email, or gaming.
02
AI crawlers are roughly a fifth of verified bot traffic.Cloudflare put AI crawlers at 20.3% of verified bot traffic in May 2026, with AI-search bots adding 6.5% — about 26.7% of verified bot activity now AI-related.
03
Training, not search, drives most AI crawling.Cloudflare attributed 51.8% of AI crawler requests to training in May 2026 and only 9.3% to search. Training crawlers generate little to no referral traffic back to publishers.
04
The crawl-to-refer gap is enormous and very uneven.Per SEOmator's read of Cloudflare data, ClaudeBot crawled 23,951 pages per referral in Q1 2026 while Perplexity sat near 111:1 and Google's traditional search ratio was about 4.9:1.
05
AI referrals are small but growing and high-quality.Adobe reported AI-referred traffic to US retailers grew 393% YoY in Q1 2026, with those visitors converting and spending more on-site than non-AI traffic — a retail-specific finding, not a web-wide one.

01 — The MilestoneThe web went machine-majority in June 2026.

The single most-cited number of 2026 came from a June 3 post by Cloudflare co-founder and CEO Matthew Prince, sharing Cloudflare Radar data: bots now generate 57.5% of HTML web traffic, with humans at 42.5%. Prince had forecast this crossover at SXSW but expected it roughly a year later — it arrived ahead of schedule. The measurement covers HTML HTTP requests across Cloudflare’s network and explicitly excludes video streaming, email, and gaming traffic, so it is a measure of crawlable web content, not all internet packets.

A second, independently-reported figure tells a compatible story with a different ruler. Imperva’s Bad Bot Report 2026 put automated traffic at 53% of all web traffic in 2025, up from 51% in 2024. The two numbers measure different baskets — Cloudflare counts HTML requests; Imperva counts all web traffic including app and API calls — so they should not be averaged or treated as the same metric. What they share is direction: across independent vendors, machines now move more of the web than people do.

"Welp, that happened faster than I predicted."— Matthew Prince, co-founder and CEO, Cloudflare

Source & method

The 57.5% figure is Cloudflare Radar data shared by Matthew Prince on June 3, 2026, covering HTML HTTP requests only. Imperva’s independently-published Bad Bot Report 2026 (April 29, 2026) reports a separate 53%-of-all-web-traffic figure for 2025. Different baskets, same direction — keep them distinct when citing.

Not all bots are AI bots. Within verified bot traffic, Cloudflare put AI crawlers at 20.3% in May 2026, with AI-search bots adding a further 6.5% — making AI-related activity about 26.7% of verified bot traffic. The rest is the long-standing population of search-engine indexers, uptime monitors, feed fetchers, and the like. Among the named crawlers, Googlebot remained the single largest at 27.26% of AI-adjacent bot HTTP requests in May 2026, ahead of GPTBot and ClaudeBot.

The OpenAI-versus-Anthropic race is closer than headlines suggest and it flips month to month. In May 2026 Cloudflare put GPTBot at 11.48% of AI bot HTTP requests and ClaudeBot at 9.73% — a reversal from April 2026, when ClaudeBot led 11.69% to 9.84%. Reading any single month as a durable ranking is a mistake; the volumes are volatile. Year over year the trend is clearer: per Cloudflare’s August 2025 analysis, GPTBot’s share of verified bot traffic rose from 4.7% in July 2024 to 11.7% in July 2025, and ClaudeBot grew from 6% to nearly 10% over the same window.

Googlebot share

Of AI bot HTTP requests

27.26%

Googlebot remained the largest AI-adjacent crawler in May 2026, though Cloudflare noted it shed roughly 3 points from April. In 2025 Googlebot reached 11.6% of unique web pages — more than 3× GPTBot's 3.6% reach.

Cloudflare · May 2026

GPTBot share

Of AI bot HTTP requests

11.48%

OpenAI's training crawler. Its share of verified bot traffic rose from 4.7% (Jul 2024) to 11.7% (Jul 2025) per Cloudflare, with crawler activity up roughly 305% over that year.

Cloudflare · May 2026

ClaudeBot share

Of AI bot HTTP requests

9.73%

Anthropic's crawler. It led GPTBot in April 2026 (11.69% vs 9.84%) before the order reversed in May. Single-month rankings are volatile — read the trend, not the snapshot.

Cloudflare · May 2026

03 — Purpose SplitMost AI crawling is training, not search.

The most counterintuitive finding in the data refutes a common assumption — that AI bots mostly power AI search and therefore mostly return citations and clicks. They do not. Cloudflare attributed 51.8% of AI crawler requests to training in May 2026, with mixed-purpose (training plus retrieval) adding another 35.7%, and search-only purpose accounting for just 9.3% — though that search slice is rising, up from 7.5% in April 2026. In Cloudflare’s full-year 2025 data, training drove nearly 80% of AI bot activity, up from about 72% the year before.

This is the analytical crux of the whole subject. When more than half of AI crawling exists to build training corpora rather than to answer a live user query, the majority of that activity has no mechanism to send a visitor back to the source. Your content is consumed to improve a model that may later answer questions without ever linking to you. That is a different economic relationship than the search-engine bargain SEOs have operated under for two decades, and it is why the zero-click dynamic matters more, not less, in the AI era. We unpack that shift in our analysis of the 2026 zero-click search landscape.

AI crawler requests by purpose · May 2026

Source: Cloudflare Radar, via TechTimes (May 2026 share data)

TrainingBuilds model corpora · no referral mechanism

51.8%

Mixed (train + retrieve)Dual-purpose crawling

35.7%

SearchLive-query retrieval · up from 7.5% in April

9.3%

The structural read

Search-purpose crawling — the kind that can return a citation — was under 10% of AI crawler requests in May 2026. The other 90% is extraction: content taken to train or augment a model with little or no referral in return. That imbalance, not the raw bot volume, is what reshapes the publisher–crawler bargain.

04 — Value ExchangeThe crawl-to-refer scorecard.

The crawl-to-refer ratio — pages crawled per referral visit sent back — is the single clearest measure of how fair the exchange is, and the spread across platforms is staggering. The figures below come from SEOmator’s March 2026 GEO Data Report reading Cloudflare Radar data, and they carry a hard caveat: ratios are tied to a specific time window and must never be averaged across windows. ClaudeBot, for example, measured 23,951:1 over January–March 2026 and improved to 11,122:1 for the single week of May 25 to June 1, 2026 — both real, neither universal.

One measurement limitation cuts in publishers’ favor and is worth stating plainly: Cloudflare notes that native-app traffic — the Claude and ChatGPT mobile and desktop apps — often arrives without a Referer header, which can make these ratios overstate the imbalance. The direction is not in doubt, but the exact multiples should be read as order-of-magnitude, not precise.

Crawler / Platform	Crawl-to-refer ratio	Window	Returns referral traffic?
AI training & answer crawlers
ClaudeBot (Anthropic)	23,951:1 → 11,122:1	Q1 2026 → wk May 25–Jun 1	Minimal — worst major ratio
GPTBot (OpenAI)	1,276:1	Jan–Mar 2026	Limited
PerplexityBot	111:1	Q1 2026	Most referral-efficient AI crawler
Microsoft Copilot (Bing)	33:1	Mar 2026	Yes — search-backed
Traditional search (benchmark)
Google	4.9:1	Mar 2026	Yes — crawls translate to clicks
DuckDuckGo	1.5:1	Mar 2026	Yes — near-parity

Sources: SEOmator GEO Data Report (Mar 17, 2026) reading Cloudflare Radar; Cloudflare AI search crawl-to-refer data (late-May 2026 weekly). Ratios are window-specific and should not be averaged across periods. Native-app traffic without a Referer header may overstate the imbalance.

The pattern is unmistakable: platforms that run a consumer search product — Google, DuckDuckGo, and to a lesser degree Bing-backed Copilot — return traffic in roughly the volume they take, because a crawl exists to power a result the user clicks. Pure training and answer crawlers do not, because the model answers in place. Anthropic sits at the extreme end of that spectrum, while Perplexity, which surfaces sources inline, is one to two orders of magnitude more referral-efficient than the worst training crawlers. For the log-level evidence behind how these crawlers actually move through a site, see our 30-day site-log study of AI crawler behavior.

05 — DefenseHow publishers are responding.

The blocking response has been large and fast. After Cloudflare made AI-crawler blocking the default for new domains on July 1, 2025, its customers blocked 416 billion AI bot scraping requests in the five months through early December 2025, per data Matthew Prince shared with Search Engine Land. More than 2.5 million sites had chosen to fully disallow AI training as of August 2025, and over a million Cloudflare customers had activated AI-crawler blocking.

In robots.txt directives specifically, GPTBot is the most-blocked AI crawler, appearing in 5.52% of DISALLOW rules in Q1 2026 per TechnologyChecker.io’s analysis, ahead of CCBot (5.08%), ClaudeBot (4.88%), Google-Extended (4.44%), and Bytespider (4.23%). There is a notable gap among news publishers: while 79% block at least one AI training bot, only 46% block Google-Extended, Google’s AI training crawler — a reluctance Matthew Prince has linked to Google’s decision to merge its search and AI crawlers into a single user-agent.

"You can't opt out of one without opting out of both, which is crazy."— Matthew Prince, co-founder and CEO, Cloudflare

That combined-crawler design is the most underreported structural advantage in the data. Cloudflare’s figures imply Google’s combined search-plus-AI crawler gives it roughly 3.2× more webpage access than OpenAI, 4.6× more than Microsoft, and about 4.8× more than Anthropic or Meta — because a publisher who blocks Google’s AI training also forfeits Google search visibility, a trade few will make. For the precise directives that map each crawler to allow, block, or AI-Overviews-only, our complete robots.txt and meta-robots reference is the practical companion, and Google now exposes opt-out controls inside Search Console’s AI performance settings.

One control that gets more attention than it earns is llms.txt. Adoption stood at about 10.13% of nearly 300,000 domains surveyed in 2026 — fairly uniform across traffic tiers — yet analysis of hundreds of millions of LLM bot traffic events found that the AI bots driving search visibility rarely fetch the file. Implementing llms.txt is low-cost and harmless, but the current data does not support treating it as a lever that changes AI search visibility. We cover what it is and isn’t in our llms.txt specification and markdown-first content guide.

06 — The UpsideThe traffic AI does send back.

The crawl-to-refer story is bleak, but the referrals that do arrive are unusually valuable — at least in retail, where the best data exists. Adobe’s Q2 2026 AI Traffic Report, drawing on Adobe Analytics across its US retailer clients, found AI-referred traffic to US retailers grew 393% year over year in Q1 2026, peaking at 1,151% YoY growth in December 2025. Those AI-sourced visitors converted 42% better than non-AI channels, generated 37% more revenue per visit, and spent 48% more time on-site as of March 2026.

Two cautions keep this honest. First, the Adobe figures are retail-specific — other industries Adobe measured grew far less (media and tech in the double digits, not the triple), so the 393% number does not generalize to all websites. Second, AI referral volume is still small relative to traditional search even as it grows fast; websearchapi.ai’s read of Cloudflare Radar put AI search visits up 42.8% YoY, from 15.6 billion to 27.4 billion between Q1 2025 and Q1 2026 — real growth, off a base that remains a fraction of conventional search.

Retail / ecommerce

The clearest upside

Adobe put AI-referred US retail traffic up 393% YoY in Q1 2026, with visitors converting 42% better and spending 48% longer on-site. Worth instrumenting AI referral sources now — but treat the multiples as retail-specific.

Instrument AI referrals

Referral source mix

ChatGPT no longer alone

In one B2B sample (Goodie Wave 2), ChatGPT's share of AI chatbot referrals fell from ~89% (Oct 2025) to 62.6% (Mar–Apr 2026) as Claude and Gemini rose. Absolute shares swing heavily by sample — Statcounter showed ChatGPT far higher. Track your own analytics, not headline shares.

Measure your own split

Volume vs traditional search

Fast growth, small base

AI search visits grew 42.8% YoY to 27.4B (Q1 2026) per websearchapi.ai reading Cloudflare Radar — meaningful, but a fraction of conventional search. Plan for AI referrals as an emerging channel, not yet a primary one.

Emerging channel

Attribution

Most AI value is invisible

Because training and answer crawlers return little traffic, much of AI's influence shows up as brand awareness and assisted conversions, not last-click referrals. Build measurement that credits AI-assisted journeys, not just direct AI clicks.

Model assisted impact

07 — The ArcHow we got here — the milestone timeline.

The machine-majority web did not arrive overnight. The numbers below trace a roughly two-year arc from the first AI-crawl surge to the human-traffic crossover, each anchored to a dated, primary-sourced milestone.

2024

The first surge

AI crawler traffic +757%

AI crawler traffic surged roughly 757% across 2024 per Cloudflare Radar, with training crawlers a large share of AI bot activity — the year AI crawling went from negligible to noticeable.

Cloudflare Radar

Jul 1, 2025

Block-by-default

Cloudflare flips the default

Cloudflare made AI-crawler blocking the default for new domains. Over 1M customers had activated blocking; 2.5M+ sites fully disallowed AI training by August 2025.

Cloudflare press release

Dec 2025

416 billion blocked

Scale of the pushback

In the five months after July 1, Cloudflare customers blocked 416 billion AI bot scraping requests, per data shared by Matthew Prince — the clearest sign of how large the crawl volume had become.

Search Engine Land

Q1 2026

Retail referrals +393%

The upside materializes

AI-referred traffic to US retailers grew 393% YoY in Q1 2026 (Adobe), the first hard evidence that AI search was sending commercially valuable visitors, not just consuming content.

Adobe Digital Insights

Jun 3, 2026

Bots pass humans

57.5% machine majority

Cloudflare Radar data shared by Matthew Prince showed bots at 57.5% of HTML web traffic vs 42.5% human — the first machine majority, arriving ahead of his own forecast.

Cloudflare Radar

08 — ImplicationsWhat the data means for SEO and publishers.

Read together, these numbers point to a forward shift that is already underway. As training and answer crawling keep outpacing search crawling, the share of your content that is consumed without any referral mechanism will keep rising — which means the old proxy of crawl budget as a leading indicator of traffic is breaking down. A page can be crawled tens of thousands of times and earn a single visit. Crawl volume and referral value have decoupled, and they are unlikely to recouple on their own.

The practical projection for the next several quarters is a two-track strategy. On one track, decide deliberately which crawlers to allow, block, or permit only for AI-search — a content-licensing and access-control decision, not a default. On the other, instrument AI referral sources properly so that when the high-quality traffic Adobe documented in retail shows up in your vertical, you can see it, attribute it, and credit AI-assisted journeys rather than only last-click AI visits. Neither track is optional once machines move the majority of the web — both are at the core of how we run our agentic SEO engagements.

The one-line summary

By mid-2026 the web is machine-majority, most AI crawling is extraction rather than search, and the value exchange is deeply uneven — so allow-or-block decisions and AI-referral measurement are now core SEO work, not edge cases. Verify every figure against the primary source and date before you cite it; the numbers move monthly.

09 — ConclusionA sourced baseline for a machine-majority web.

The shape of bot traffic, mid-2026

Crawl volume and referral value have decoupled — and that is the story.

The headline of 2026 is easy to repeat: bots passed humans, 57.5% to 42.5%, per Cloudflare Radar on June 3. The more useful finding sits underneath it. Most AI crawling is training, not search, and training returns almost no traffic — so the relationship between being crawled and being visited has come apart in a way it never had under traditional search.

The exact multiples will keep moving. ClaudeBot improved from 23,951:1 to 11,122:1 in a matter of months; OpenAI and Anthropic trade the lead in bot share month to month; AI referral shares swing wildly by sample. Treat every figure on this page as a dated reading, not a constant, and re-verify against the publisher’s own page before you build a decision on it.

What is stable is the structure. The web is machine-majority, the value exchange is uneven, and the referrals that do arrive — at least in retail — are unusually valuable. That combination makes two things core SEO work: deciding which crawlers to allow or block, and measuring the AI-assisted traffic you do earn. The numbers will change; the need to act on them will not.

AI Crawler & Bot Traffic 2026: The Data

01 — The MilestoneThe web went machine-majority in June 2026.

Of AI bot HTTP requests

Of AI bot HTTP requests

Of AI bot HTTP requests

03 — Purpose SplitMost AI crawling is training, not search.

AI crawler requests by purpose · May 2026

04 — Value ExchangeThe crawl-to-refer scorecard.

05 — DefenseHow publishers are responding.

06 — The UpsideThe traffic AI does send back.

The clearest upside

ChatGPT no longer alone

Fast growth, small base

Most AI value is invisible

07 — The ArcHow we got here — the milestone timeline.

The first surge

Block-by-default

416 billion blocked

Retail referrals +393%

Bots pass humans

08 — ImplicationsWhat the data means for SEO and publishers.

09 — ConclusionA sourced baseline for a machine-majority web.

Crawl volume and referral value have decoupled — and that is the story.

Turn bot-traffic data into a crawler strategy that protects your value.

AI-era SEO engagements

The questions we get every week.

Continue exploring AI search & SEO.

AI Crawler Access Control: The 2026 Decision Matrix

Agentic Crawler Behavior: 30-Day Site Log Study 2026

Gemini Notebook's Fetcher Ignores robots.txt: Now What?

StoreBot Can't Reach Your Product Pages? Fix It Now

AI Crawler & Bot Traffic 2026: The Data

01 — The MilestoneThe web went machine-majority in June 2026.

02 — CompositionInside the bots: who is crawling.

Of AI bot HTTP requests

Of AI bot HTTP requests

Of AI bot HTTP requests

03 — Purpose SplitMost AI crawling is training, not search.

AI crawler requests by purpose · May 2026

04 — Value ExchangeThe crawl-to-refer scorecard.

05 — DefenseHow publishers are responding.

06 — The UpsideThe traffic AI does send back.

The clearest upside

ChatGPT no longer alone

Fast growth, small base

Most AI value is invisible

07 — The ArcHow we got here — the milestone timeline.

The first surge

Block-by-default

416 billion blocked

Retail referrals +393%

Bots pass humans

08 — ImplicationsWhat the data means for SEO and publishers.

09 — ConclusionA sourced baseline for a machine-majority web.

Crawl volume and referral value have decoupled — and that is the story.

Turn bot-traffic data into a crawler strategy that protects your value.

AI-era SEO engagements

The questions we get every week.

Continue exploring AI search & SEO.

AI Crawler Access Control: The 2026 Decision Matrix

Agentic Crawler Behavior: 30-Day Site Log Study 2026

Gemini Notebook's Fetcher Ignores robots.txt: Now What?

StoreBot Can't Reach Your Product Pages? Fix It Now