The AI citation ranking factors that actually drive visibility in ChatGPT, Gemini, Perplexity, and Google AI Overviews are finally measurable. In May 2026, Cyrus Shepard of Zyppy published a meta-analysis that synthesizes 54 experiments, patents, and case studies into a single scored ranking of 23 factors — the first time the GEO field has weighted its advice by evidence strength rather than opinion.
That matters because most generative engine optimization content lists the same factors without ranking them, leaving practitioners to guess which moves are worth the effort. When you pair the Zyppy scoring with the large-scale correlation studies from Ahrefs — 75,000 brands, then nearly 17 million citations, then 4 million AI Overview URLs — a clear and occasionally surprising hierarchy emerges. Some long-held SEO instincts hold up. Others, including a widely-copied statistic about fresh content, do not.
This guide ranks the signals by evidence, decodes the brand-versus- backlinks correlation that is the most actionable finding in the data, explains the fan-out retrieval shift that rewires citation strategy, and converts it all into an evidence-weighted checklist. Every figure is attributed to its primary study, and one popular number is corrected rather than repeated.
- 01Accessibility and search rank still anchor everything.Zyppy scores URL accessibility 9.5/10 and search rank 9.4/10 as the top two factors. A page blocked, paywalled, or returning errors cannot be cited; one ranking in Google's top 10 has a real edge. The thesis: win SEO, win AI citations, with extra steps.
- 02Brand mentions correlate ~3x more strongly than backlinks.Ahrefs studied 75,000 brands and found branded web mentions correlate 0.664 with AI Overview visibility versus 0.218 for backlinks. The top quartile of brands by mentions averages 169 AI Overview mentions versus 14 for the next tier.
- 03Fan-out makes topic-cluster ranking a citation multiplier.AI engines expand a query into many sub-queries and pull from a wider SERP. Only 38% of AI Overview citations now come from the top 10, down from 76% in mid-2025. Ranking across a cluster beats ranking once for a head term.
- 04Cited content is fresher — but not by the number you've seen.Across nearly 17 million citations, AI-cited content averages about 25.7% fresher than organic top-10 results. The viral '4.3x more cited for fresh content' figure traces to no primary study and should not be repeated.
- 05LLMs.txt scores lowest; structure and editorial content win.Shepard found no credible evidence LLMs.txt influences citations (2.0/10). What does correlate: AI-ready structure, self-contained passages, and editorial blog content, which makes up over half of all AI citations in BuzzStream's analysis.
01 — The Evidence BaseA meta-analysis of 54 studies, scored by evidence.
The backbone of this analysis is the AI Citation Ranking Factors study Cyrus Shepard published on Zyppy Signal on May 7, 2026. Rather than asserting which factors matter, it synthesizes 54 experiments, patents, and case studies covering ChatGPT, Gemini, and Perplexity, and scores each of 23 factors on three criteria: repeatability across studies, strength of evidence, and official support from platform documentation or patents. The output is a single 0-to-10 score per factor — a rare attempt to weight GEO advice by how well it is actually supported.
We cross-reference that scoring against four large-scale primary datasets so the conclusions do not rest on one source. Ahrefs contributes three: a 75,000-brand correlation study on what predicts AI Overview visibility, a nearly 17-million-citation study on content freshness, and an 863,000-keyword analysis of where AI Overview citations come from. Seer Interactive contributes a 5.47-million-query study on what citation does to click-through. The combination lets us separate signals that show up in one vendor's blog from signals that replicate across independent datasets.
Zyppy 23-factor scoring
Cyrus Shepard synthesizes experiments, patents, and case studies into a single evidence-weighted ranking. URL accessibility tops the list at 9.5; LLMs.txt sits at the bottom at 2.0.
Ahrefs brand visibility
Branded web mentions correlate 0.664 with AI Overview visibility versus 0.218 for backlinks. The top three signals are all off-site brand signals, not link metrics.
Ahrefs freshness study
AI-cited content averages 1,064 days old versus 1,432 for organic Google top-10 — about 25.7% fresher. ChatGPT shows the strongest freshness preference of the platforms tested.
One framing point before the numbers: nearly every figure below is a study-stated correlation, not proven causation. A high correlation between brand mentions and citations does not prove that adding mentions causes citations; it is equally possible that already-strong brands earn both. The original authors are explicit about this, and we preserve their caveats. Treat the scores and correlations as a prioritized map of where the evidence points, not as guaranteed levers.
02 — Top SignalsThe five factors that score highest.
Five factors cluster at the top of the Zyppy ranking, and they tell a coherent story: AI engines cite pages they can reach, that already rank, that match the query closely, and whose content they are permitted to preview. None of this is exotic — it is the mechanical reality of how grounding and retrieval work, scored against the weight of the evidence behind each claim.
Top five AI citation factors · Zyppy evidence score
Source: Zyppy AI Citation Ranking Factors, May 7, 2026 (score out of 10)URL accessibility (9.5) is the floor. As Shepard puts it, a URL typically needs to be accessible and crawlable, either during training or grounding, for an AI engine to cite it. Pages blocked by robots.txt, sitting behind a paywall, or returning errors are effectively invisible. Search rank (9.4) sits just below: the Ahrefs data shows 38% of AI Overview citations come from pages ranking in the top 10, which is why the study distills its whole thesis into a single line — win SEO, win AI citations, most of the time, with extra steps.
Preview control (9.2) is the overlooked one. Pages using a nosnippet meta directive to suppress preview snippets can inadvertently reduce or eliminate their AI citation visibility, because AI engines lean on snippet data during grounding. Several teams that locked down snippets for traditional-SERP reasons are quietly excluding themselves from AI answers. Query-answer match (9.2) rewards pages that directly answer the specific phrasing of a query over content that is merely topically adjacent.
Win SEO, win AI citations (most of the time, with extra steps.)— Cyrus Shepard, founder of Zyppy
nosnippet or max-snippet directives. The top-scoring factors are accessibility ones — a single misconfigured directive can zero out citations that ranking alone would have earned.03 — The Full RankingAll 23 factors, grouped by category.
The Zyppy study publishes its scores but not a tidy reference table. Below is the full ranking, grouped into the categories we use when scoping GEO work — Technical, Off-page, Content, and a low-evidence tier — with a one-line action per factor. Scores are Shepard's; the grouping and actions are ours. The pattern to notice: the highest-evidence factors are technical accessibility and off-site brand signals, while several content-formatting tactics that dominate GEO advice score in the merely-good range.
| Factor | Score | What to do |
|---|---|---|
| Technical — accessibility and retrieval | ||
| URL Accessibility | 9.5 | Keep pages crawlable, unblocked, and returning 200s during training and grounding windows. |
| Search Rank | 9.4 | Treat classic SEO ranking as the base layer; top-10 placement still has a real citation edge. |
| Fan-out Rank | 9.3 | Rank across the cluster of related sub-queries an AI engine expands a question into, not just the head term. |
| Preview Control | 9.2 | Audit nosnippet directives; blocking previews can suppress AI citations. |
| Off-page — authority and brand presence | ||
| Topic Cluster Ranking | 8.9 | Build depth across a topic so you become a repeated fan-out destination, not a one-keyword page. |
| Brand Web Mentions | High* | Earn unlinked and linked mentions across the web; correlates 0.664 with AI Overview visibility in the Ahrefs study. |
| Content — structure and answerability | ||
| Query-Answer Match | 9.2 | Answer the exact phrasing of target queries directly, not just the broad topic. |
| Answer Near the Top | 8.8 | Lead each section with the answer; content near the top of the page is more likely to make the cut. |
| AI-ready Structure | 8.6 | Use clear headings and tables so passages are cleanly extractable as standalone answers. |
| Self-Contained Passages | 8.0 | Write each section to stand alone as a complete answer unit without external context. |
| Cites Sources Internally | 8.0 | Reference primary data and sources within the content to signal reliability. |
| Freshness | 7.0 | Keep content current; cited content skews fresher, though the effect is moderate, not the dominant factor. |
| Low evidence — handle with skepticism | ||
| LLMs.txt | 2.0 | Lowest-scoring factor. No credible evidence it influences citations; do not prioritize it over the signals above. |
04 — Brand vs LinksBrand mentions correlate 3x more strongly than backlinks.
This is the most actionable finding in the entire dataset. Ahrefs analyzed 75,000 brands using Spearman correlation on domains with a Domain Rating above 40 and keywords with at least 800 monthly searches. The result reorders the priority list a decade of SEO built around links: the three signals that correlate most strongly with AI Overview visibility are all off-site brand signals — branded web mentions (0.664), branded anchor texts (0.527), and branded search volume (0.392). Domain Rating lands at 0.326, and number of backlinks comes in last at 0.218.
| Signal | Correlation | Type | vs backlinks |
|---|---|---|---|
| Branded web mentions | 0.664 | Brand | 3.0x |
| Branded anchor texts | 0.527 | Brand | 2.4x |
| Branded search volume | 0.392 | Brand | 1.8x |
| Domain Rating | 0.326 | Traditional SEO | 1.5x |
| Number of backlinks | 0.218 | Traditional SEO | 1.0x (baseline) |
The disparity is even sharper at the extremes. The top 25% of brands by web mention volume average 169 AI Overview mentions versus just 14 for the next quartile — more than a tenfold gap — while the bottom half by mentions barely register, averaging 0 to 3. This is a winner-takes-most distribution: AI visibility concentrates among brands that have built broad web presence, not merely strong link profiles. A separate analysis by Wellows found Domain Authority's correlation with AI citation likelihood has fallen to roughly r=0.18, down from r=0.43 before 2024, converging on the same conclusion from a different dataset.
The strategic read is a genuine reallocation, not a tweak. If your AI visibility program is still pouring budget into raw link acquisition, the data suggests redirecting toward digital PR, unlinked brand mentions, podcast and publication coverage, and the kind of category presence that grows branded search. Our AI citation visibility audit of 500 SaaS sites found the same pattern in practice — the most-cited brands were rarely the ones with the biggest backlink counts.
Web mentions correlate much more strongly than backlinks [with AI Overview visibility].— Louise Linehan & Xibeijia Guan, Ahrefs
05 — Fan-out RetrievalWhy topic clusters now beat the single top-ranking page.
Fan-out is the mechanical shift that rewires citation strategy. Instead of running a single query and pulling the top results, AI systems expand a user's question into multiple related sub-queries and retrieve across a wider set of SERP results. A page that ranks for many related queries across a topic cluster therefore gains disproportionate citation exposure — which is exactly why Zyppy scores fan-out rank at 9.3 and topic cluster ranking at 8.9, the highest content-strategy signal in the study.
The Ahrefs citation data makes the consequence concrete. Analyzing 863,000 keywords and 4 million AI Overview URLs, the study found that only 38% of AI Overview citations now come from pages ranking in Google's top 10, down sharply from 76% in mid- 2025. Pages ranking 11 to 100 account for 31.2% of citations, and pages beyond rank 100 account for another 31.0%. In other words, nearly two-thirds of citations now go to pages outside the top 10 — because fan-out reaches deeper into the results for each of its sub-queries.
Where AI Overview citations come from · by organic rank
Source: Ahrefs, 863K keywords / 4M AI Overview URLs, March 2026There is a second-order effect worth naming. YouTube is now the single most-cited domain in AI Overviews, having grown 34% over six months to represent 5.6% of all AI Overview URLs — a direct result of fan-out prioritizing video answers for certain query types. The takeaway for content teams is that breadth across a topic, plus presence on the formats AI engines favor for specific intents, matters more than owning the single top spot for a head term. This is the engine behind our citation pattern study and the reason generative engine optimization (GEO) has shifted from page-level tactics to cluster-level coverage. We dug deeper into the rank shift in our analysis of the 38% citation drop from top-10 pages.
06 — The Freshness MythCited content is fresher — but not by the number you've seen.
Freshness is where GEO advice most often goes wrong, so it deserves a careful correction. You have likely seen the claim that fresh content gets cited "4.3x more" in AI search. We could not trace that figure to any primary study. It does not appear in the Zyppy analysis, which scores freshness a moderate 7.0/10, and it does not appear in the Ahrefs freshness study either. It surfaces only on secondary aggregation blogs without a published methodology or sample size. Do not repeat the 4.3x number.
The verified figure is more modest and far better supported. Ahrefs measured content age across 16.975 million cited URLs spanning seven platforms — ChatGPT, Perplexity, Gemini, Copilot, and AI Overviews among them. AI-cited content averages 1,064 days old (about 2.9 years) versus 1,432 days (about 3.9 years) for organic Google top-10 results. That is a 25.7% freshness advantage, not a multiplier. Real, replicable, and useful — but a far cry from "4.3x."
Average age
Across 16.975 million cited URLs on seven platforms, AI-cited content averages about 2.9 years old — meaningfully fresher than organic top-10 results.
Average age
Classic Google top-10 results skew older at roughly 3.9 years. The 368-day gap is the source of the 25.7% freshness advantage AI citations show.
Strongest freshness bias
ChatGPT cites the freshest content of the platforms tested, averaging 958 days. Google AI Overviews behaves most like traditional search, with the smallest freshness bias.
The practical guidance survives the correction intact: keeping content current is worth doing, the effect is real, and ChatGPT in particular rewards recency more than Google AI Overviews does. But freshness is a moderate factor (7.0/10), not a dominant one, and it sits well below accessibility, rank, and brand presence in the evidence hierarchy. A content team that prioritizes a constant refresh cadence over fixing crawlability or building brand mentions is optimizing the wrong variable. Correcting the "4.3x" myth is itself a small case study in why evidence-weighted ranking matters more than repeating whatever number went viral.
07 — Content That Gets CitedEditorial blog content does the heavy lifting.
If accessibility and brand presence determine whether you can be cited, content type determines what actually gets pulled. BuzzStream analyzed 4 million citations from 3,600 AI prompts across 10 industries and found that blog and content pages make up 53.46% of all citations — far ahead of news at 14.09% and social at 8.71%. With brand-owned queries excluded, earned editorial content accounted for roughly 80% of citations. Press releases distributed through syndication, by contrast, accounted for a vanishing 0.04% of the entire dataset.
That last figure is worth sitting with. The press-release-and-wire playbook that still anchors many PR budgets is essentially invisible to AI citation systems, while substantive editorial content — the kind written to answer a question well — does the overwhelming majority of the work. One caveat on the BuzzStream data: it was collected over a single week starting late January 2026, so treat the exact percentages as directional rather than permanent.
Question-answering depth
Makes up 53.46% of all AI citations in the BuzzStream study, and roughly 80% once brand-owned queries are excluded. The single highest-leverage content format for earning citations.
Text plus images & structure
Wellows found pages combining text, images, and structured elements show 156% higher selection rates than text-only pages. Independently unverified, so treat as directional, but the direction is clear.
Timely coverage
14.09% of citations — a real but secondary share. Useful for query types with a recency requirement, and a fit for brands that already publish frequently.
Wire syndication
Just 0.04% of all citations in the BuzzStream dataset. The traditional press-release-distribution playbook is effectively invisible to AI citation systems. Do not rely on it for AI visibility.
There is one more reason to prioritize getting cited beyond the citation itself: it appears to lift clicks. Seer Interactive's April 2026 study of 53 brands across 5.47 million queries found that being cited in AI Overviews is associated with 120% more organic clicks per impression versus not being cited. The authors are careful to flag the causation caveat — stronger brands may simply be more likely to earn both citations and clicks — but the correlation is large enough to make citation a visibility goal worth pursuing, not just a vanity metric.
08 — The Action PlanAn evidence-weighted checklist.
Most GEO checklists are ungrounded opinion. This one maps each action to the evidence weight behind it, so you spend effort in proportion to what the data supports. Work top to bottom: accessibility and rank are prerequisites, brand and cluster presence are the differentiators, and structure and freshness are refinements — not the place to start.
Crawlability & previews
Confirm key pages are crawlable, return 200s, and are not blocking previews via nosnippet. The highest-evidence factor and the cheapest to fix.
Win classic SEO first
Earn top-10 placement on your priority queries. Search rank is the second-highest factor; AI citation still rides on organic strength most of the time.
Brand mentions over links
Redirect budget from raw link acquisition toward digital PR, unlinked mentions, and category presence. Mentions correlate ~3x stronger than backlinks.
Cover the whole topic
Rank across the related sub-queries fan-out expands into, so you become a repeated citation destination rather than a single page.
Make passages extractable
Clear headings, tables, self-contained sections, and answers near the top. Each section should stand alone as a complete answer unit.
Keep content current
Maintain a refresh cadence, especially for ChatGPT-skewed topics. Real but moderate — do this after the higher-weight factors, not instead of them.
Looking forward, the trajectory in this data points one direction. As fan-out deepens and AI engines pull from ever further outside the top 10, the single-page, single-keyword model of SEO will keep losing ground to brand-and-cluster strategies. The 38% top-10 overlap that replaced last year's 76% is unlikely to reverse; if anything, expect it to keep falling as retrieval gets more aggressive. Teams that treat AI visibility as a brand-presence and topical-authority problem — not a link-count problem — are positioned for where the evidence is heading, not just where it is today. For the structural side of that work, our answer engine optimization (AEO) fundamentals and brand citation audit checklist translate these signals into a working program, and our agentic SEO service builds the cluster coverage and brand-presence engine the data rewards.
09 — ConclusionThe signals that actually move the needle.
Win SEO, build brand presence, and earn the right to be cited — in that order.
Strip away the noise and the 2026 citation data tells a remarkably coherent story. AI engines cite pages they can reach, that already rank, that match the query, and whose content they are permitted to preview — and among those, they favor brands with broad web presence and content organized to be extractable. The Zyppy meta-analysis and the large Ahrefs datasets converge on the same hierarchy from independent angles, which is exactly what gives it credibility.
The most consequential reordering is brand over links. With branded web mentions correlating 0.664 against 0.218 for backlinks, and the top quartile of brands earning 169 AI Overview mentions versus 14 for the next tier, the evidence asks marketers to fund category presence and digital PR rather than chase link counts. The freshness story is the cautionary tale alongside it: a real 25.7% advantage got inflated into an unsourced "4.3x" claim that should be retired, not repeated.
The honest framing is the useful one. These are correlations, not guarantees, and the original authors say so plainly. But a prioritized, evidence-weighted map beats the undifferentiated factor lists that pass for GEO advice. Fix accessibility, win classic SEO, build brand mentions, cover the cluster, structure for extraction, and keep content current — in that order. That is what the data, not the hype, actually supports.