MarketingFramework11 min readPublished May 30, 2026

Seven UTM parameters · GA4's 19 default channels · utm_medium decides the channel

UTM Governance: The 2026 Campaign Taxonomy Reference

Attribution models can only be as good as the data feeding them. Inconsistent UTM tagging fragments acquisition reports upstream of any model you choose. This reference covers GA4's channel rules, three naming-convention models, the new AI Assistants channel, and the governance ownership that keeps a taxonomy from drifting.

DA
Digital Applied Team
Senior strategists · Published May 30, 2026
PublishedMay 30, 2026
Read time11 min
SourcesGA4 docs + measurement-ops
GA4 default channels
19
+ Unassigned + (other)
Recognized UTM params
7
5 classic + utm_id + platform
AI Assistants channel
May 14
2026 announcement (SEJ)
GA4 attribution models
2
since April 2024
−4 removed

A UTM governance framework is the set of naming rules, ownership roles, and enforcement mechanisms that keeps your campaign-tracking tags consistent so GA4 can attribute traffic correctly. The reason it matters is unglamorous: attribution is a downstream problem, but most attribution failures originate upstream — in the tags themselves. A data-driven model fed inconsistent UTM values produces confident, precise, and wrong numbers.

UTM parameters — the five originals plus two GA4 additions — look trivial. They are five-to-seven query-string fields appended to a URL. But GA4 reads those fields through a strict, case-sensitive, regex-driven engine. utm_source=Facebook and utm_source=facebook become two distinct sources. A non-standard utm_mediumsilently drops a session into "Unassigned." And once a session is recorded, you cannot retroactively fix the tag that recorded it.

This reference is deliberately operational. It covers the seven parameters GA4 recognizes, why utm_medium is the field that decides everything, the exact channel-grouping rules, the new AI Assistants channel added in May 2026, a copyable canonical source/medium reference table, three naming-convention models, and the ownership structure that keeps a taxonomy alive after the launch spreadsheet goes stale. Treat our UTM parameters reference guide as the prerequisite; this post is the governance layer that sits on top of it.

Key takeaways
  1. 01
    Attribution failures are usually a tagging problem.Garbage UTM inputs produce garbage attribution outputs regardless of model sophistication. Fix the data upstream before debating data-driven vs. last-click.
  2. 02
    utm_medium is the single most consequential field.GA4's Default Channel Grouping applies regex rules to utm_medium to classify each session. A wrong or missing value drops the session into 'Unassigned' — invisible in channel reports.
  3. 03
    Case sensitivity quietly splits your data.utm_source=Facebook and utm_source=facebook are two distinct sources in GA4. You cannot retroactively merge them; consistency has to be enforced at tag time.
  4. 04
    GA4 added a native AI Assistants channel in May 2026.Announced May 14, 2026, it classifies referred traffic from assistants like ChatGPT, Gemini, DeepSeek, Copilot, and Grok via the rule medium exactly matches 'ai-assistant'. Google's own AI Overviews stay in Organic Search.
  5. 05
    Documentation alone does not work — ownership does.A locked shared builder plus a named Taxonomy Guardian who approves new values before launch outperforms any convention that lives only in a wiki nobody reads.

01The Real ProblemAttribution is downstream of tagging.

The marketing-analytics conversation is dominated by attribution models — data-driven, last-click, multi-touch, media mix modeling. That is the wrong place to start. Every attribution model consumes the same raw material: the source, medium, and campaign recorded on each session. If that raw material is fragmented, every model downstream inherits the fragmentation. The most sophisticated data-driven model in the world cannot recover signal that was never captured cleanly.

Consider a single paid-social campaign tagged five different ways across a quarter: facebook / cpc, Facebook / CPC, fb / paid-social, meta / paid_social, and an untagged dark-post link that lands in Direct. In GA4 those become four or five separate source/medium pairs and at least two different channels — and the untagged one disappears into Direct entirely. The campaign looks like five weak performers instead of one strong one. No attribution model fixes that; the damage is already baked into the session records.

This is why governance comes first. Clean source/medium data is the shared dependency of every measurement decision you make afterwards — which is exactly the point our companion piece on multi-touch attribution makes from the modeling side: the model is only as trustworthy as the source/medium data underneath it.

The cardinal rule
Once a value passes into GA4, you cannot go back in time and fix inconsistent ones. Always use the same value for the same entity. Consistency is not a reporting cleanup task — it is a tag-time discipline, because the cleanup window does not exist.

Marketing-ops practitioners describe a recurring pattern: a large share of organizations invest meaningfully in campaigns without a reliable way to attribute results, largely because tagging was never governed. We will not put a single hard percentage on that — the figures circulating in vendor blogs are not independently verified — but the qualitative reality is consistent across every audit we run: the first place attribution breaks is the tag, not the model.

02The VocabularySeven parameters GA4 actually reads.

UTM stands for Urchin Tracking Module — named after Urchin Software, the analytics company Google acquired to build Google Analytics. The five original parameters are utm_source, utm_medium, utm_campaign, utm_term, and utm_content. GA4 added two more that older guides often miss: utm_id for campaign deduplication and utm_source_platform for distinguishing automated platform traffic.

Required
utm_medium
the channel key

The single field GA4's channel-grouping engine reads to classify a session. A wrong value here is the leading cause of Unassigned traffic. Treat it as the most controlled field in your whole taxonomy.

Drives channel grouping
Required
utm_source
the platform / referrer

The specific origin: google, facebook, newsletter, partner-name. Overrides the HTTP referrer entirely — if utm_source is present, GA4 ignores the real referring domain. Use a recognized, lowercase value.

Overrides referrer
Required
utm_campaign
the initiative name

The human-readable campaign identifier. This is where naming conventions earn their keep — a structured pattern here makes filtering, grouping, and reporting tractable across hundreds of campaigns.

Where taxonomy lives
Optional
utm_content
creative variation

Differentiates creative within one campaign: cta-button-top vs. text-link-sidebar, or image vs. video vs. carousel. Optional, but the backbone of any creative-level performance reporting.

A/B & creative splits
Optional
utm_term
paid keyword

Captures the keyword for manually-tagged paid search. Note: when Google Ads auto-tagging (gclid) is active, keyword data flows via the gclid, not utm_term — the two methods conflict unless manual override is explicitly configured.

Manual paid search only
GA4-era
utm_id + source_platform
dedup & platform

utm_id is a campaign ID for deduplication and matching GA4 sessions to offline CRM or ad-platform records. utm_source_platform (Google Ads, DV360, SA360) flags automated traffic so it unlocks more granular channel rules.

New in GA4

Two non-obvious behaviors are worth committing to memory. First, parameter values are case-sensitive: utm_source=Facebook and utm_source=facebook are two different traffic sources. Standardize on lowercase, always. Second, utm_source overrides the HTTP referrer. If a URL carries a utm_source, GA4 uses it and discards the real referring domain. That is powerful when correct and corrupting when wrong — which is exactly why you must never tag internal links.

Never tag internal links
Applying UTMs to navigation within your own site overwrites the original traffic source. A visitor who arrives via Organic Search and then clicks a UTM-tagged internal banner has their entire session re-attributed to that banner. The same caution applies to organic search results — you cannot tag them, and attempting to do so corrupts source data.

03The Keystone FieldWhy utm_medium decides everything.

Most UTM guidance treats the five parameters as roughly equal. They are not. utm_mediumis architecturally privileged: it is the primary key in GA4's Default Channel Grouping engine. Google's channel definitions are a sequence of regex rules, and the majority of them read utm_mediumfirst to decide whether a session is Paid Search, Organic Social, Email, Referral, or something else. Get this field wrong and the session does not just land in the wrong channel — it frequently lands nowhere, in "Unassigned."

Without a consistent value of 'cpc' for utm_medium and the name of the platform in lowercase as the utm_source, we would never get a clean report. Once the values pass into Google Analytics, you can't go back in time and fix inconsistent ones.— Phill Kletting, CXL UTM Parameters guide

The destructiveness compounds. A wrong utm_source misattributes one campaign. A wrong utm_medium mis-classifies an entire channel — every session carrying that value is affected, and those sessions vanish from the channel reports executives actually look at. The fix is to treat utm_medium as a tightly-controlled enumeration: a short, fixed list of approved values that map cleanly to GA4 channels, enforced at the builder level so no one can free-type newsletter when GA4 only accepts email.

A concrete example: the Email channel in GA4 only matches four spellings — email, e-mail, e_mail, and e mail. Tag a campaign medium=newsletter and every one of those sessions falls into Unassigned, no matter how clean the rest of the URL is. The medium is unforgiving by design.

GA4 default channels
Named channels
19

Affiliates, AI Assistants, Audio, Cross-network, Direct, Display, Email, Mobile Push Notifications, Organic Search, Organic Shopping, Organic Social, Organic Video, Paid Other, Paid Search, Paid Shopping, Paid Social, Paid Video, Referral, SMS. Plus Unassigned and (other), which are not true channels.

as of May 2026
Email medium variants
Only four route to Email
4

email, e-mail, e_mail, e mail. Anything else — newsletter, mail, e-newsletter — falls into Unassigned. The most common single-field tagging error we see in audits.

exact-match only
Attribution models
Down from six
2

Since April 2024, GA4 keeps only data-driven (default) and paid-and-organic last-click; first-click, linear, time-decay, and position-based were removed. With only two views of cross-channel performance, UTM fragmentation distorts the only lens you have.

since April 2024

04GA4 Channel RulesHow GA4 maps a tag to a channel.

GA4's Default Channel Grouping is a deterministic rule set. Each session is evaluated against the rules in order, and the first match wins. Understanding the most common rules lets you reverse-engineer the correct tag for any campaign type rather than guessing. Below are the rules that account for the overwhelming majority of marketing traffic.

GA4 channel rules · what utm_medium has to be

Source: Google Analytics Help — Default channel group (May 2026)
Paid Search / Social / Video / Shoppingmedium matches ^(.*cp.*|ppc|retargeting|paid.*)$ AND a matching source
regex + source
Organic Socialsocial source OR medium in social / social-network / sm / social media
source or medium
Emailsource or medium matches email / e-mail / e_mail / e mail
4 spellings
Affiliates · Audio · SMSmedium exactly matches affiliate / audio / sms
exact match
Referralmedium is one of referral, app, or link
3 values
AI Assistantsmedium exactly matches ai-assistant
exact match

The paid channels carry a footgun worth isolating. The medium regex ^(.*cp.*|ppc|retargeting|paid.*)$ matches cpc, cpm, ppc, paid-social, and more — but for a session to land in Paid Social specifically, the utm_sourcemust also match GA4's recognized social-platform list. Use medium=paid_social with an unrecognized source and the session falls to Paid Other, not Paid Social. The medium passes the regex; the source fails the platform check.

Two more rules trip teams up. First, Google Ads auto-tagged campaigns (Demand Gen, Performance Max, Smart Shopping) land in the Cross-network channel even when the medium reads cpc and source reads google — the campaign type from Google Ads metadata takes precedence over generic UTM values. Second, the platform rebrand trap: utm_source=meta is noton GA4's recognized social-site list. Marketers who switched from facebook to meta after the 2021 rebrand saw their paid social traffic silently reclassified. The correct value remains facebook.

When you outgrow the default
GA4 lets each property define up to two custom channel groups, each with up to 25 channels, and applies them retroactively — unlike the old Universal Analytics groupings, which only worked forward. Set a custom group as the Primary Channel Group to override the default mapping for your own taxonomy. This is the escape hatch when your medium values are clean but GA4's default buckets do not match how your business thinks about channels.

05The New ChannelGA4's AI Assistants channel.

In an announcement covered by Search Engine Journal on May 14, 2026, Google added a native AI Assistantschannel to GA4's Default Channel Grouping. It classifies traffic referred from AI assistants — confirmed by name for ChatGPT, Gemini, DeepSeek, Copilot, and Grok — based on the HTTP referrer header. The mechanic is the same regex pattern as every other channel: GA4 auto-assigns medium = "ai-assistant" and campaign = "(ai-assistant)" to matching sessions, and the channel rule is simply medium exactly matches "ai-assistant".

Two boundaries matter. First, the channel explicitly excludes Google's own AI Overviews and AI Mode — that traffic continues to count as Organic Search. Second, AI traffic that arrives without a referrer — in-app browsers, copy-paste navigation — still lands in Direct, the same as any other referrer-less visit. Industry analysts have offered directional estimates that a meaningful but incomplete share of AI referrals carry a usable referrer header; we treat that as commentary rather than a published Google figure, so the practical takeaway is simply that the channel undercounts AI traffic by an unknown margin.

A governance trigger, not just a feature
The native channel is automatic, so you do not tag inbound AI referrals yourself. But it is a prompt to revisit your taxonomy: if you send outbound links into AI-assistant contexts, or you want to segment AI traffic in custom reports, decide now whether to lean on the native ai-assistant medium or build a custom channel group. This is exactly the kind of new pattern a governed taxonomy absorbs cleanly and an ungoverned one misclassifies for months.

The native channel is the formalization of a direction Google had already signaled — earlier guidance walked teams through building custom channel groups with regex to catch assistant traffic. Our read going forward: AI-assistant referrals will become a standing line item in channel reports, and the brands that benefit are the ones whose governance already treats new mediums as a managed lifecycle rather than an ad-hoc addition. If your team is simultaneously wrestling with consent signals, note that referrer availability and Consent Mode v2 both shape how complete your UTM-based attribution can ever be.

06Canonical ReferenceThe DA UTM canonical source/medium table.

This is the artifact worth bookmarking. Practitioners normally have to synthesize correct source/medium values from four or five separate sources. The table below maps common campaign types to the canonical utm_source and utm_medium, the resulting GA4 default channel, and the common mistake that breaks each one. All values derive from GA4's published channel rules; verify against the live help page before locking a property-wide taxonomy.

Campaign type
Google paid search
source / medium
google / cpc
GA4 channel · common mistake
→ Paid Search. Mistake: capitalizing CPC, or relying on manual UTMs while gclid auto-tagging is on (they conflict).
Campaign type
Meta paid social
source / medium
facebook / paid_social
GA4 channel · common mistake
→ Paid Social. Mistake: using source=meta — not on GA4's social list, falls to Paid Other / Unassigned.
Campaign type
LinkedIn paid social
source / medium
linkedin / paid_social
GA4 channel · common mistake
→ Paid Social. Mistake: medium=ppc without a recognized social source lands in Paid Other.
Campaign type
Email newsletter
source / medium
newsletter-name / email
GA4 channel · common mistake
→ Email. Mistake: medium=newsletter — not one of the 4 accepted spellings, falls to Unassigned.
Campaign type
Affiliate partner
source / medium
partner-name / affiliate
GA4 channel · common mistake
→ Affiliates. Mistake: medium=referral routes to Referral instead, hiding the affiliate program.
Campaign type
SMS campaign
source / medium
sms / sms
GA4 channel · common mistake
→ SMS. Mistake: medium=text or medium=mms — only exact 'sms' matches.
Campaign type
Display / banner
source / medium
publisher / display
GA4 channel · common mistake
→ Display. Mistake: medium=banner works, but medium=ad falls to Unassigned.
Campaign type
YouTube paid video
source / medium
youtube / paid_video
GA4 channel · common mistake
→ Paid Video. Mistake: medium=video alone (no paid prefix + source) reads as Organic Video.
Campaign type
Organic social post
source / medium
facebook / social
GA4 channel · common mistake
→ Organic Social. Mistake: tagging an organic post as paid inflates Paid Social spend reporting.
Campaign type
Microsoft (Bing) paid search
source / medium
bing / cpc
GA4 channel · common mistake
→ Paid Search. Mistake: source=microsoft instead of bing may miss the search-engine list.
Campaign type
Podcast / audio sponsorship
source / medium
podcast-name / audio
GA4 channel · common mistake
→ Audio. Mistake: medium=podcast — only exact 'audio' matches the Audio channel.
Campaign type
AI-assistant outbound link
source / medium
(auto) / ai-assistant
GA4 channel · common mistake
→ AI Assistants. Inbound referrals are auto-classified; for outbound links decide between native medium vs. a custom group.
Campaign type
Mobile push notification
source / medium
app-name / push
GA4 channel · common mistake
→ Mobile Push Notifications. Mistake: medium=notification works; medium=app routes to Referral.
Campaign type
QR code (print / OOH)
source / medium
print-piece / qr
GA4 channel · common mistake
→ Unassigned (no 'qr' channel). Use a custom channel group or a recognized medium to classify it.

Notice how often the column that breaks is the medium, and how often the failure mode is silent — Unassigned, Paid Other, or a re-classification into the wrong channel. None of these throw an error; they just quietly distort the report. A locked builder that only offers approved source/medium pairs eliminates the entire right column.

07Naming ModelsThree taxonomy models, one right fit.

The utm_campaign field is where naming conventions live, and practitioners converge on three structural models. The right choice is a function of team size, automation maturity, and how much human readability you need at a glance. Picking the wrong model is survivable; having no model is not.

Encoded IDs
Cryptic model

Encoded identifiers like cmp_9021 resolved against a lookup table. Maximally compact and machine-friendly, but unreadable without the table and dependent on mature automation. Example: cmp_9021. Best for large enterprises with strong tooling.

Enterprise + automation
Fixed order
Positional model

Fixed-order, delimiter-separated segments: facebook-cpc-summer-sale-cta-top. Human-readable and simple, but brittle — a missing or reordered segment silently changes meaning. Example: 2026-05_us_paid-search_bofu_summer-sale. Best for small teams with predictable structures.

Small team · predictable
Attributed pairs
Key-Value model

Self-describing key:value pairs: src:google_med:cpc_cam:summer-sale. The most scalable and auditable — every segment is labeled, so order does not matter and parsing is trivial. The preferred model for mid-to-enterprise organizations running many concurrent campaigns.

Mid-to-enterprise scale

Whichever model you adopt, the mechanical rules are universal. Use hyphens, never spaces — a space encodes as %20 and produces messy, error-prone tags. A common convention combines both delimiters: underscores between convention components and hyphens within values, e.g. 2026-05_us_paid-search_bofu_summer-sale. Keep total URL length under roughly 242 characters as a practical integrity limit across email clients and SMS, and keep campaign names short — under about 20 characters — so they stay manageable in reports.

One privacy caveat that is easy to forget: campaign names appear in UTM parameters that are visible to users in the address bar. Avoid putting real audience names, internal budget codes, or sensitive targeting details into a utm_campaign value. Use neutral theme and product identifiers instead.

The establishment of naming conventions is governance, and without accountability, even the best designed framework would collapse under real-world stress.— Supermetrics, Campaign Naming Conventions

08Governance OwnershipThe Taxonomy Guardian and the lifecycle.

Here is the part most UTM guides skip: a convention documented in a wiki nobody reads will drift back to chaos within a quarter. Enforcement mechanisms matter more than documentation. The two that consistently hold up are a locked shared builder that only emits approved source/medium pairs, and a named Taxonomy Guardian — typically a performance marketer or marketing-operations lead — who owns the approved parameter list, reviews every new value before launch, and runs periodic audits. Strategic oversight from a CMO or Head of Growth keeps the taxonomy aligned with how the business reports.

Best practice is establishing strict naming conventions documented in a shared reference guide, but enforcement mechanisms matter more than documentation.— Uplifter, UTM Best Practices

The second piece of operational discipline is treating UTM values like software artifacts with a lifecycle. Just as engineering teams manage feature flags and API deprecations through explicit states, a UTM taxonomy benefits from formal status transitions. Each value moves through Draft, Approved, Active, and Archived — and only specific roles can move it between states. This is what stops taxonomy drift at the source: a new value cannot go live until a named owner approves it.

Lifecycle state
Draft
Who sets it · trigger
Any contributor · proposes a new value
What it means for reporting
Not yet usable. The value sits in the master UTM log awaiting review. No live campaigns may use it.
Lifecycle state
Approved
Who sets it · trigger
Taxonomy Guardian · review passes
What it means for reporting
Cleared for use but not yet live. The builder can now emit it. Confirms it maps to the intended GA4 channel.
Lifecycle state
Active
Who sets it · trigger
Auto · first live campaign uses it
What it means for reporting
In production and appearing in reports. The canonical, documented value for its entity going forward.
Lifecycle state
Deprecated / Archived
Who sets it · trigger
Taxonomy Guardian · retired or superseded
What it means for reporting
Historical only. Retained in reports for past data but blocked from new campaigns to prevent reuse drift.

The connective tissue is the master UTM log — a shared sheet whose columns track utm_campaign, utm_source, utm_medium, utm_content, utm_term, plus Objective, Owner, Launch Date, Status, and Approver. The governance rule is simple and non-negotiable: adding a new parameter value requires a named owner to approve it before use. That single approval gate is what separates a taxonomy that survives staff turnover from one that quietly decays.

09Rollout & QAShipping it, and keeping it clean.

A taxonomy is only as good as the QA that protects it. The decision tree below maps the three things that actually move the needle on data quality — and the one that does not. Tooling and process beat good intentions every time.

Tag at the source
Dynamic macros where possible

Let ad platforms auto-populate UTM values at click time via dynamic parameters (Google Ads {campaignid}, {keyword}, {adgroupid}; equivalents in Meta and Microsoft). Eliminates manual typos at scale — the largest single source of fragmentation.

Use platform macros
Lock the builder
Approved values only

Replace free-text UTM entry with a shared builder that offers only approved source/medium pairs from the master log. Most fragmentation is free-typing; remove the keyboard and you remove the error class.

No free-text fields
QA every link
Test before launch

Open each tagged link in an incognito session; verify Session source/medium and Session campaign populate in GA4 real-time; confirm channel alignment; check that redirects do not strip UTM params from the destination URL.

Incognito + real-time check
Documentation only
A wiki nobody enforces

A convention that lives only as a document, with no builder lock, no approval gate, and no audits, drifts back to inconsistency within a quarter. Necessary, but never sufficient on its own.

Insufficient alone

For teams ready to operationalize this end to end, the sequence is: start from the official Google Campaign URL Builder for one-off links, move high-volume tagging to platform macros, lock everything else behind a shared builder fed by the master log, and assign a Taxonomy Guardian to own the approval gate and quarterly audits. Once the inputs are clean, the downstream analytics work — whether that is exporting GA4 data to BigQuery for deeper analysis or weighing media mix modeling against attribution — actually produces trustworthy answers. This is exactly the governance layer our analytics & measurement engagements build before any attribution work begins.

10ConclusionGovern the inputs, trust the outputs.

The discipline behind clean attribution

Fix the data before you choose the model.

UTM governance is unglamorous, and that is precisely why it gets skipped — and precisely why fixing it produces an outsized return. Every attribution decision, every channel report, every media-mix comparison rests on the same foundation: the source, medium, and campaign recorded on each session. Govern those inputs and the downstream analytics finally tell you something true.

The mechanics are knowable. utm_medium is the keystone field that decides the channel. GA4 reads it through case-sensitive, regex-driven rules across 19 default channels, with a new AI Assistants channel as of May 2026. A canonical source/medium table removes guesswork, and one of three naming models gives campaigns a structure. None of that is hard to learn.

What is hard is keeping it consistent over time, across people, under real-world pressure. That is a governance problem, not a knowledge problem — and it is solved by a locked builder, a named Taxonomy Guardian, an approval gate for every new value, and a lifecycle that treats UTM values like the durable infrastructure they actually are. Documentation describes the standard; enforcement is what keeps it.

Build a measurement foundation that holds

Clean UTM inputs make every downstream model worth trusting.

Our team builds the measurement foundation first — locked UTM taxonomy, GA4 channel alignment, and the governance ownership that keeps attribution trustworthy — before layering on the analytics and modeling that depend on it.

Free consultationExpert guidanceTailored solutions
What we work on

Measurement & taxonomy engagements

  • UTM taxonomy design + a locked shared builder
  • GA4 default & custom channel-grouping alignment
  • Taxonomy Guardian role + approval-gate process
  • AI Assistants channel + emerging-source coverage
  • GA4 → BigQuery pipelines on clean source/medium data
FAQ · UTM governance

The questions we get every week.

A UTM governance framework is the combination of naming rules, ownership roles, and enforcement mechanisms that keeps campaign-tracking tags consistent across a marketing team. It typically includes a canonical list of approved utm_source and utm_medium values, a documented naming convention for utm_campaign, a master UTM log that records every value with an owner and status, a locked shared builder that only emits approved values, and a designated Taxonomy Guardian who reviews new values before launch. The point is not the documentation itself — it is the enforcement. A convention that lives only in a wiki drifts back to inconsistency within a quarter, so the framework's value comes from the builder lock, the approval gate, and periodic audits that catch drift early.