DevelopmentCost Playbook12 min readPublished June 11, 2026

Token-metered billing · 1 credit = $0.01 · output-token spread is ~40x across the model menu

GitHub Copilot AI Credits Are Live: A Cost Playbook

On June 1, 2026, GitHub Copilot retired flat Premium Request Units for token-metered AI Credits — where 1 credit equals $0.01. Plan prices held, but included usage effectively shrank, and the safety-net fallback to cheaper models when quota runs out is gone. Here is what changed and a 5-step audit to keep your spend predictable.

DA
Digital Applied Team
Senior strategists · Published Jun 11, 2026
PublishedJun 11, 2026
Read time12 min
Sources7 primary + community
Billing model live
Jun 1
PRUs → AI Credits
Same price
Output-token spread
~40x
cheapest vs priciest model
Model choice = lever
Reported Day-1 burn
82%
of one user's monthly credits
User-reported
Community sentiment
958
downvotes vs 24 up
Discussion #192948

GitHub Copilot AI Credits went live on June 1, 2026, replacing the flat Premium Request Unit (PRU) model with token-metered billing where one AI Credit equals one US cent. Subscription prices did not move, but the unit of measurement did — and for power users running agentic workflows, that quietly turned a predictable monthly bill into a usage meter that can spike fast.

The reaction was immediate. Developers report 10x to 100x cost swings on heavy workloads; one editor watched 82% of a monthly allowance evaporate on the first day; a GitHub Community discussion on the change collected 958 downvotes against 24 upvotes. The anxiety is less about the headline price and more about predictability: under PRUs you knew roughly what a month cost, and under AI Credits you do not, unless you instrument it.

This playbook does the part most coverage skips — the arithmetic. We lay out exactly what changed, why the output-token rate is the single biggest cost lever you control, and a concrete five-step spend audit any team can run this week to cap exposure before the invoice lands. Every number traces to GitHub’s own documentation or named primary reporting; user-reported extremes are flagged as such.

Key takeaways
  1. 01
    PRUs became token-metered AI Credits on June 1, 2026.Usage is now billed against input, output, and cached tokens at each model's published API rate, where 1 AI Credit equals $0.01. Code completions and Next Edit Suggestions stay free on paid plans.
  2. 02
    Plan prices held; included usage effectively shrank.Per GitHub's announcement, Pro includes $10 in credits at $10/month and Pro+ includes $39 at $39/month. Because agentic sessions burn far more tokens than a chat question, the same dollar buys less real work.
  3. 03
    Model choice is worth more than upgrading your plan.GitHub's published per-model rates span roughly 40x on output tokens between the cheapest and most expensive options. Switching to an efficient model stretches your allowance further than buying a higher tier.
  4. 04
    The cheaper-model fallback is gone.Under PRUs, exhausting your quota downgraded you to a free model. Under AI Credits, running out is a hard wall unless an organization has pre-configured budget controls to allow overage at per-token rates.
  5. 05
    Run a five-step audit before you trust the new bill.Measure token burn, set a default model deliberately, enforce caps via the new four-level budget controls, separate free completions from metered agent runs, and compare Copilot's bundled value against direct API access.

01What ChangedFrom flat requests to a token meter.

The old system charged in Premium Request Units. A “premium request” was a discrete action — one chat question, one agent invocation — and each plan included a monthly bucket of them. The unit was coarse, but it was legible: you could count requests and roughly predict a month.

AI Credits replace that with metered token consumption. As of June 1, 2026, Copilot bills you against the input tokens, output tokens, and cached tokens each request consumes, priced at the published API rate for whichever model handled it. The conversion is deliberately simple — 1 AI Credit = $0.01 USD — so a model that costs $30 per million output tokens burns 3,000 credits for a million tokens of output. GitHub frames this as a usage-based model; the practical effect is that an autonomous, multi-hour coding session and a one-line chat question no longer cost the same.

GitHub’s stated rationale is that asymmetry. In its announcement, the company argued that a quick chat question and a sustained agentic run previously consumed the same PRU, and that this flat model “is no longer sustainable” as agent workloads grow. That reasoning is coherent. The friction is that the cost of agentic asymmetry has been moved onto the user’s meter without a price cut to absorb it.

What stays free
Code completions and Next Edit Suggestions do not consume AI Credits on any paid plan — the inline autocomplete most developers use all day is unmetered. Credits are burned by chat, agent runs, and Copilot code review. Note that Copilot code review now also consumes GitHub Actions minutes and AI Credits, adding a second billing dimension to track.
Old model
Premium Request Units
Flat requests · predictable

A fixed monthly bucket of premium requests per plan. One action cost one request regardless of how much compute it triggered. Coarse, but easy to forecast — and it fell back to a free model when the bucket ran dry.

Retired June 1, 2026
New model
AI Credits
Token-metered · 1 credit = $0.01

Billed against input, output, and cached tokens at each model's published rate. A chat question and a multi-hour agent session no longer cost the same. More accurate to true compute — and far harder to predict month to month.

Live since June 1, 2026

02Plans & CreditsThe plans, the allowances, and a new Max tier.

Subscription prices were not raised. According to GitHub’s announcement, each paid plan includes a monthly credit allowance denominated to match its price: Copilot Pro at $10/month includes $10 in AI Credits (1,000 credits), and Copilot Pro+ at $39/month includes $39 (3,900 credits). For organizations, Copilot Business is $19 per user per month with $19 in credits each, and Copilot Enterprise is $39 per user with $39 each — both pooled at the billing entity rather than locked to individual seats.

GitHub Copilot plans, monthly price, included AI Credits, and whether credits are pooled, as stated by GitHub (June 2026).
PlanMonthly priceIncluded AI CreditsPooled?Source basis
Copilot Pro$10$10 (1,000 credits)IndividualGitHub announcement
Copilot Pro+$39$39 (3,900 credits)IndividualGitHub announcement
Copilot Business$19/user$19/user (1,900, pooled)PooledGitHub announcement
Copilot Enterprise$39/user$39/user (3,900, pooled)PooledGitHub announcement
Copilot Max$100$200 (20,000) — as listed at launchIndividualPlans page (vendor-stated)

The notable addition is Copilot Max, a new $100/month tier launched alongside the billing change for “sustained, high-volume agent workflows.” As listed on the plans page at launch, Max includes $200/month in AI Credits — a 2-to-1 credit-to-price ratio that is unusual enough to verify directly before you rely on it, since GitHub has used promotional credit boosts elsewhere and the figure is vendor-stated rather than confirmed as a permanent structure.

Organizations get a genuine concession during the transition. Business and Enterprise customers receive promotional enhanced credits from June 1 to September 1, 2026 — 3,000 credits per user on Business and 7,000 per user on Enterprise, well above the standard 1,900 and 3,900. After September 1, 2026, those allowances revert to standard. If you are budgeting a team rollout, model the post-promo number, not the headline you see this summer.

Pooling matters for teams
Business and Enterprise credits are pooled at the billing-entity level, not held in per-seat buckets. An organization with 100 Business seats draws from a shared monthly pool — so light users’ unused credits offset heavy users automatically. That smooths variance across a team, which is the one structural advantage AI Credits hand to larger orgs over individual subscribers.

03The Cost LeverModel choice, not plan tier, is the real lever.

Here is the insight buried in GitHub’s pricing table that most coverage misses: the spread between the cheapest and most expensive model is enormous, and it is dominated by output tokens, not input. On GitHub’s published per-model rates, the cheapest option lists around $1.25 per million output tokens while the most expensive lists at $50 — roughly a 40x spread on the single rate that matters most for agentic work, because agents generate a lot of output.

Context windows get the headlines, but output rate is the dominant driver of a Copilot bill. An agent that reasons, writes code, and explains itself produces a large output stream on every step. The chart below compares published output-token rates across a sample of the Copilot model menu (per million tokens, as stated in GitHub’s docs). Model availability differs by plan, so treat this as the rate card rather than a guarantee that every model is selectable on every tier — verify what your plan exposes.

Output-token rates across the Copilot model menu · per 1M tokens

Source: GitHub Docs · Models & Pricing
Most expensive (output)Top of the menu · per 1M output tokens
$50
Premium frontier (output)Highest-end general models · per 1M
~$30
Mid-tier reasoning (output)Common agent defaults · per 1M
~$15
Efficient coding (output)Fine-tuned / flash models · per 1M
~$4.50
Cheapest (output)Nano-class models · per 1M output tokens
$1.25

Translate that into capacity. Using a third-party cost-per-task analysis from TokenMix as an illustration, a heavy agent iteration (roughly 250K input / 20K output) runs about $0.28 on an efficient fine-tuned model, around $1.05 on a mid-tier Claude Sonnet configuration, and about $1.85 on a top-end general model. At 1 credit = $0.01, those are roughly 28, 105, and 185 credits per heavy step. The proprietary table below converts that into how many such steps each plan’s included allowance buys — and it shows switching models matters more than switching tiers. These TokenMix figures are third-party estimates; verify against GitHub’s own pricing for your exact prompt shapes before you budget on them.

Estimated heavy agentic steps per month by model and Copilot plan, derived from third-party TokenMix cost-per-task estimates at 1 credit equals $0.01. Illustrative, not a GitHub guarantee.
Model class~Credits / heavy stepSteps on Pro ($10)Steps on Pro+ ($39)Steps on Business ($19)
Efficient fine-tuned (~$0.28)~28~35~140~67
Mid-tier Sonnet (~$1.05)~105~9~37~18
Top-end general (~$1.85)~185~5~21~10

Read that table the way a finance team would. On Pro+, running heavy steps on the efficient model buys roughly 140 sessions a month; running the same workload on the top-end model buys about 21. That is a ~6.7x capacity difference from a single dropdown — larger than the gap between any two adjacent plan tiers. The strategic takeaway for any team operating Copilot at scale is blunt: pick the model deliberately per task, and stop paying frontier output rates for work a cheaper model handles fine. If you want to compare those wrapped rates against calling the models directly, our LLM API pricing index and cost tracker keeps the underlying per-token rates side by side.

04The ReactionWhy developers are angry — and what is signal.

The backlash has been loud and specific. A GitHub Community discussion on the change collected 958 downvotes against 24 upvotes, with recurring complaints: no credit rollover month to month, the removal of the free-tier model fallback, the difficulty of predicting token costs in advance, and single agentic requests consuming hundreds of credits. Tellingly, GitHub’s own FAQ pre-answered the question of why anyone should stay after the change — a sign the company anticipated defection. Its answer was that Copilot “remains the best value and experience for agentic coding.”

The firsthand reports are where it gets concrete. David Ramel, writing in Visual Studio Magazine, documented his own first-day burn rate after the switch.

"I saw 1,227 of my allotted 1,500 free monthly credits basically eaten up on day 1, or about 82% of my allotted credits. If that pace continued, I was headed for a $180 bill for the month."— David Ramel, editor and writer, Visual Studio Magazine

Other reports are user-generated and unverified, but the pattern is consistent. One developer described cancelling on June 2 after a few questions on a codebase consumed 14% of the monthly quota. Another, in a widely-shared post, framed the math as going from “$80/m to (projected) $1,000/m.” A third, on a Hacker News thread, estimated the change as “80-100X more expensive” for their workload. We cite these explicitly as individual, self-reported experiences — they are not measured averages, and they should not be read as typical usage. What they do establish is that the variance is real and the worst case is steep.

The interpretation that matters: this is fundamentally a predictability problem, not purely a price problem. A flat $10 or $39 you can plan around. A meter that can 10x on a bad week is a budgeting hazard even if the average lands lower than the headlines. That is why the operational response — instrumentation and caps — matters more than the sticker reaction.

05The Hidden RiskThe fallback wall nobody mentions.

The most underreported change is buried in the changelog. Under the old PRU system, exhausting your premium requests did not stop you working — Copilot fell back to a lower-tier model so you could keep going on a degraded but usable experience. That safety net has been removed.

Under AI Credits, running out of credits is a hard wall. For organizations, GitHub provides budget controls that can either block further usage or allow overage billed at published per-token rates once the pool is depleted — but that behavior has to be configured. For an individual subscriber, or a team that has not set up budget controls, hitting the cap mid-task is an abrupt stop rather than a graceful downgrade. This is the operational risk to close first: a team that assumes the old fallback still exists can find a working day ended by a credit ceiling.

Credit conversion
1 AI Credit
$0.01

Every credit is one US cent of metered usage, priced at the handling model's published per-token rate. A $30/M output model burns 3,000 credits for a million tokens of output.

GitHub Docs
Output spread
Cheapest vs priciest
~40x

The published output-token rate ranges from roughly $1.25 to $50 per million across the model menu. Output rate, not context size, dominates the cost of an agentic step.

Model choice = lever
Budget controls
Levels of caps
4

User-level, cost-center (group), organization-level, and enterprise-wide spending limits are now generally available — the mechanism that replaces the lost free-model fallback for teams.

Configure before rollout

06The PlaybookThe five-step spend audit to run this week.

You do not need to cancel Copilot to control it. You need to instrument it. Here is the sequence we would run for any team that depends on Copilot for agentic work, ordered so each step makes the next one cheaper.

1. Measure your actual token burn

Before changing anything, get a baseline. Pull usage from the billing dashboard for a representative week and separate what is free (completions, Next Edit Suggestions) from what is metered (chat, agent runs, code review). You cannot manage a number you have not measured, and the free-versus-metered split alone often reveals that the perceived “Copilot bill” is concentrated in a handful of heavy agent sessions.

2. Set your default model deliberately

Most developers run whatever default a feature ships with, and the default differs by feature — chat, agent, and completion can each map to a different model. Given the ~40x output spread, this is the single highest-leverage change. Pick an efficient model as the day-to-day default and reserve the expensive frontier models for the tasks that genuinely need them.

3. Enforce caps with budget controls

For organizations, configure the four-level budget hierarchy — user, cost-center, organization, and enterprise limits — and decide explicitly whether hitting a cap should block usage or allow metered overage. This is the step that replaces the lost fallback: without it, an exhausted pool is a hard stop; with it, you control what happens at the ceiling.

4. Separate free work from metered work in your habits

Reinforce the boundary the new model rewards. Inline completions are free; lean on them. Treat agent invocations and code review as metered operations with a real per-step cost, and don’t fire a full agent run at a problem an autocomplete or a cheap chat turn solves. Small workflow discipline compounds across a team.

5. Compare Copilot’s bundled value against alternatives

Finally, run the breakeven. Compare the dollar value of your included credits against what the same usage would cost calling the models directly through tools like Cline, RooCode, or Continue.dev — and against rival assistants. The point is not to churn reflexively; it is to know your threshold, so a budget conversation is grounded in arithmetic rather than headlines.

Where this fits a broader stack
A Copilot audit rarely sits in isolation — it is one line in a wider AI-tooling budget. If you are rationalizing spend across coding assistants, model APIs, and agent infrastructure at once, our AI & digital transformation engagements start with exactly this kind of cost-and-capability audit before any tooling decision is made.

07The ContrastWindsurf went the other way.

The most instructive context for Copilot’s move is a near-mirror decision weeks earlier. On March 19, 2026, Windsurf replaced its opaque per-“Flow Action” credit system — where a single prompt could trigger an unpredictable number of background actions — with a flat one-prompt-credit model. That change was explicitly user-favorable: it reduced unpredictability and made spend easier to forecast.

GitHub is doing the inverse. It moved from predictable flat requests to an opaque token meter. One vendor concluded that predictability was the feature users valued and built toward it; the other concluded that metered accuracy was worth the predictability cost. Both can be defensible business decisions, but the contrast is sharp, and it tells you that “usage-based” is not automatically the direction the market is heading. For the full picture of how the assistant landscape now prices itself, our deeper look at Windsurf’s March 2026 credit repricing walks through the opposite trade-off, and our broader take on token-based versus outcome-based agent pricing explains why token billing feels punitive on agentic work in the first place.

"This is removing the one real advantage GHCP had over Claude Code et al."— Anonymous developer, GitHub Community FAQ thread

08The DecisionStay, cap, or switch.

The right move depends on how you actually use Copilot, not on the loudest take. Four profiles cover most teams.

Completion-heavy
You mostly use inline autocomplete

Code completions and Next Edit Suggestions stay free. If agent runs are occasional, the metered change barely touches you — stay on your current plan and don't overthink it.

Stay — no action needed
Agent-heavy individual
Heavy agentic workflows

This is where the meter bites. Set an efficient default model, treat agent runs as metered operations, and measure a baseline week before deciding whether Pro+ or a direct-API setup is cheaper for your pattern.

Audit, then default-tune
Organization
Business or Enterprise at scale

Pooled credits and the promo through September 1, 2026 buy you runway, but model the post-promo allowance now. Configure the four-level budget controls before September so the fallback wall never surprises a working day.

Cap before the promo ends
Cost-sensitive switcher
The bundled value no longer pencils out

If your breakeven analysis shows direct API access is materially cheaper, evaluate Cline, RooCode, or Continue.dev — or rival assistants. Weigh switching cost honestly; don't churn on a headline.

Compare, don't react

Looking forward, expect the model menu and the credit allowances to keep moving — this is an early, contested pricing design, not a settled one. New sign-ups across Copilot tiers were paused as of June 2026, with GitHub stating reopening is expected “in the coming weeks,” which suggests the company is still tuning the system under load. The durable lesson is independent of where GitHub lands: once an assistant is metered by tokens, your cost discipline has to live in your own instrumentation, defaults, and caps — not in the vendor’s pricing page. Teams that internalize that now will be calm whichever way the rates drift. If you are weighing Copilot against the field while the dust settles, our comparison of Copilot, Cursor, and Windsurf as alternatives lays out the switching math.

09ConclusionThe same price for less — unless you instrument it.

The shape of metered Copilot, June 2026

The bill is now a meter — manage it like one.

GitHub Copilot’s move to AI Credits is the clearest sign yet that AI coding assistants are being repriced around the true cost of agentic compute. The headline prices held, but the unit changed, and the same dollar now buys less real work for anyone running heavy agent sessions. The removal of the cheaper-model fallback turned a soft ceiling into a hard one.

The constructive read is that almost all of the risk is controllable. The output-token spread across the model menu is the biggest lever you have, and it is a dropdown, not a contract negotiation. A five-step audit — measure burn, set defaults, enforce caps, separate free from metered, and benchmark alternatives — converts an anxious unknown into a managed line item. Most of the people posting 100x screenshots have not yet done step two.

The broader signal is that “usage-based” is a direction, not a destination — Windsurf moved the opposite way the same quarter. Whatever your assistant charges, the durable discipline is the same: own your instrumentation, choose your models on purpose, and price your AI tooling on arithmetic rather than headlines. That is the habit that survives the next pricing change, and there will be a next one.

Get your AI tooling spend under control

Turn an unpredictable Copilot meter into a managed line item.

Our team audits AI-tooling spend, sets model-routing defaults, and builds cost-and-governance programs across coding assistants, model APIs, and agent infrastructure — so your bill is a managed line item, not a monthly surprise.

Free consultationExpert guidanceTailored solutions
What we work on

AI cost & governance engagements

  • Token-burn audits across Copilot, Cursor, and direct APIs
  • Model-routing defaults that cut output-token spend
  • Budget-control setup and overage governance for teams
  • Breakeven analysis: bundled credits vs direct API access
  • Vendor-mix strategy as assistant pricing keeps shifting
FAQ · Copilot AI Credits

The questions teams ask about the change.

AI Credits are GitHub Copilot's token-metered billing system, which went live on June 1, 2026, replacing the older Premium Request Unit (PRU) model. Instead of charging a flat bucket of requests, Copilot now bills against the input tokens, output tokens, and cached tokens each request consumes, priced at the published API rate for whichever model handled it. The conversion is fixed and simple: 1 AI Credit equals $0.01 USD. The practical effect is that a quick chat question and a long autonomous agent session no longer cost the same — usage now scales with the actual compute a request triggers, which makes heavy agentic workflows far more expensive than they were under the flat PRU model.