GitHub Copilot AI Credits went live on June 1, 2026, replacing the flat Premium Request Unit (PRU) model with token-metered billing where one AI Credit equals one US cent. Subscription prices did not move, but the unit of measurement did — and for power users running agentic workflows, that quietly turned a predictable monthly bill into a usage meter that can spike fast.
The reaction was immediate. Developers report 10x to 100x cost swings on heavy workloads; one editor watched 82% of a monthly allowance evaporate on the first day; a GitHub Community discussion on the change collected 958 downvotes against 24 upvotes. The anxiety is less about the headline price and more about predictability: under PRUs you knew roughly what a month cost, and under AI Credits you do not, unless you instrument it.
This playbook does the part most coverage skips — the arithmetic. We lay out exactly what changed, why the output-token rate is the single biggest cost lever you control, and a concrete five-step spend audit any team can run this week to cap exposure before the invoice lands. Every number traces to GitHub’s own documentation or named primary reporting; user-reported extremes are flagged as such.
- 01PRUs became token-metered AI Credits on June 1, 2026.Usage is now billed against input, output, and cached tokens at each model's published API rate, where 1 AI Credit equals $0.01. Code completions and Next Edit Suggestions stay free on paid plans.
- 02Plan prices held; included usage effectively shrank.Per GitHub's announcement, Pro includes $10 in credits at $10/month and Pro+ includes $39 at $39/month. Because agentic sessions burn far more tokens than a chat question, the same dollar buys less real work.
- 03Model choice is worth more than upgrading your plan.GitHub's published per-model rates span roughly 40x on output tokens between the cheapest and most expensive options. Switching to an efficient model stretches your allowance further than buying a higher tier.
- 04The cheaper-model fallback is gone.Under PRUs, exhausting your quota downgraded you to a free model. Under AI Credits, running out is a hard wall unless an organization has pre-configured budget controls to allow overage at per-token rates.
- 05Run a five-step audit before you trust the new bill.Measure token burn, set a default model deliberately, enforce caps via the new four-level budget controls, separate free completions from metered agent runs, and compare Copilot's bundled value against direct API access.
01 — What ChangedFrom flat requests to a token meter.
The old system charged in Premium Request Units. A “premium request” was a discrete action — one chat question, one agent invocation — and each plan included a monthly bucket of them. The unit was coarse, but it was legible: you could count requests and roughly predict a month.
AI Credits replace that with metered token consumption. As of June 1, 2026, Copilot bills you against the input tokens, output tokens, and cached tokens each request consumes, priced at the published API rate for whichever model handled it. The conversion is deliberately simple — 1 AI Credit = $0.01 USD — so a model that costs $30 per million output tokens burns 3,000 credits for a million tokens of output. GitHub frames this as a usage-based model; the practical effect is that an autonomous, multi-hour coding session and a one-line chat question no longer cost the same.
GitHub’s stated rationale is that asymmetry. In its announcement, the company argued that a quick chat question and a sustained agentic run previously consumed the same PRU, and that this flat model “is no longer sustainable” as agent workloads grow. That reasoning is coherent. The friction is that the cost of agentic asymmetry has been moved onto the user’s meter without a price cut to absorb it.
Premium Request Units
A fixed monthly bucket of premium requests per plan. One action cost one request regardless of how much compute it triggered. Coarse, but easy to forecast — and it fell back to a free model when the bucket ran dry.
AI Credits
Billed against input, output, and cached tokens at each model's published rate. A chat question and a multi-hour agent session no longer cost the same. More accurate to true compute — and far harder to predict month to month.
02 — Plans & CreditsThe plans, the allowances, and a new Max tier.
Subscription prices were not raised. According to GitHub’s announcement, each paid plan includes a monthly credit allowance denominated to match its price: Copilot Pro at $10/month includes $10 in AI Credits (1,000 credits), and Copilot Pro+ at $39/month includes $39 (3,900 credits). For organizations, Copilot Business is $19 per user per month with $19 in credits each, and Copilot Enterprise is $39 per user with $39 each — both pooled at the billing entity rather than locked to individual seats.
| Plan | Monthly price | Included AI Credits | Pooled? | Source basis |
|---|---|---|---|---|
| Copilot Pro | $10 | $10 (1,000 credits) | Individual | GitHub announcement |
| Copilot Pro+ | $39 | $39 (3,900 credits) | Individual | GitHub announcement |
| Copilot Business | $19/user | $19/user (1,900, pooled) | Pooled | GitHub announcement |
| Copilot Enterprise | $39/user | $39/user (3,900, pooled) | Pooled | GitHub announcement |
| Copilot Max | $100 | $200 (20,000) — as listed at launch | Individual | Plans page (vendor-stated) |
The notable addition is Copilot Max, a new $100/month tier launched alongside the billing change for “sustained, high-volume agent workflows.” As listed on the plans page at launch, Max includes $200/month in AI Credits — a 2-to-1 credit-to-price ratio that is unusual enough to verify directly before you rely on it, since GitHub has used promotional credit boosts elsewhere and the figure is vendor-stated rather than confirmed as a permanent structure.
Organizations get a genuine concession during the transition. Business and Enterprise customers receive promotional enhanced credits from June 1 to September 1, 2026 — 3,000 credits per user on Business and 7,000 per user on Enterprise, well above the standard 1,900 and 3,900. After September 1, 2026, those allowances revert to standard. If you are budgeting a team rollout, model the post-promo number, not the headline you see this summer.
03 — The Cost LeverModel choice, not plan tier, is the real lever.
Here is the insight buried in GitHub’s pricing table that most coverage misses: the spread between the cheapest and most expensive model is enormous, and it is dominated by output tokens, not input. On GitHub’s published per-model rates, the cheapest option lists around $1.25 per million output tokens while the most expensive lists at $50 — roughly a 40x spread on the single rate that matters most for agentic work, because agents generate a lot of output.
Context windows get the headlines, but output rate is the dominant driver of a Copilot bill. An agent that reasons, writes code, and explains itself produces a large output stream on every step. The chart below compares published output-token rates across a sample of the Copilot model menu (per million tokens, as stated in GitHub’s docs). Model availability differs by plan, so treat this as the rate card rather than a guarantee that every model is selectable on every tier — verify what your plan exposes.
Output-token rates across the Copilot model menu · per 1M tokens
Source: GitHub Docs · Models & PricingTranslate that into capacity. Using a third-party cost-per-task analysis from TokenMix as an illustration, a heavy agent iteration (roughly 250K input / 20K output) runs about $0.28 on an efficient fine-tuned model, around $1.05 on a mid-tier Claude Sonnet configuration, and about $1.85 on a top-end general model. At 1 credit = $0.01, those are roughly 28, 105, and 185 credits per heavy step. The proprietary table below converts that into how many such steps each plan’s included allowance buys — and it shows switching models matters more than switching tiers. These TokenMix figures are third-party estimates; verify against GitHub’s own pricing for your exact prompt shapes before you budget on them.
| Model class | ~Credits / heavy step | Steps on Pro ($10) | Steps on Pro+ ($39) | Steps on Business ($19) |
|---|---|---|---|---|
| Efficient fine-tuned (~$0.28) | ~28 | ~35 | ~140 | ~67 |
| Mid-tier Sonnet (~$1.05) | ~105 | ~9 | ~37 | ~18 |
| Top-end general (~$1.85) | ~185 | ~5 | ~21 | ~10 |
Read that table the way a finance team would. On Pro+, running heavy steps on the efficient model buys roughly 140 sessions a month; running the same workload on the top-end model buys about 21. That is a ~6.7x capacity difference from a single dropdown — larger than the gap between any two adjacent plan tiers. The strategic takeaway for any team operating Copilot at scale is blunt: pick the model deliberately per task, and stop paying frontier output rates for work a cheaper model handles fine. If you want to compare those wrapped rates against calling the models directly, our LLM API pricing index and cost tracker keeps the underlying per-token rates side by side.
04 — The ReactionWhy developers are angry — and what is signal.
The backlash has been loud and specific. A GitHub Community discussion on the change collected 958 downvotes against 24 upvotes, with recurring complaints: no credit rollover month to month, the removal of the free-tier model fallback, the difficulty of predicting token costs in advance, and single agentic requests consuming hundreds of credits. Tellingly, GitHub’s own FAQ pre-answered the question of why anyone should stay after the change — a sign the company anticipated defection. Its answer was that Copilot “remains the best value and experience for agentic coding.”
The firsthand reports are where it gets concrete. David Ramel, writing in Visual Studio Magazine, documented his own first-day burn rate after the switch.
"I saw 1,227 of my allotted 1,500 free monthly credits basically eaten up on day 1, or about 82% of my allotted credits. If that pace continued, I was headed for a $180 bill for the month."— David Ramel, editor and writer, Visual Studio Magazine
Other reports are user-generated and unverified, but the pattern is consistent. One developer described cancelling on June 2 after a few questions on a codebase consumed 14% of the monthly quota. Another, in a widely-shared post, framed the math as going from “$80/m to (projected) $1,000/m.” A third, on a Hacker News thread, estimated the change as “80-100X more expensive” for their workload. We cite these explicitly as individual, self-reported experiences — they are not measured averages, and they should not be read as typical usage. What they do establish is that the variance is real and the worst case is steep.
The interpretation that matters: this is fundamentally a predictability problem, not purely a price problem. A flat $10 or $39 you can plan around. A meter that can 10x on a bad week is a budgeting hazard even if the average lands lower than the headlines. That is why the operational response — instrumentation and caps — matters more than the sticker reaction.
05 — The Hidden RiskThe fallback wall nobody mentions.
The most underreported change is buried in the changelog. Under the old PRU system, exhausting your premium requests did not stop you working — Copilot fell back to a lower-tier model so you could keep going on a degraded but usable experience. That safety net has been removed.
Under AI Credits, running out of credits is a hard wall. For organizations, GitHub provides budget controls that can either block further usage or allow overage billed at published per-token rates once the pool is depleted — but that behavior has to be configured. For an individual subscriber, or a team that has not set up budget controls, hitting the cap mid-task is an abrupt stop rather than a graceful downgrade. This is the operational risk to close first: a team that assumes the old fallback still exists can find a working day ended by a credit ceiling.
1 AI Credit
Every credit is one US cent of metered usage, priced at the handling model's published per-token rate. A $30/M output model burns 3,000 credits for a million tokens of output.
Cheapest vs priciest
The published output-token rate ranges from roughly $1.25 to $50 per million across the model menu. Output rate, not context size, dominates the cost of an agentic step.
Levels of caps
User-level, cost-center (group), organization-level, and enterprise-wide spending limits are now generally available — the mechanism that replaces the lost free-model fallback for teams.
06 — The PlaybookThe five-step spend audit to run this week.
You do not need to cancel Copilot to control it. You need to instrument it. Here is the sequence we would run for any team that depends on Copilot for agentic work, ordered so each step makes the next one cheaper.
1. Measure your actual token burn
Before changing anything, get a baseline. Pull usage from the billing dashboard for a representative week and separate what is free (completions, Next Edit Suggestions) from what is metered (chat, agent runs, code review). You cannot manage a number you have not measured, and the free-versus-metered split alone often reveals that the perceived “Copilot bill” is concentrated in a handful of heavy agent sessions.
2. Set your default model deliberately
Most developers run whatever default a feature ships with, and the default differs by feature — chat, agent, and completion can each map to a different model. Given the ~40x output spread, this is the single highest-leverage change. Pick an efficient model as the day-to-day default and reserve the expensive frontier models for the tasks that genuinely need them.
3. Enforce caps with budget controls
For organizations, configure the four-level budget hierarchy — user, cost-center, organization, and enterprise limits — and decide explicitly whether hitting a cap should block usage or allow metered overage. This is the step that replaces the lost fallback: without it, an exhausted pool is a hard stop; with it, you control what happens at the ceiling.
4. Separate free work from metered work in your habits
Reinforce the boundary the new model rewards. Inline completions are free; lean on them. Treat agent invocations and code review as metered operations with a real per-step cost, and don’t fire a full agent run at a problem an autocomplete or a cheap chat turn solves. Small workflow discipline compounds across a team.
5. Compare Copilot’s bundled value against alternatives
Finally, run the breakeven. Compare the dollar value of your included credits against what the same usage would cost calling the models directly through tools like Cline, RooCode, or Continue.dev — and against rival assistants. The point is not to churn reflexively; it is to know your threshold, so a budget conversation is grounded in arithmetic rather than headlines.
07 — The ContrastWindsurf went the other way.
The most instructive context for Copilot’s move is a near-mirror decision weeks earlier. On March 19, 2026, Windsurf replaced its opaque per-“Flow Action” credit system — where a single prompt could trigger an unpredictable number of background actions — with a flat one-prompt-credit model. That change was explicitly user-favorable: it reduced unpredictability and made spend easier to forecast.
GitHub is doing the inverse. It moved from predictable flat requests to an opaque token meter. One vendor concluded that predictability was the feature users valued and built toward it; the other concluded that metered accuracy was worth the predictability cost. Both can be defensible business decisions, but the contrast is sharp, and it tells you that “usage-based” is not automatically the direction the market is heading. For the full picture of how the assistant landscape now prices itself, our deeper look at Windsurf’s March 2026 credit repricing walks through the opposite trade-off, and our broader take on token-based versus outcome-based agent pricing explains why token billing feels punitive on agentic work in the first place.
"This is removing the one real advantage GHCP had over Claude Code et al."— Anonymous developer, GitHub Community FAQ thread
08 — The DecisionStay, cap, or switch.
The right move depends on how you actually use Copilot, not on the loudest take. Four profiles cover most teams.
You mostly use inline autocomplete
Code completions and Next Edit Suggestions stay free. If agent runs are occasional, the metered change barely touches you — stay on your current plan and don't overthink it.
Heavy agentic workflows
This is where the meter bites. Set an efficient default model, treat agent runs as metered operations, and measure a baseline week before deciding whether Pro+ or a direct-API setup is cheaper for your pattern.
Business or Enterprise at scale
Pooled credits and the promo through September 1, 2026 buy you runway, but model the post-promo allowance now. Configure the four-level budget controls before September so the fallback wall never surprises a working day.
The bundled value no longer pencils out
If your breakeven analysis shows direct API access is materially cheaper, evaluate Cline, RooCode, or Continue.dev — or rival assistants. Weigh switching cost honestly; don't churn on a headline.
Looking forward, expect the model menu and the credit allowances to keep moving — this is an early, contested pricing design, not a settled one. New sign-ups across Copilot tiers were paused as of June 2026, with GitHub stating reopening is expected “in the coming weeks,” which suggests the company is still tuning the system under load. The durable lesson is independent of where GitHub lands: once an assistant is metered by tokens, your cost discipline has to live in your own instrumentation, defaults, and caps — not in the vendor’s pricing page. Teams that internalize that now will be calm whichever way the rates drift. If you are weighing Copilot against the field while the dust settles, our comparison of Copilot, Cursor, and Windsurf as alternatives lays out the switching math.
09 — ConclusionThe same price for less — unless you instrument it.
The bill is now a meter — manage it like one.
GitHub Copilot’s move to AI Credits is the clearest sign yet that AI coding assistants are being repriced around the true cost of agentic compute. The headline prices held, but the unit changed, and the same dollar now buys less real work for anyone running heavy agent sessions. The removal of the cheaper-model fallback turned a soft ceiling into a hard one.
The constructive read is that almost all of the risk is controllable. The output-token spread across the model menu is the biggest lever you have, and it is a dropdown, not a contract negotiation. A five-step audit — measure burn, set defaults, enforce caps, separate free from metered, and benchmark alternatives — converts an anxious unknown into a managed line item. Most of the people posting 100x screenshots have not yet done step two.
The broader signal is that “usage-based” is a direction, not a destination — Windsurf moved the opposite way the same quarter. Whatever your assistant charges, the durable discipline is the same: own your instrumentation, choose your models on purpose, and price your AI tooling on arithmetic rather than headlines. That is the habit that survives the next pricing change, and there will be a next one.