Claude Fable 5 returned to Pro, Max, Team, and select Enterprise plans on July 1, 2026 — with a deadline attached. Through July 7, the model is included for up to 50% of each plan’s weekly usage limits. From July 8, every Fable 5 token is billed through metered usage credits at standard API rates. That leaves six days, counted from today, in which the most capable coding model Anthropic has shipped is effectively bundled into a subscription you already pay for.

The stakes are concrete. At Fable 5’s listed API rates of $10 per million input tokens and $50 per million output tokens, the kind of heavy agentic session the model was built for — multi-hour autonomous runs, whole-repo migrations, deep architecture reviews — moves from “covered by my plan” to “a line item I approve.” Teams that sequence the window deliberately will bank weeks of high-leverage work on subscription tokens. Teams that treat this like a normal week will spend their included allowance on tasks a cheaper model could have done.

This guide covers what actually changed at the relaunch, how the 50% inclusion mechanic works, what the July 8 flip to usage credits means operationally, and — the part nobody else has published — a task-by-task sequencing matrix for deciding what deserves window tokens before Tuesday and what can wait for metered access.

Key takeaways

01
Fable 5 is back, but only briefly inside plan limits.Anthropic restored Fable 5 and Mythos 5 on July 1, 2026 after a 19-day export-control pause. Pro, Max, Team, and select Enterprise plans get Fable 5 for up to 50% of weekly usage limits — through July 7 only.
02
July 8 flips the model to metered usage credits.After the window, Fable 5 is billed via prepaid usage credits at standard API rates — $10/$50 per million input/output tokens — with a $2,000/day redemption limit and configurable monthly caps. No credits enabled means access simply stops.
03
Fable 5 drains the shared weekly pool faster than Opus.Anthropic-adjacent reporting notes Fable 5 draws down usage faster than Opus 4.8 for equivalent work. Don’t route everything through it just because it’s included — reserve it for tasks where its edge actually matters.
04
Bank heavy, durable work now; hand execution off later.Large migrations, architecture reviews, security sweeps, and durable plan documents are the highest-leverage uses of window tokens. Cheaper models can execute against those artifacts after July 7.
05
Cache and Batch discounts matter most after the flip.Cache reads run about 90% off and the Batch API is 50% off standard rates. Inside the window those savings are masked by plan tokens; from July 8 they hit your credit balance directly.

01 — The RelaunchWhat changed on July 1.

On Wednesday, July 1, 2026, Anthropic announced it was redeploying Claude Fable 5 and Claude Mythos 5 across the Claude Platform, Claude.ai, Claude Code, and Claude Cowork, with cloud-marketplace access on AWS, Google Cloud, and Microsoft Foundry to follow. The restore ended a 19-day pause — June 12 through June 30 — triggered by a reported jailbreak-bypass technique and resolved once U.S. export controls were lifted on June 30. We covered why Fable 5 disappeared for 19 days in the first place in depth when the pause hit.

The relaunch is not a simple rewind. Anthropic shipped a new safety classifier alongside the restore, which it says blocks the reported jailbreak technique in over 99% of cases — a vendor-stated figure, not an independently audited one. Requests the classifier flags are redirected to Opus 4.8 rather than answered by Fable 5, at what Anthropic estimated pre-incident would be under 5% of sessions. That fallback behavior has real implications for coding sessions, which we unpack in the coding tradeoffs of the new safety classifier.

The second change is the one this playbook exists for: the access terms. Fable 5 did not come back as a permanent subscription feature. It came back with an expiry date on plan inclusion — and a metering model waiting on the other side.

02 — Window MechanicsThe 50% window, decoded.

The core mechanic fits in one sentence of Anthropic’s announcement, and it rewards careful reading.

Anthropic, verbatim · July 1, 2026

“For Pro, Max, Team, and select Enterprise plans, Fable 5 will be included for up to 50% of weekly usage limits through July 7, after which it will be available via usage credits.” — from the official “Redeploying Claude Fable 5” announcement.

Three details matter. First, the 50% is relative — half of your plan’s weekly usage limit, whatever that is. Anthropic has not published exact per-plan weekly token counts for Pro, Max, or Team, so any absolute number you see quoted is invented. Second, the wording is “select Enterprise plans” — standard Enterprise seats get no inclusion window at all and are on usage credits from day one. Third, this cap is new. Hacker News commenters comparing notes against the original June 9 launch messaging concluded the 50% restriction wasn’t part of the first rollout.

The window is also shorter than what subscribers were originally promised. PCWorld reports the promotional access works out to half the originally promised two weeks: subscribers were told 14 days of included access when Fable 5 first launched on June 9, the export-control pause consumed most of that, and the makeup window on relaunch is seven days at up to half of weekly limits — of which six remain as of this writing, July 2 through July 7 inclusive.

One caveat before you plan the week: at least one user report on Hacker News suggests weekly usage counters were not reset at the relaunch, meaning usage consumed before the June 12 pause may still count against the current week. Anthropic has not confirmed or denied this — treat it as anecdotal, and check your own usage meter before committing to a heavy schedule.

The window

July 2–7, inclusive

6days

Seven days of plan inclusion from the July 1 relaunch; six remain from this post’s publish date. After July 7, plan inclusion ends per Anthropic’s announced schedule.

announced July 1, 2026

Inclusion cap

of weekly plan limits

50%

Relative to each plan’s existing weekly cap — Anthropic publishes no absolute per-plan token counts. Applies to Pro, Max, Team, and select Enterprise plans only.

Pro · Max · Team · select Enterprise

Daily credit cap

usage-credit redemption limit

$2,000

The post-window billing rail: prepaid credits at standard API rates, capped at $2,000 of redemption per day, with configurable monthly spend caps and optional auto-reload.

support.claude.com

03 — The FlipWhat actually changes on July 8.

From July 8, Fable 5 is not part of any subscription tier. It becomes a usage-based add-on: you enable usage credits under Settings > Usage in your Claude account, attach a payment method, set spending preferences, and every Fable 5 token is billed at standard API rates — $10 per million input tokens, $50 per million output tokens — as charges separate from your subscription. Credits come with guardrails: a $2,000 daily redemption limit, a configurable monthly spending cap (or unlimited, if you choose), usage alerts as you approach the cap, and an optional auto-reload that tops up the balance when it crosses a threshold you set.

The full mechanics — credit packs, plan-by-plan differences, and the billing edge cases — are in the full usage-credits pricing breakdown. The operational summary is the table below.

Side-by-side comparison of Claude Fable 5 access mechanics through July 7, 2026 versus from July 8, 2026, covering access model, token source, marginal cost, default behavior, spend controls, discounts, and safety classifier behavior.
Dimension	Through July 7 (in-window)	From July 8 (usage-credits era)
Access model	Included for up to 50% of weekly plan limits on Pro, Max, Team, and select Enterprise plans	Usage credits only — an opt-in, prepaid add-on outside every subscription tier
Where tokens come from	Your plan’s shared weekly usage pool (also feeding Opus and Sonnet sessions)	A prepaid credit balance, billed separately from the subscription
Marginal cost	Covered by the plan up to the 50% allowance; credits can extend beyond it	Standard API rates — $10 / $50 per million input / output tokens
If you do nothing	Fable 5 works inside the allowance, no setup required	Access stops — there is no automatic fallback to metered billing without opting in
Spend controls	Plan limits are the ceiling; nothing extra to configure	$2,000/day redemption cap, configurable monthly cap, usage alerts, optional auto-reload
Cache / Batch discounts	Apply, but savings are masked while plan tokens cover the bill	~90% off cache reads and 50% off via the Batch API hit your credit balance directly
Safety classifier	Flagged requests redirect to Opus 4.8 instead of Fable 5	Same redirect behavior — the classifier is independent of the billing change

Sources: Anthropic “Redeploying Claude Fable 5” (Jul 1, 2026); support.claude.com usage-credits documentation; Anthropic batch processing docs. Retrieved July 2, 2026.

The silent cliff

If usage credits are not enabled before the allowance runs out, Fable 5 access simply stops — mid-project, mid-week, whenever the pool empties. There is no automatic metered fallback. If any July work depends on Fable 5, enabling credits and setting a monthly cap now is cheap insurance, even if you never intend to spend a dollar.

04 — Burn RateThe wrinkle: Fable 5 drains the pool faster.

The under-covered detail in the relaunch coverage is a single line that changes the optimal strategy. TestingCatalog’s breaking-news summary of the new terms — corroborated verbatim by Hacker News commenters — ends with: “Fable 5 draws down usage faster than Opus 4.8.” The 50% inclusion shares a pool with everything else you run on the plan, and Fable 5 consumes that pool at a higher rate than Opus 4.8 does for equivalent work.

The naive strategy — route everything through Fable 5 this week because it’s included — is therefore exactly wrong. Every routine task you give Fable 5 burns allowance that a heavy migration or architecture review needed, and burns it faster than the same task would have on Opus. The window rewards discrimination, not enthusiasm. Our standing guidance on which model to reach for once the window closes applies inside the window too — just with the thresholds shifted.

Frontier-only work

Run on Fable 5 now

Multi-hour autonomous runs, whole-repo migrations, deep architecture and security reviews — tasks where Fable 5’s capability edge over Opus is the difference between done and not-done. These justify the faster pool drain.

Spend window tokens

Opus-class work

Keep on Opus 4.8

Solid multi-step engineering that Opus already handles well. Running it on Fable 5 buys little capability and drains the shared pool faster — preserving allowance is the point.

Preserve the pool

Bulk execution

Fan out to Sonnet

Implementation against an existing plan, test writing, mechanical refactors, doc updates. Cheaper models execute these fine — especially when a Fable 5-authored plan is steering.

Cheap execution

Post-window candidates

Defer to metered credits

Non-urgent heavy work that suits async processing. After July 8, cache reuse and the Batch API’s 50% discount make deferred metered runs meaningfully cheaper than live sessions.

Wait for July 8

05 — The PlaybookThe six-day sequencing matrix.

What does “bank the heavy work” mean concretely? The first Fable 5 window, June 9–12, produced a public record of what the model is uniquely good at. Anthropic’s launch announcement cited Stripe reporting that Fable 5 completed a code change across a 50-million-line Ruby codebase in one day — work estimated at roughly two months for a team. Community build roundups from those 72 hours catalogued the same pattern at smaller scale: one developer ran nine concurrent overnight workstreams without losing context; another ran a persistent 10–12 hour benchmark-harness session that reportedly consumed over 100 million tokens; a full design-validation loop for a robotics actuator compressed into about 30 minutes and roughly 400,000 tokens.

Those examples share a profile: high context, high autonomy, high token volume, and output that stays valuable after the session ends. That profile is what the window is for. The matrix below is our triage tool — no other coverage we’ve seen organizes the window this way. When we run this triage for clients, it happens inside our AI transformation engagements — the same matrix, applied to a real backlog.

Task types mapped to why Claude Fable 5 specifically, token intensity, whether to bank the task before July 7, and what to hand off to cheaper models afterward.
Task type	Why Fable 5 specifically	Token intensity	Window call	Post-window handoff
Bank before July 7 — window tokens earn their keep
Large multi-repo migrations	Sustained cross-file coherence over huge codebases — the Stripe-scale profile from the launch announcement	Heavy	Bank now	Cleanup, test coverage, and follow-up PRs to Sonnet
Architecture reviews & codebase audits	Whole-system reasoning where breadth of context determines the quality of the findings	Heavy	Bank now	Remediation implementation to Opus or Sonnet
Security sweeps	High-stakes, one-shot depth — a missed finding costs more than the tokens ever will	Medium–heavy	Bank now	Patch PRs and re-tests to Sonnet
Durable plan authoring (specs, PRDs, roadmaps)	The plan outlives the window — highest leverage per token of anything on this list	Light–medium	Bank now	Execution fanned out to Sonnet subagents
Deep research synthesis	Long-horizon multi-source reasoning benefits from the strongest model available	Medium	Bank if allowance permits	Follow-up queries to cheaper models against the synthesis
Fine to defer — don’t spend window tokens here
Routine feature work & bug fixes	No meaningful edge over Opus 4.8 or Sonnet for well-scoped tasks	Light	Defer	Stays on plan models indefinitely
Async, repeatable batch jobs	Tolerant of delay — and cheapest run through the Batch API’s 50% discount after the flip	Varies	Defer to metered	Batch API on usage credits from July 8

Digital Applied analysis, July 2, 2026. Task examples drawn from Anthropic’s June 9 launch announcement, community build roundups of the June 9–12 window, and the Hacker News relaunch thread.

Sequenced across the six days, the matrix becomes a simple three-phase plan:

Days 1–2 · Jul 2–3

Triage and bank the plans

backlog audit → durable plan docs

Check your actual remaining weekly allowance first. Then spend light tokens on the highest-leverage artifact class: specs, migration plans, and roadmaps that cheaper models will execute for months.

Light–medium burn

Days 3–5 · Jul 4–6

The heavy runs

migrations · reviews · sweeps

The long autonomous sessions that justify Fable 5 over everything else. Queue them back to back while the allowance is deepest, and keep routine work off the model entirely.

Heavy burn — the point of the window

Day 6 · Jul 7

Wrap and snapshot

handoff packets for cheaper models

Reserve margin for overruns. Close each workstream with a written state-of-play so Opus and Sonnet can pick up on July 8 without re-running Fable 5-priced context.

Reserve margin

06 — The PatternPlan now, execute later.

The single most reusable idea from the relaunch discussion isn’t Anthropic’s — it’s the community’s. Multiple Hacker News commenters independently described the same workflow: use the strongest available model to author a durable plan document, then hand execution to cheaper models running as subagents. One called the Opus-plans-Sonnet-implements split “great”; another described the fanout version directly:

"I've been using Opus to write a plan and fanout sonnet subagents to implement it. Cheaper and faster"— Congeec, Hacker News relaunch thread, July 1, 2026

The window supercharges this pattern. For six days, the planner seat is occupied by a strictly stronger model at zero marginal cost inside your allowance. A plan document authored by Fable 5 this week is an asset that keeps paying out after July 7 — every Sonnet subagent that executes against it inherits the quality of the plan without inheriting the price of the planner. The failure mode to avoid is the inverse: burning window allowance on execution-grade work while the planning-grade opportunities expire unused.

We’ve broken the full workflow — what makes a plan “durable,” how to structure handoff packets, and the prompts that produce executable plans rather than essays — into a dedicated companion piece on banking durable plans this week for cheaper models to execute later.

07 — Cost EngineeringThe cache and Batch math for after the flip.

Two standard discounts reshape the post-window economics, and both stack with usage-credit billing. Prompt caching cuts repeated context to roughly 90% off — about $1 per million tokens on cache reads against Fable 5’s $10 input rate. The Batch API prices asynchronous work at 50% off standard rates: $5 per million input tokens and $25 per million output tokens. Inside the window these savings are invisible, because plan tokens are covering the bill. From July 8 they are the difference between a metered workload you can justify and one you can’t.

Fable 5 input cost per million tokens · by billing path

Source: Anthropic Fable 5 pricing and Batch API documentation, retrieved July 2, 2026

Live interactive session$10 input / $50 output per Mtok · standard API rate

$10

Batch API (async)$5 input / $25 output per Mtok · 50% off standard

Cache read (repeated context)~$1 per Mtok effective · ~90% off input rate

~$1

One community observation worth stealing: an HN commenter asked whether coding harnesses could route planning-type, non-interactive work through the Batch API for the extra discount. That instinct is right — anything tolerant of async turnaround belongs on the cheapest rail, and structuring workloads to maximize cache hits (stable system prompts, reused repo context) compounds the saving. The worked numbers, harness configuration, and spend-cap setup are in the cache-and-batch math for after July 8.

08 — Sentiment & OutlookRelief, frustration, and what the window signals.

The subscriber reaction is genuinely split. The relief is real — one Hacker News commenter described canceling a Max plan and requesting a refund during the pause, then re-subscribing on July 1 the day Fable 5 returned. So is the frustration: subscribers were promised 14 days of included access at the June 9 launch, got about three before the pause, and returned to a shorter window at half of weekly limits.

"Not a good look to bring Fable back and then both half the usage and take away days."— Claude subscriber, quoted by PCWorld, July 1, 2026

On the post-window economics, one commenter’s back-of-envelope estimate — an individual guess, not a vendor or analyst figure — put heavy Fable 5 usage via credits at 10–20 times the price of a top-tier subscription for someone trying to sustain their in-window pace. Whatever the true multiple, the direction is right: from July 8, always-on Fable 5 usage is an enterprise-grade line item, not a hobbyist upgrade.

Our read of the trend: the window is less a promotion than a pricing experiment. Anthropic has a model that, by its own positioning and the community’s usage reports, is more expensive to serve and drains plan allowances faster than anything else in the lineup. A hard flip to pure metering — with a 50%-of-limits transition week as the offramp — lets Anthropic observe genuine willingness-to-pay before deciding whether Fable 5 ever fits inside a subscription tier again. One HN commenter argued market pressure will force exactly that outcome, speculating Anthropic may need “another pricing tier” for the model. That is plausible but unannounced; plan as if metering is permanent.

Looking forward, the sensible default for teams is a split portfolio: durable Fable 5 artifacts banked this week, Opus 4.8 as the everyday senior model, Sonnet for execution volume, and metered Fable 5 reserved for the workloads where its edge is provable. That calculus also depends on the competitive picture — how Fable 5 compares to the strongest alternative is the other half of the routing decision, and it’s why we treat model choice as a per-workload call rather than a vendor loyalty question.

09 — ConclusionSix days is enough — if you sequence.

The window, July 2–7, 2026

Bank the work that outlives the window.

The terms are what they are: up to 50% of weekly plan limits through July 7, metered usage credits at $10/$50 per million tokens from July 8, and no automatic fallback if credits aren’t enabled. You can spend the week relitigating the shortened promise, or you can spend it converting included tokens into assets that keep paying out after the meter starts.

The sequencing logic is simple. Fable 5’s edge is deepest on heavy, high-context, high-autonomy work — and it drains the shared pool faster than Opus, so everything else should stay off it. Bank the migrations, the architecture reviews, the security sweeps, and above all the durable plans. Then let cheaper models execute against those artifacts for months.

The meta-lesson from June applies too: a frontier model disappeared for 19 days with almost no notice, came back with different terms, and will change terms again. Teams that treat any single model as permanent infrastructure keep getting surprised. Teams that build routing, plan-banking, and second-source habits treat weeks like this one as arbitrage instead of crisis.

Fable 5 Before July 7: The Six-Day Window Playbook

01 — The RelaunchWhat changed on July 1.

02 — Window MechanicsThe 50% window, decoded.

July 2–7, inclusive

of weekly plan limits

usage-credit redemption limit

03 — The FlipWhat actually changes on July 8.

04 — Burn RateThe wrinkle: Fable 5 drains the pool faster.

Run on Fable 5 now

Keep on Opus 4.8

Fan out to Sonnet

Defer to metered credits

05 — The PlaybookThe six-day sequencing matrix.

Triage and bank the plans

The heavy runs

Wrap and snapshot

06 — The PatternPlan now, execute later.

07 — Cost EngineeringThe cache and Batch math for after the flip.

Fable 5 input cost per million tokens · by billing path

08 — Sentiment & OutlookRelief, frustration, and what the window signals.

09 — ConclusionSix days is enough — if you sequence.

Bank the work that outlives the window.

Six days of included Fable 5 can fund months of cheap execution.

Frontier-model operations

The questions teams are asking this week.

Continue exploring the Fable 5 playbooks.

Fable 5 as the Planner Brain in Hermes and OpenClaw

Fable 5 Cost Engineering: Cache, Batch and Spend Caps

10 Fable 5 Prompts to Upgrade Your AI Agent Setup 2026

Fable 5 ROI: Turning Output Tokens Into Revenue 2026

Sakana Fugu: A Multi-Agent AI Orchestration Model 2026

Do Not Single-Source Your AI: A Second-Source Playbook