AI DevelopmentPlaybook12 min readPublished July 2, 2026

Six days of plan-included access · up to 50% of weekly limits · metered usage credits from July 8

Fable 5 Before July 7: The Six-Day Window Playbook

Claude Fable 5 came back on July 1, 2026 — but the terms changed. Pro, Max, Team, and select Enterprise plans get it inside plan limits only through July 7; from July 8 it moves to metered usage credits at standard API rates. This playbook turns six days of included access into a sequencing plan: what to bank now, what to defer, and what to hand to cheaper models.

DA
Digital Applied Team
Senior strategists · Published July 2, 2026
PublishedJuly 2, 2026
Read time12 min
Sources10 primary
Window remaining
6
days · July 2–7 inclusive
announced Jul 1
In-window inclusion
50%
of weekly plan limits, max
through Jul 7
Metered rate from Jul 8
$10/$50
per Mtok in / out
Daily credit cap
$2,000
usage-credit redemption

Claude Fable 5 returned to Pro, Max, Team, and select Enterprise plans on July 1, 2026 — with a deadline attached. Through July 7, the model is included for up to 50% of each plan’s weekly usage limits. From July 8, every Fable 5 token is billed through metered usage credits at standard API rates. That leaves six days, counted from today, in which the most capable coding model Anthropic has shipped is effectively bundled into a subscription you already pay for.

The stakes are concrete. At Fable 5’s listed API rates of $10 per million input tokens and $50 per million output tokens, the kind of heavy agentic session the model was built for — multi-hour autonomous runs, whole-repo migrations, deep architecture reviews — moves from “covered by my plan” to “a line item I approve.” Teams that sequence the window deliberately will bank weeks of high-leverage work on subscription tokens. Teams that treat this like a normal week will spend their included allowance on tasks a cheaper model could have done.

This guide covers what actually changed at the relaunch, how the 50% inclusion mechanic works, what the July 8 flip to usage credits means operationally, and — the part nobody else has published — a task-by-task sequencing matrix for deciding what deserves window tokens before Tuesday and what can wait for metered access.

Key takeaways
  1. 01
    Fable 5 is back, but only briefly inside plan limits.Anthropic restored Fable 5 and Mythos 5 on July 1, 2026 after a 19-day export-control pause. Pro, Max, Team, and select Enterprise plans get Fable 5 for up to 50% of weekly usage limits — through July 7 only.
  2. 02
    July 8 flips the model to metered usage credits.After the window, Fable 5 is billed via prepaid usage credits at standard API rates — $10/$50 per million input/output tokens — with a $2,000/day redemption limit and configurable monthly caps. No credits enabled means access simply stops.
  3. 03
    Fable 5 drains the shared weekly pool faster than Opus.Anthropic-adjacent reporting notes Fable 5 draws down usage faster than Opus 4.8 for equivalent work. Don’t route everything through it just because it’s included — reserve it for tasks where its edge actually matters.
  4. 04
    Bank heavy, durable work now; hand execution off later.Large migrations, architecture reviews, security sweeps, and durable plan documents are the highest-leverage uses of window tokens. Cheaper models can execute against those artifacts after July 7.
  5. 05
    Cache and Batch discounts matter most after the flip.Cache reads run about 90% off and the Batch API is 50% off standard rates. Inside the window those savings are masked by plan tokens; from July 8 they hit your credit balance directly.

01The RelaunchWhat changed on July 1.

On Wednesday, July 1, 2026, Anthropic announced it was redeploying Claude Fable 5 and Claude Mythos 5 across the Claude Platform, Claude.ai, Claude Code, and Claude Cowork, with cloud-marketplace access on AWS, Google Cloud, and Microsoft Foundry to follow. The restore ended a 19-day pause — June 12 through June 30 — triggered by a reported jailbreak-bypass technique and resolved once U.S. export controls were lifted on June 30. We covered why Fable 5 disappeared for 19 days in the first place in depth when the pause hit.

The relaunch is not a simple rewind. Anthropic shipped a new safety classifier alongside the restore, which it says blocks the reported jailbreak technique in over 99% of cases — a vendor-stated figure, not an independently audited one. Requests the classifier flags are redirected to Opus 4.8 rather than answered by Fable 5, at what Anthropic estimated pre-incident would be under 5% of sessions. That fallback behavior has real implications for coding sessions, which we unpack in the coding tradeoffs of the new safety classifier.

The second change is the one this playbook exists for: the access terms. Fable 5 did not come back as a permanent subscription feature. It came back with an expiry date on plan inclusion — and a metering model waiting on the other side.

02Window MechanicsThe 50% window, decoded.

The core mechanic fits in one sentence of Anthropic’s announcement, and it rewards careful reading.

Anthropic, verbatim · July 1, 2026
“For Pro, Max, Team, and select Enterprise plans, Fable 5 will be included for up to 50% of weekly usage limits through July 7, after which it will be available via usage credits.” — from the official “Redeploying Claude Fable 5” announcement.

Three details matter. First, the 50% is relative — half of your plan’s weekly usage limit, whatever that is. Anthropic has not published exact per-plan weekly token counts for Pro, Max, or Team, so any absolute number you see quoted is invented. Second, the wording is “select Enterprise plans” — standard Enterprise seats get no inclusion window at all and are on usage credits from day one. Third, this cap is new. Hacker News commenters comparing notes against the original June 9 launch messaging concluded the 50% restriction wasn’t part of the first rollout.

The window is also shorter than what subscribers were originally promised. PCWorld reports the promotional access works out to half the originally promised two weeks: subscribers were told 14 days of included access when Fable 5 first launched on June 9, the export-control pause consumed most of that, and the makeup window on relaunch is seven days at up to half of weekly limits — of which six remain as of this writing, July 2 through July 7 inclusive.

One caveat before you plan the week: at least one user report on Hacker News suggests weekly usage counters were not reset at the relaunch, meaning usage consumed before the June 12 pause may still count against the current week. Anthropic has not confirmed or denied this — treat it as anecdotal, and check your own usage meter before committing to a heavy schedule.

The window
July 2–7, inclusive
6days

Seven days of plan inclusion from the July 1 relaunch; six remain from this post’s publish date. After July 7, plan inclusion ends per Anthropic’s announced schedule.

announced July 1, 2026
Inclusion cap
of weekly plan limits
50%

Relative to each plan’s existing weekly cap — Anthropic publishes no absolute per-plan token counts. Applies to Pro, Max, Team, and select Enterprise plans only.

Pro · Max · Team · select Enterprise
Daily credit cap
usage-credit redemption limit
$2,000

The post-window billing rail: prepaid credits at standard API rates, capped at $2,000 of redemption per day, with configurable monthly spend caps and optional auto-reload.

support.claude.com

03The FlipWhat actually changes on July 8.

From July 8, Fable 5 is not part of any subscription tier. It becomes a usage-based add-on: you enable usage credits under Settings > Usage in your Claude account, attach a payment method, set spending preferences, and every Fable 5 token is billed at standard API rates — $10 per million input tokens, $50 per million output tokens — as charges separate from your subscription. Credits come with guardrails: a $2,000 daily redemption limit, a configurable monthly spending cap (or unlimited, if you choose), usage alerts as you approach the cap, and an optional auto-reload that tops up the balance when it crosses a threshold you set.

The full mechanics — credit packs, plan-by-plan differences, and the billing edge cases — are in the full usage-credits pricing breakdown. The operational summary is the table below.

Side-by-side comparison of Claude Fable 5 access mechanics through July 7, 2026 versus from July 8, 2026, covering access model, token source, marginal cost, default behavior, spend controls, discounts, and safety classifier behavior.
DimensionThrough July 7 (in-window)From July 8 (usage-credits era)
Access modelIncluded for up to 50% of weekly plan limits on Pro, Max, Team, and select Enterprise plansUsage credits only — an opt-in, prepaid add-on outside every subscription tier
Where tokens come fromYour plan’s shared weekly usage pool (also feeding Opus and Sonnet sessions)A prepaid credit balance, billed separately from the subscription
Marginal costCovered by the plan up to the 50% allowance; credits can extend beyond itStandard API rates — $10 / $50 per million input / output tokens
If you do nothingFable 5 works inside the allowance, no setup requiredAccess stops — there is no automatic fallback to metered billing without opting in
Spend controlsPlan limits are the ceiling; nothing extra to configure$2,000/day redemption cap, configurable monthly cap, usage alerts, optional auto-reload
Cache / Batch discountsApply, but savings are masked while plan tokens cover the bill~90% off cache reads and 50% off via the Batch API hit your credit balance directly
Safety classifierFlagged requests redirect to Opus 4.8 instead of Fable 5Same redirect behavior — the classifier is independent of the billing change
Sources: Anthropic “Redeploying Claude Fable 5” (Jul 1, 2026); support.claude.com usage-credits documentation; Anthropic batch processing docs. Retrieved July 2, 2026.
The silent cliff
If usage credits are not enabled before the allowance runs out, Fable 5 access simply stops — mid-project, mid-week, whenever the pool empties. There is no automatic metered fallback. If any July work depends on Fable 5, enabling credits and setting a monthly cap now is cheap insurance, even if you never intend to spend a dollar.

04Burn RateThe wrinkle: Fable 5 drains the pool faster.

The under-covered detail in the relaunch coverage is a single line that changes the optimal strategy. TestingCatalog’s breaking-news summary of the new terms — corroborated verbatim by Hacker News commenters — ends with: “Fable 5 draws down usage faster than Opus 4.8.” The 50% inclusion shares a pool with everything else you run on the plan, and Fable 5 consumes that pool at a higher rate than Opus 4.8 does for equivalent work.

The naive strategy — route everything through Fable 5 this week because it’s included — is therefore exactly wrong. Every routine task you give Fable 5 burns allowance that a heavy migration or architecture review needed, and burns it faster than the same task would have on Opus. The window rewards discrimination, not enthusiasm. Our standing guidance on which model to reach for once the window closes applies inside the window too — just with the thresholds shifted.

Frontier-only work
Run on Fable 5 now

Multi-hour autonomous runs, whole-repo migrations, deep architecture and security reviews — tasks where Fable 5’s capability edge over Opus is the difference between done and not-done. These justify the faster pool drain.

Spend window tokens
Opus-class work
Keep on Opus 4.8

Solid multi-step engineering that Opus already handles well. Running it on Fable 5 buys little capability and drains the shared pool faster — preserving allowance is the point.

Preserve the pool
Bulk execution
Fan out to Sonnet

Implementation against an existing plan, test writing, mechanical refactors, doc updates. Cheaper models execute these fine — especially when a Fable 5-authored plan is steering.

Cheap execution
Post-window candidates
Defer to metered credits

Non-urgent heavy work that suits async processing. After July 8, cache reuse and the Batch API’s 50% discount make deferred metered runs meaningfully cheaper than live sessions.

Wait for July 8

05The PlaybookThe six-day sequencing matrix.

What does “bank the heavy work” mean concretely? The first Fable 5 window, June 9–12, produced a public record of what the model is uniquely good at. Anthropic’s launch announcement cited Stripe reporting that Fable 5 completed a code change across a 50-million-line Ruby codebase in one day — work estimated at roughly two months for a team. Community build roundups from those 72 hours catalogued the same pattern at smaller scale: one developer ran nine concurrent overnight workstreams without losing context; another ran a persistent 10–12 hour benchmark-harness session that reportedly consumed over 100 million tokens; a full design-validation loop for a robotics actuator compressed into about 30 minutes and roughly 400,000 tokens.

Those examples share a profile: high context, high autonomy, high token volume, and output that stays valuable after the session ends. That profile is what the window is for. The matrix below is our triage tool — no other coverage we’ve seen organizes the window this way. When we run this triage for clients, it happens inside our AI transformation engagements — the same matrix, applied to a real backlog.

Task types mapped to why Claude Fable 5 specifically, token intensity, whether to bank the task before July 7, and what to hand off to cheaper models afterward.
Task typeWhy Fable 5 specificallyToken intensityWindow callPost-window handoff
Bank before July 7 — window tokens earn their keep
Large multi-repo migrationsSustained cross-file coherence over huge codebases — the Stripe-scale profile from the launch announcementHeavyBank nowCleanup, test coverage, and follow-up PRs to Sonnet
Architecture reviews & codebase auditsWhole-system reasoning where breadth of context determines the quality of the findingsHeavyBank nowRemediation implementation to Opus or Sonnet
Security sweepsHigh-stakes, one-shot depth — a missed finding costs more than the tokens ever willMedium–heavyBank nowPatch PRs and re-tests to Sonnet
Durable plan authoring (specs, PRDs, roadmaps)The plan outlives the window — highest leverage per token of anything on this listLight–mediumBank nowExecution fanned out to Sonnet subagents
Deep research synthesisLong-horizon multi-source reasoning benefits from the strongest model availableMediumBank if allowance permitsFollow-up queries to cheaper models against the synthesis
Fine to defer — don’t spend window tokens here
Routine feature work & bug fixesNo meaningful edge over Opus 4.8 or Sonnet for well-scoped tasksLightDeferStays on plan models indefinitely
Async, repeatable batch jobsTolerant of delay — and cheapest run through the Batch API’s 50% discount after the flipVariesDefer to meteredBatch API on usage credits from July 8
Digital Applied analysis, July 2, 2026. Task examples drawn from Anthropic’s June 9 launch announcement, community build roundups of the June 9–12 window, and the Hacker News relaunch thread.

Sequenced across the six days, the matrix becomes a simple three-phase plan:

Days 1–2 · Jul 2–3
Triage and bank the plans
backlog audit → durable plan docs

Check your actual remaining weekly allowance first. Then spend light tokens on the highest-leverage artifact class: specs, migration plans, and roadmaps that cheaper models will execute for months.

Light–medium burn
Days 3–5 · Jul 4–6
The heavy runs
migrations · reviews · sweeps

The long autonomous sessions that justify Fable 5 over everything else. Queue them back to back while the allowance is deepest, and keep routine work off the model entirely.

Heavy burn — the point of the window
Day 6 · Jul 7
Wrap and snapshot
handoff packets for cheaper models

Reserve margin for overruns. Close each workstream with a written state-of-play so Opus and Sonnet can pick up on July 8 without re-running Fable 5-priced context.

Reserve margin

06The PatternPlan now, execute later.

The single most reusable idea from the relaunch discussion isn’t Anthropic’s — it’s the community’s. Multiple Hacker News commenters independently described the same workflow: use the strongest available model to author a durable plan document, then hand execution to cheaper models running as subagents. One called the Opus-plans-Sonnet-implements split “great”; another described the fanout version directly:

"I've been using Opus to write a plan and fanout sonnet subagents to implement it. Cheaper and faster"— Congeec, Hacker News relaunch thread, July 1, 2026

The window supercharges this pattern. For six days, the planner seat is occupied by a strictly stronger model at zero marginal cost inside your allowance. A plan document authored by Fable 5 this week is an asset that keeps paying out after July 7 — every Sonnet subagent that executes against it inherits the quality of the plan without inheriting the price of the planner. The failure mode to avoid is the inverse: burning window allowance on execution-grade work while the planning-grade opportunities expire unused.

We’ve broken the full workflow — what makes a plan “durable,” how to structure handoff packets, and the prompts that produce executable plans rather than essays — into a dedicated companion piece on banking durable plans this week for cheaper models to execute later.

07Cost EngineeringThe cache and Batch math for after the flip.

Two standard discounts reshape the post-window economics, and both stack with usage-credit billing. Prompt caching cuts repeated context to roughly 90% off — about $1 per million tokens on cache reads against Fable 5’s $10 input rate. The Batch API prices asynchronous work at 50% off standard rates: $5 per million input tokens and $25 per million output tokens. Inside the window these savings are invisible, because plan tokens are covering the bill. From July 8 they are the difference between a metered workload you can justify and one you can’t.

Fable 5 input cost per million tokens · by billing path

Source: Anthropic Fable 5 pricing and Batch API documentation, retrieved July 2, 2026
Live interactive session$10 input / $50 output per Mtok · standard API rate
$10
Batch API (async)$5 input / $25 output per Mtok · 50% off standard
$5
Cache read (repeated context)~$1 per Mtok effective · ~90% off input rate
~$1

One community observation worth stealing: an HN commenter asked whether coding harnesses could route planning-type, non-interactive work through the Batch API for the extra discount. That instinct is right — anything tolerant of async turnaround belongs on the cheapest rail, and structuring workloads to maximize cache hits (stable system prompts, reused repo context) compounds the saving. The worked numbers, harness configuration, and spend-cap setup are in the cache-and-batch math for after July 8.

08Sentiment & OutlookRelief, frustration, and what the window signals.

The subscriber reaction is genuinely split. The relief is real — one Hacker News commenter described canceling a Max plan and requesting a refund during the pause, then re-subscribing on July 1 the day Fable 5 returned. So is the frustration: subscribers were promised 14 days of included access at the June 9 launch, got about three before the pause, and returned to a shorter window at half of weekly limits.

"Not a good look to bring Fable back and then both half the usage and take away days."— Claude subscriber, quoted by PCWorld, July 1, 2026

On the post-window economics, one commenter’s back-of-envelope estimate — an individual guess, not a vendor or analyst figure — put heavy Fable 5 usage via credits at 10–20 times the price of a top-tier subscription for someone trying to sustain their in-window pace. Whatever the true multiple, the direction is right: from July 8, always-on Fable 5 usage is an enterprise-grade line item, not a hobbyist upgrade.

Our read of the trend: the window is less a promotion than a pricing experiment. Anthropic has a model that, by its own positioning and the community’s usage reports, is more expensive to serve and drains plan allowances faster than anything else in the lineup. A hard flip to pure metering — with a 50%-of-limits transition week as the offramp — lets Anthropic observe genuine willingness-to-pay before deciding whether Fable 5 ever fits inside a subscription tier again. One HN commenter argued market pressure will force exactly that outcome, speculating Anthropic may need “another pricing tier” for the model. That is plausible but unannounced; plan as if metering is permanent.

Looking forward, the sensible default for teams is a split portfolio: durable Fable 5 artifacts banked this week, Opus 4.8 as the everyday senior model, Sonnet for execution volume, and metered Fable 5 reserved for the workloads where its edge is provable. That calculus also depends on the competitive picture — how Fable 5 compares to the strongest alternative is the other half of the routing decision, and it’s why we treat model choice as a per-workload call rather than a vendor loyalty question.

09ConclusionSix days is enough — if you sequence.

The window, July 2–7, 2026

Bank the work that outlives the window.

The terms are what they are: up to 50% of weekly plan limits through July 7, metered usage credits at $10/$50 per million tokens from July 8, and no automatic fallback if credits aren’t enabled. You can spend the week relitigating the shortened promise, or you can spend it converting included tokens into assets that keep paying out after the meter starts.

The sequencing logic is simple. Fable 5’s edge is deepest on heavy, high-context, high-autonomy work — and it drains the shared pool faster than Opus, so everything else should stay off it. Bank the migrations, the architecture reviews, the security sweeps, and above all the durable plans. Then let cheaper models execute against those artifacts for months.

The meta-lesson from June applies too: a frontier model disappeared for 19 days with almost no notice, came back with different terms, and will change terms again. Teams that treat any single model as permanent infrastructure keep getting surprised. Teams that build routing, plan-banking, and second-source habits treat weeks like this one as arbitrage instead of crisis.

Put the window to work

Six days of included Fable 5 can fund months of cheap execution.

Our team builds model-routing, plan-banking, and cost-engineering workflows around frontier models — so pricing changes like July 8 become a config update, not a fire drill.

Free consultationExpert guidanceTailored solutions
What we work on

Frontier-model operations

  • Backlog triage for limited-access model windows
  • Plan-banking workflows — frontier plans, cheap execution
  • Usage-credit spend caps, cache & Batch cost engineering
  • Multi-model routing — Fable 5 / Opus / Sonnet / GPT-5.5
  • Vendor-resilience and second-source playbooks
FAQ · The six-day window

The questions teams are asking this week.

July 7 is the last day Fable 5 is included in subscription plans, per Anthropic’s July 1 relaunch announcement. Through that date, Pro, Max, Team, and select Enterprise plans can use Fable 5 for up to 50% of their weekly usage limits at no extra cost. From July 8, the model is no longer part of any subscription tier — access runs exclusively through prepaid usage credits billed at standard API rates. Nothing about the model itself changes at the cutoff; only the billing rail does. The date is an announced schedule, so treat it as firm but verify against Anthropic’s official channels if you’re committing significant budget around it.
Related dispatches

Continue exploring the Fable 5 playbooks.

AI Development

Fable 5 as the Planner Brain in Hermes and OpenClaw

Run Claude Fable 5 as the planner and route execution to cheaper models in Hermes and OpenClaw. The exact config keys, cost math, and marketplace hardening.

July 2, 2026 · 14 minRead
AI Development

Fable 5 Cost Engineering: Cache, Batch and Spend Caps

Claude Fable 5 lists at $10/$50 per million tokens and meters after July 7. How prompt caching, the Batch API, and spend caps cut the bill — with worked math.

July 2, 2026 · 12 minRead
Development

10 Fable 5 Prompts to Upgrade Your AI Agent Setup 2026

Ten one-shot audit and upgrade jobs for Claude Fable 5, each drawn from Anthropic's own prompting guide, with the cheaper model that should execute the plan.

July 2, 2026 · 14 minRead
Business

Fable 5 ROI: Turning Output Tokens Into Revenue 2026

Claude Fable 5 costs $10/$50 per million tokens, double Opus 4.8. How individuals and small firms recoup that metered spend — with the honest counterexamples.

July 2, 2026 · 12 minRead
AI Development

Sakana Fugu: A Multi-Agent AI Orchestration Model 2026

Sakana Fugu wraps a pool of frontier models behind one orchestration API. We cover the two models, vendor benchmarks, pricing, and the export-control angle.

June 22, 2026 · 11 minRead
AI Development

Do Not Single-Source Your AI: A Second-Source Playbook

The Fable 5 export shutdown showed single-vendor AI can halt your business overnight. A four-step second-source playbook with open-weight failover backups.

June 21, 2026 · 11 minRead