DevelopmentNew Release10 min readPublished July 1, 2026

Vendor-stated 1.5T params · private beta only · zero independent benchmarks

Grok 4.5 and the 1.5T V9 Model: SpaceX’s Closing Flywheel

On June 28, 2026, Elon Musk said Grok 4.5 — built on xAI’s 1.5T V9 foundation model, with Cursor data added in supplemental training — had entered private beta at SpaceX and Tesla. His “close to, perhaps exceeding Opus” line is a self-eval with no independent benchmark. The real story is the flywheel: one owner now holds the compute, the model, and the coding tool that feeds it.

DA
Digital Applied Team
Senior strategists · Published July 1, 2026
PublishedJuly 1, 2026
Read time10 min
SourcesMusk X post + press
V9 foundation model
1.5T
params · xAI-stated
self-reported
vs production v8-small
≈3×
bigger than today’s ~500B
Independent benchmarks
0
third-party evals of Grok 4.5
none published
Public availability
Beta
SpaceX & Tesla only
no public API

On June 28, 2026, Elon Musk announced that Grok 4.5 — xAI’s newest model, built on the 1.5-trillion-parameter V9 foundation model with Cursor’s developer data added in supplemental training — had entered private beta at SpaceX and Tesla. The name and the number both come straight from Musk. What makes this more than a routine model drop is what sits behind it: a single owner now controls the compute, the model, and the coding tool that generates the model’s training data.

Musk’s post also carried a headline claim — that early evaluations show Grok 4.5 performing “close to, perhaps exceeding Opus.” It is worth being precise about what that is and is not. It is a self-evaluation, run by SpaceX and Tesla engineers who now work under the same corporate roof as xAI. No independent benchmark — LMArena, Artificial Analysis, SWE-bench, Humanity’s Last Exam — has scored Grok 4.5, and there is no public API to run one on. Treat the comparison as vendor-stated, not settled.

This guide maps the flywheel Musk is building: how the February 2026 SpaceX–xAI merger, the June 16 Cursor acquisition, and the June 28 Grok 4.5 beta connect into one vertically integrated loop, why “supplemental training” is a bigger caveat than it sounds, and — most usefully for your own team — what to watch for before you standardize on a coding tool that a model vendor now owns.

Key takeaways
  1. 01
    Grok 4.5 runs on xAI’s 1.5T V9 foundation model.Musk announced it on June 28, 2026. V9 — self-reported at roughly 1.5 trillion parameters — is about 3× the ~500B v8-small architecture that powers today’s public Grok. No independent audit of the count exists.
  2. 02
    Cursor’s data was added in supplemental training, not from scratch.An xAI engineer conceded that supplemental inclusion is “not quite as good as having it in initial training.” xAI says the next model in training is being built to bake Cursor data in from the start of pre-training.
  3. 03
    The “close to, perhaps exceeding Opus” claim is a vendor self-eval.The only evaluators are SpaceX and Tesla engineers — employees of the same parent company as xAI. No independent benchmark (LMArena, Artificial Analysis, SWE-bench, HLE) has scored Grok 4.5. Treat it as one tweet, not a scoreboard.
  4. 04
    One owner now holds every link in the chain.Compute (the Colossus supercluster), the foundation model (Grok/V9), and — once the $60B Cursor deal closes — the coding tool that generates the fine-tuning data. That closed loop is the flywheel.
  5. 05
    The model being talked up is not the one you can buy.Grok 4.5 is a private beta at SpaceX and Tesla only. Paying developers still use Grok 4.3 on v8-small. Cursor’s model-agnostic posture is unchanged — for now. The lock-in risk is structural, not yet visible in the product.

01AnnouncementWhat Musk actually said on June 28.

The announcement was a single post on X — not a technical report, not a benchmark submission, not a model card. In four sentences it named the model, the foundation it runs on, the training method, and a roadmap. Here it is, lightly trimmed:

“Grok 4.5, based on our 1.5T V9 foundation model, with Cursor data added in supplemental training, is now in private beta at SpaceX & Tesla. Early evals show performance close to, perhaps exceeding Opus. RL is continuing to significantly improve the model, and the Grok Build harness gets better every day... Completely trained from scratch new models will be released by SpaceX every month this year.”— Elon Musk, founder and CEO of xAI and SpaceX · X, June 28, 2026

Several things are stated here as fact by the founder, and none has been independently verified. The name is Grok 4.5. The foundation is the V9 model, which xAI puts at 1.5 trillion parameters — roughly three times the size of the “v8-small” architecture (about 500 billion parameters) that powers today’s public Grok. The Cursor data was “added in supplemental training,” a post-pre-training stage rather than something baked in from the start. And the harness Musk name-drops, the Grok Build harness he referenced, is xAI’s parallel coding-agent CLI.

The final sentence — that SpaceX will release “completely trained from scratch” foundation models roughly every month for the rest of 2026 — reads as a compute-capacity flex more than a shipping commitment. It is an announced plan, and we treat it as one below, paired with xAI’s own track record of slipping self-set dates. For a business reader, the useful signal in this post is not the Opus comparison; it is the confirmation that the compute, the model, and the data source are now converging under one roof.

02Beta vs ProductionThe model being talked up is not the one you can buy.

This is the expectations-versus-reality gap worth internalizing before anything else. The model xAI is promoting — Grok 4.5 on the 1.5T V9 foundation — is available to exactly two organizations, SpaceX and Tesla, with no public API and no announced release date. The model paying developers actually run today is a different one: Grok 4.3, built on the v8-small foundation (~500B parameters) that Musk himself has previously described as having “many fundamental flaws.”

Grok 4.3 shipped as the public flagship on April 30, 2026 with a 1M-token context window, and on June 15 it landed on Amazon Bedrock as a purchasable enterprise model — the last arm’s-length distribution move before the flywheel started closing. That contrast is the whole point: the model with the flywheel data is not the model anyone outside SpaceX and Tesla can touch.

In production today
Grok 4.3 · v8-small
~500B foundation · 1M context · public API + Bedrock

The flagship enterprises can actually buy. Priced around $1.25 / $2.50 per million input / output tokens. Built on the generation Musk called flawed — and it carries none of the Cursor flywheel data.

What you can buy today
In private beta
Grok 4.5 · V9
1.5T foundation (xAI-stated) · SpaceX & Tesla only

The model in every headline. Cursor data added via supplemental fine-tuning. No public API, no release date, no independent benchmark. Everything known about it comes from one X post.

What Musk is talking about

03The FlywheelCompute, model, tool, data — one owner.

Here is the loop, stated plainly. xAI owns the compute — the Colossus supercluster in Memphis, running NVIDIA Blackwell-generation GPUs that V9 was optimized for. That compute trains the foundation model (V9, and the Grok models built on it). The model powers coding tools — and after the June 16 acquisition, the flagship one is Cursor, the editor more than a million developers use. Those developers generate real coding-workflow data, which flows back into training. Compute → model → tool → data → back into the model. We do not re-litigate the deal economics here — the $60B all-stock deal mechanics get the full treatment in a separate post.

What is genuinely new is not that Cursor has models — it already ships its own in-house Composer 2.5 model — but that once the deal closes (expected Q3 2026), a single shareholder structure spans every link. No frontier lab has previously held the compute, the foundation model, the coding tool, and that tool’s user-generated training data all at once. That is the structural fact this whole post is about.

Foundation model
V9 parameters (xAI-stated)
1.5T

Roughly 3× the ~500B v8-small in production (1.5T ÷ ~500B ≈ 3×). Self-reported by xAI; no independent parameter audit exists.

Self-reported
Coding tool
Cursor active developers
1M+

The dev-workflow data source the model fine-tunes on once the acquisition closes. Cursor also gains xAI as its owner, not just a routing option.

Pre-close figure
The chain
Compute → model → tool → data
4links

Colossus, V9/Grok, Cursor, and Cursor’s user data — converging under one owner within a single calendar quarter.

One structure
The thesis in one sentence
A data flywheel is only a flywheel when the same owner controls the compute that trains the model, the model itself, and the tool whose users generate the next round of training data — and after June 2026, one owner does. The advantage compounds quietly; the lock-in risk it creates for everyone else is the part worth planning around.

04TimelineHow the loop closed, quarter by quarter.

Most coverage treats “SpaceX merges with xAI,” “SpaceX buys Cursor,” and “Grok 4.5 enters beta” as three separate news items. Lined up by date, they are the same vertically integrated stack closing in real time. One nuance matters most: V9 finished pre-training on May 26 — before the Cursor acquisition was even announced on June 16. That ordering is exactly why Cursor data could only reach Grok 4.5 through supplemental training rather than from the start of pre-training, the caveat the next section unpacks.

The flywheel timeline: six events from February to June 2026, the link in the compute-to-data chain each one closes, and whether the event is independently reported or vendor-stated.
DateEventLink in the chain it closesIndependent or vendor-stated?
Feb 2026SpaceX–xAI merger completes (~$1.25T combined valuation, reported).Compute + foundation model fall under one owner.Reported
Apr 30, 2026Grok 4.3 ships as the public flagship on the v8-small foundation.Foundation model → public API and product.Vendor-stated (launch)
May 26, 2026V9 foundation-model pre-training completes at a self-reported 1.5T params — before the Cursor deal.New foundation model built (pre-Cursor data).Vendor-stated
Jun 15, 2026Grok 4.3 lands on Amazon Bedrock as a purchasable enterprise model.Model → arm’s-length enterprise distribution.Reported
Jun 16, 2026SpaceX announces a $60B all-stock Cursor/Anysphere acquisition (close expected Q3 2026).Coding tool + its dev-workflow data join the owner.Independent (SEC filing)
Jun 28, 2026Grok 4.5 (V9) enters private beta at SpaceX & Tesla with Cursor data added in supplemental training.Loop closes: Cursor data → the model.Vendor-stated (Musk X post)

05Training NuanceSupplemental, not from scratch.

This is the detail most competing coverage skips. “Trained on Cursor data” is doing a lot of work in the headlines, but the Cursor data did not enter at the start of V9’s pre-training — it was folded in afterward, during a supplemental stage. An xAI engineer said as much on X, conceding that supplemental inclusion is “not quite as good as having it in initial training.” In other words: the flywheel exists, but Grok 4.5 is running on an early, weaker version of it.

xAI’s stated fix is the next model. Per reporting, the training run already underway — described at around 2 trillion parameters — is designed to incorporate Cursor data from the beginning of pre-training, which xAI expects will produce stronger coding performance than Grok 4.5. Treat that 2T figure with more caution than the 1.5T one: it traces to a single outlet’s reporting, whereas the V9/1.5T number has several. The direction of travel is the durable point — each successive model is being engineered to draw on Cursor data earlier and more deeply.

Read the cadence as a plan, not a track record
Musk’s “a new from-scratch foundation model every month through 2026” line has no precedent among frontier labs, and xAI has a documented habit of missing its own dates — the still-unreleased Grok 5 (a ~6T MoE, forecast for Q3 2026) has already slipped from late 2025 to Q1 to Q2 targets. Grok 4.5’s own “originally expected” window (late spring / early summer 2026) passed without a public launch. Weigh the monthly-cadence roadmap as ambition, not schedule.

06The Benchmark QuestionWho is marking the homework?

The “close to, perhaps exceeding Opus” claim deserves one careful reframe. It is not a scoreboard result; it is a methodology statement. The evaluations behind it were run internally by SpaceX and Tesla engineers — who, since the merger, are employees of the same parent company as the model’s maker. That is structurally different from an independent benchmark. As of late June 2026, no current-generation Grok model had publicly verified scores on recognized third-party evaluators, and there is no public API on which anyone outside SpaceX or Tesla could produce one for Grok 4.5.

None of this says Grok 4.5 is bad — it says the evidence is in-house. For a business audience, the honest translation is simple: SpaceX and Tesla are marking xAI’s homework, and until an outside party can grade it, “beats Opus” is a claim, not a finding.

What the independent record actually shows

Per TechTimes’ reporting, Claude-family models currently top the Artificial Analysis Intelligence Index — an independently run composite — and Grok models do not appear in those rankings. The only public SWE-bench Verified figure for a Grok coding model is 70.8%, posted by the older, narrower grok-code-fast-1, versus 87.6% for Claude Code on Opus 4.7 — a 16.8-point gap (87.6 − 70.8). Critically, that is not a Grok 4.5 score: Grok 4.5 has no SWE-bench number at all. Use the pairing only to size the pre-V9 baseline xAI is trying to close, never as a Grok-4.5-versus-Opus result.

07Lock-InCursor’s neutrality is now a structural question.

Cursor’s entire pre-acquisition market position was model-agnostic: route your requests to Anthropic’s Claude, OpenAI’s GPT, Google’s Gemini, or Cursor’s own Composer — pick whichever frontier model you already trust. That neutrality is now owned by one of the model vendors it used to treat evenly. To be exact about the present tense: nothing has changed in the product. TechTimes explicitly confirms Cursor’s model-agnostic posture had not changed as of late June 2026. The risk here is prospective and structural, not an already-broken promise — but the incentive that did not exist before June 16 exists now.

The scorecard below is the artifact to screenshot. It contrasts the closing SpaceX/xAI/Cursor stack with a model-agnostic stack (Cursor as it was, or an Anthropic-only tool like Claude Code today) across the five layers a CTO evaluating coding vendors in H2 2026 actually cares about.

Vendor-concentration scorecard comparing the closing SpaceX/xAI/Cursor stack against a model-agnostic stack across five layers: compute, foundation model, coding tool, fine-tuning data, and switching cost.
LayerSpaceX / xAI / Cursor stack (closing Q3 2026)Model-agnostic stack (Cursor pre-close / Claude Code today)
ComputeOwned — xAI’s Colossus supercluster (Memphis, Blackwell GPUs).Vendor-neutral — cloud capacity rented per provider.
Foundation modelSingle owner — Grok / V9, no external option in the loop.Swappable — Claude, GPT, or Gemini, chosen per request.
Coding-tool layerCursor becomes a SpaceX subsidiary on close.Cursor (pre-close), Claude Code, or another independent tool.
Source of fine-tuning dataCursor’s dev-workflow data flows to the owner’s model.Not routed into any single vendor’s training loop.
Who can switch, and how easilyStructurally incentivized toward one stack — no product change yet.Developer picks per request; switching cost stays low.

08ImplicationsWhat this means for your stack.

The flywheel is xAI’s advantage to compound; the actionable part for everyone else is optionality. None of the lock-in risk has materialized in Cursor’s product — but the structural incentive is new, and it is worth building your own early-warning system before you standardize a team on one AI coding vendor. If you are weighing how to structure that decision, our AI transformation engagements start with exactly this kind of vendor-concentration and model-routing review.

If you use Cursor today
Keep using it — but instrument it

Nothing has changed in the product. Track three signals as your early warning: default-model changes, routing or pricing that favors Grok, and any training-data terms that appear in Cursor’s ToS. None has happened yet.

Stay, but watch
If you’re standardizing a team
Don’t hard-wire one vendor

The entire value of a model-agnostic tool is the optionality. Preserve an exit path and keep a second tool warm before a coding editor owned by a model vendor quietly becomes your only default.

Preserve optionality
If you’re weighing Grok 4.5
Wait for an independent score

There is no public API, no release date, and no third-party benchmark. A founder’s tweet is not a procurement input. Re-evaluate the moment someone outside SpaceX and Tesla can measure it on your own tasks.

Not yet
If you need coding AI now
Buy what is actually shipping

Grok 4.3 is on Amazon Bedrock; Claude, GPT, and Gemini are generally available. Route by measured performance on your own repositories, not by a monthly-cadence roadmap that has yet to be demonstrated.

Route by evidence

09ConclusionThe flywheel is real; the benchmark is not yet.

The shape of vertical integration, mid-2026

One owner now holds the compute, the model, and the tool that feeds it.

Strip away the Opus comparison and the news underneath is bigger than any single model. Within one calendar quarter, SpaceX and xAI merged their compute and their models, acquired the coding tool a million developers use, and shipped a private beta trained on that tool’s data. That is a vertically integrated AI stack closing in real time — compute, model, tool, and training data under one shareholder structure, a configuration no frontier lab has held before.

The honest caveats are just as important as the thesis. The 1.5T parameter count is self-reported. The “beats Opus” line is an in-house eval with no independent verification. The Cursor data arrived through supplemental training, not from scratch. The monthly-model cadence is a plan from a company that slips its own dates. And Grok 4.5 itself is unavailable to anyone who is not SpaceX or Tesla. The right posture is skeptical patience: watch the flywheel, do not buy the scoreboard.

For your own team, the takeaway is not about Grok at all. It is that the tool many developers reach for by default is now owned by one of the model vendors it used to treat neutrally. Nothing has broken yet — but the incentive to erode that neutrality is new, and the cheapest insurance is to keep your stack model-agnostic and your switching cost low while you still can.

Choose AI coding vendors with eyes open

Keep your coding stack model-agnostic while switching costs are still low.

We help teams evaluate AI coding tools and frontier models on the axis that actually matters — measured performance on your own repositories, and how much vendor concentration you're quietly signing up for. Delivered in days, not quarters.

Free consultationExpert guidanceTailored solutions
What we work on

AI coding-stack engagements

  • Vendor-concentration and lock-in risk reviews
  • Model-routing across Claude, GPT, Gemini, and Grok
  • Benchmarking coding tools on your own repositories
  • Exit-path and second-tool contingency planning
  • Governance for a mixed open + closed model stack
FAQ · Grok 4.5 & the SpaceX flywheel

The questions we get every week.

Grok 4.5 is xAI's newest model, announced by Elon Musk in a post on X on June 28, 2026. Per Musk, it is built on the V9 foundation model — self-reported at roughly 1.5 trillion parameters — with Cursor's developer data added during a supplemental training stage. As of publication it is in private beta at SpaceX and Tesla only, with no public API, no announced release date, and no independent benchmark. Everything publicly known about the model traces to that single X post and the outlets that reported on it. Treat the specifications as vendor-stated rather than independently confirmed, and note that the model most developers can actually use today is the earlier Grok 4.3, not Grok 4.5.
Related dispatches

Continue exploring the AI coding stack.