On June 28, 2026, Elon Musk announced that Grok 4.5 — xAI’s newest model, built on the 1.5-trillion-parameter V9 foundation model with Cursor’s developer data added in supplemental training — had entered private beta at SpaceX and Tesla. The name and the number both come straight from Musk. What makes this more than a routine model drop is what sits behind it: a single owner now controls the compute, the model, and the coding tool that generates the model’s training data.
Musk’s post also carried a headline claim — that early evaluations show Grok 4.5 performing “close to, perhaps exceeding Opus.” It is worth being precise about what that is and is not. It is a self-evaluation, run by SpaceX and Tesla engineers who now work under the same corporate roof as xAI. No independent benchmark — LMArena, Artificial Analysis, SWE-bench, Humanity’s Last Exam — has scored Grok 4.5, and there is no public API to run one on. Treat the comparison as vendor-stated, not settled.
This guide maps the flywheel Musk is building: how the February 2026 SpaceX–xAI merger, the June 16 Cursor acquisition, and the June 28 Grok 4.5 beta connect into one vertically integrated loop, why “supplemental training” is a bigger caveat than it sounds, and — most usefully for your own team — what to watch for before you standardize on a coding tool that a model vendor now owns.
- 01Grok 4.5 runs on xAI’s 1.5T V9 foundation model.Musk announced it on June 28, 2026. V9 — self-reported at roughly 1.5 trillion parameters — is about 3× the ~500B v8-small architecture that powers today’s public Grok. No independent audit of the count exists.
- 02Cursor’s data was added in supplemental training, not from scratch.An xAI engineer conceded that supplemental inclusion is “not quite as good as having it in initial training.” xAI says the next model in training is being built to bake Cursor data in from the start of pre-training.
- 03The “close to, perhaps exceeding Opus” claim is a vendor self-eval.The only evaluators are SpaceX and Tesla engineers — employees of the same parent company as xAI. No independent benchmark (LMArena, Artificial Analysis, SWE-bench, HLE) has scored Grok 4.5. Treat it as one tweet, not a scoreboard.
- 04One owner now holds every link in the chain.Compute (the Colossus supercluster), the foundation model (Grok/V9), and — once the $60B Cursor deal closes — the coding tool that generates the fine-tuning data. That closed loop is the flywheel.
- 05The model being talked up is not the one you can buy.Grok 4.5 is a private beta at SpaceX and Tesla only. Paying developers still use Grok 4.3 on v8-small. Cursor’s model-agnostic posture is unchanged — for now. The lock-in risk is structural, not yet visible in the product.
01 — AnnouncementWhat Musk actually said on June 28.
The announcement was a single post on X — not a technical report, not a benchmark submission, not a model card. In four sentences it named the model, the foundation it runs on, the training method, and a roadmap. Here it is, lightly trimmed:
“Grok 4.5, based on our 1.5T V9 foundation model, with Cursor data added in supplemental training, is now in private beta at SpaceX & Tesla. Early evals show performance close to, perhaps exceeding Opus. RL is continuing to significantly improve the model, and the Grok Build harness gets better every day... Completely trained from scratch new models will be released by SpaceX every month this year.”— Elon Musk, founder and CEO of xAI and SpaceX · X, June 28, 2026
Several things are stated here as fact by the founder, and none has been independently verified. The name is Grok 4.5. The foundation is the V9 model, which xAI puts at 1.5 trillion parameters — roughly three times the size of the “v8-small” architecture (about 500 billion parameters) that powers today’s public Grok. The Cursor data was “added in supplemental training,” a post-pre-training stage rather than something baked in from the start. And the harness Musk name-drops, the Grok Build harness he referenced, is xAI’s parallel coding-agent CLI.
The final sentence — that SpaceX will release “completely trained from scratch” foundation models roughly every month for the rest of 2026 — reads as a compute-capacity flex more than a shipping commitment. It is an announced plan, and we treat it as one below, paired with xAI’s own track record of slipping self-set dates. For a business reader, the useful signal in this post is not the Opus comparison; it is the confirmation that the compute, the model, and the data source are now converging under one roof.
02 — Beta vs ProductionThe model being talked up is not the one you can buy.
This is the expectations-versus-reality gap worth internalizing before anything else. The model xAI is promoting — Grok 4.5 on the 1.5T V9 foundation — is available to exactly two organizations, SpaceX and Tesla, with no public API and no announced release date. The model paying developers actually run today is a different one: Grok 4.3, built on the v8-small foundation (~500B parameters) that Musk himself has previously described as having “many fundamental flaws.”
Grok 4.3 shipped as the public flagship on April 30, 2026 with a 1M-token context window, and on June 15 it landed on Amazon Bedrock as a purchasable enterprise model — the last arm’s-length distribution move before the flywheel started closing. That contrast is the whole point: the model with the flywheel data is not the model anyone outside SpaceX and Tesla can touch.
Grok 4.3 · v8-small
The flagship enterprises can actually buy. Priced around $1.25 / $2.50 per million input / output tokens. Built on the generation Musk called flawed — and it carries none of the Cursor flywheel data.
Grok 4.5 · V9
The model in every headline. Cursor data added via supplemental fine-tuning. No public API, no release date, no independent benchmark. Everything known about it comes from one X post.
03 — The FlywheelCompute, model, tool, data — one owner.
Here is the loop, stated plainly. xAI owns the compute — the Colossus supercluster in Memphis, running NVIDIA Blackwell-generation GPUs that V9 was optimized for. That compute trains the foundation model (V9, and the Grok models built on it). The model powers coding tools — and after the June 16 acquisition, the flagship one is Cursor, the editor more than a million developers use. Those developers generate real coding-workflow data, which flows back into training. Compute → model → tool → data → back into the model. We do not re-litigate the deal economics here — the $60B all-stock deal mechanics get the full treatment in a separate post.
What is genuinely new is not that Cursor has models — it already ships its own in-house Composer 2.5 model — but that once the deal closes (expected Q3 2026), a single shareholder structure spans every link. No frontier lab has previously held the compute, the foundation model, the coding tool, and that tool’s user-generated training data all at once. That is the structural fact this whole post is about.
V9 parameters (xAI-stated)
Roughly 3× the ~500B v8-small in production (1.5T ÷ ~500B ≈ 3×). Self-reported by xAI; no independent parameter audit exists.
Cursor active developers
The dev-workflow data source the model fine-tunes on once the acquisition closes. Cursor also gains xAI as its owner, not just a routing option.
Compute → model → tool → data
Colossus, V9/Grok, Cursor, and Cursor’s user data — converging under one owner within a single calendar quarter.
04 — TimelineHow the loop closed, quarter by quarter.
Most coverage treats “SpaceX merges with xAI,” “SpaceX buys Cursor,” and “Grok 4.5 enters beta” as three separate news items. Lined up by date, they are the same vertically integrated stack closing in real time. One nuance matters most: V9 finished pre-training on May 26 — before the Cursor acquisition was even announced on June 16. That ordering is exactly why Cursor data could only reach Grok 4.5 through supplemental training rather than from the start of pre-training, the caveat the next section unpacks.
| Date | Event | Link in the chain it closes | Independent or vendor-stated? |
|---|---|---|---|
| Feb 2026 | SpaceX–xAI merger completes (~$1.25T combined valuation, reported). | Compute + foundation model fall under one owner. | Reported |
| Apr 30, 2026 | Grok 4.3 ships as the public flagship on the v8-small foundation. | Foundation model → public API and product. | Vendor-stated (launch) |
| May 26, 2026 | V9 foundation-model pre-training completes at a self-reported 1.5T params — before the Cursor deal. | New foundation model built (pre-Cursor data). | Vendor-stated |
| Jun 15, 2026 | Grok 4.3 lands on Amazon Bedrock as a purchasable enterprise model. | Model → arm’s-length enterprise distribution. | Reported |
| Jun 16, 2026 | SpaceX announces a $60B all-stock Cursor/Anysphere acquisition (close expected Q3 2026). | Coding tool + its dev-workflow data join the owner. | Independent (SEC filing) |
| Jun 28, 2026 | Grok 4.5 (V9) enters private beta at SpaceX & Tesla with Cursor data added in supplemental training. | Loop closes: Cursor data → the model. | Vendor-stated (Musk X post) |
05 — Training NuanceSupplemental, not from scratch.
This is the detail most competing coverage skips. “Trained on Cursor data” is doing a lot of work in the headlines, but the Cursor data did not enter at the start of V9’s pre-training — it was folded in afterward, during a supplemental stage. An xAI engineer said as much on X, conceding that supplemental inclusion is “not quite as good as having it in initial training.” In other words: the flywheel exists, but Grok 4.5 is running on an early, weaker version of it.
xAI’s stated fix is the next model. Per reporting, the training run already underway — described at around 2 trillion parameters — is designed to incorporate Cursor data from the beginning of pre-training, which xAI expects will produce stronger coding performance than Grok 4.5. Treat that 2T figure with more caution than the 1.5T one: it traces to a single outlet’s reporting, whereas the V9/1.5T number has several. The direction of travel is the durable point — each successive model is being engineered to draw on Cursor data earlier and more deeply.
06 — The Benchmark QuestionWho is marking the homework?
The “close to, perhaps exceeding Opus” claim deserves one careful reframe. It is not a scoreboard result; it is a methodology statement. The evaluations behind it were run internally by SpaceX and Tesla engineers — who, since the merger, are employees of the same parent company as the model’s maker. That is structurally different from an independent benchmark. As of late June 2026, no current-generation Grok model had publicly verified scores on recognized third-party evaluators, and there is no public API on which anyone outside SpaceX or Tesla could produce one for Grok 4.5.
None of this says Grok 4.5 is bad — it says the evidence is in-house. For a business audience, the honest translation is simple: SpaceX and Tesla are marking xAI’s homework, and until an outside party can grade it, “beats Opus” is a claim, not a finding.
Per TechTimes’ reporting, Claude-family models currently top the Artificial Analysis Intelligence Index — an independently run composite — and Grok models do not appear in those rankings. The only public SWE-bench Verified figure for a Grok coding model is 70.8%, posted by the older, narrower grok-code-fast-1, versus 87.6% for Claude Code on Opus 4.7 — a 16.8-point gap (87.6 − 70.8). Critically, that is not a Grok 4.5 score: Grok 4.5 has no SWE-bench number at all. Use the pairing only to size the pre-V9 baseline xAI is trying to close, never as a Grok-4.5-versus-Opus result.
07 — Lock-InCursor’s neutrality is now a structural question.
Cursor’s entire pre-acquisition market position was model-agnostic: route your requests to Anthropic’s Claude, OpenAI’s GPT, Google’s Gemini, or Cursor’s own Composer — pick whichever frontier model you already trust. That neutrality is now owned by one of the model vendors it used to treat evenly. To be exact about the present tense: nothing has changed in the product. TechTimes explicitly confirms Cursor’s model-agnostic posture had not changed as of late June 2026. The risk here is prospective and structural, not an already-broken promise — but the incentive that did not exist before June 16 exists now.
The scorecard below is the artifact to screenshot. It contrasts the closing SpaceX/xAI/Cursor stack with a model-agnostic stack (Cursor as it was, or an Anthropic-only tool like Claude Code today) across the five layers a CTO evaluating coding vendors in H2 2026 actually cares about.
| Layer | SpaceX / xAI / Cursor stack (closing Q3 2026) | Model-agnostic stack (Cursor pre-close / Claude Code today) |
|---|---|---|
| Compute | Owned — xAI’s Colossus supercluster (Memphis, Blackwell GPUs). | Vendor-neutral — cloud capacity rented per provider. |
| Foundation model | Single owner — Grok / V9, no external option in the loop. | Swappable — Claude, GPT, or Gemini, chosen per request. |
| Coding-tool layer | Cursor becomes a SpaceX subsidiary on close. | Cursor (pre-close), Claude Code, or another independent tool. |
| Source of fine-tuning data | Cursor’s dev-workflow data flows to the owner’s model. | Not routed into any single vendor’s training loop. |
| Who can switch, and how easily | Structurally incentivized toward one stack — no product change yet. | Developer picks per request; switching cost stays low. |
08 — ImplicationsWhat this means for your stack.
The flywheel is xAI’s advantage to compound; the actionable part for everyone else is optionality. None of the lock-in risk has materialized in Cursor’s product — but the structural incentive is new, and it is worth building your own early-warning system before you standardize a team on one AI coding vendor. If you are weighing how to structure that decision, our AI transformation engagements start with exactly this kind of vendor-concentration and model-routing review.
Keep using it — but instrument it
Nothing has changed in the product. Track three signals as your early warning: default-model changes, routing or pricing that favors Grok, and any training-data terms that appear in Cursor’s ToS. None has happened yet.
Don’t hard-wire one vendor
The entire value of a model-agnostic tool is the optionality. Preserve an exit path and keep a second tool warm before a coding editor owned by a model vendor quietly becomes your only default.
Wait for an independent score
There is no public API, no release date, and no third-party benchmark. A founder’s tweet is not a procurement input. Re-evaluate the moment someone outside SpaceX and Tesla can measure it on your own tasks.
Buy what is actually shipping
Grok 4.3 is on Amazon Bedrock; Claude, GPT, and Gemini are generally available. Route by measured performance on your own repositories, not by a monthly-cadence roadmap that has yet to be demonstrated.
09 — ConclusionThe flywheel is real; the benchmark is not yet.
One owner now holds the compute, the model, and the tool that feeds it.
Strip away the Opus comparison and the news underneath is bigger than any single model. Within one calendar quarter, SpaceX and xAI merged their compute and their models, acquired the coding tool a million developers use, and shipped a private beta trained on that tool’s data. That is a vertically integrated AI stack closing in real time — compute, model, tool, and training data under one shareholder structure, a configuration no frontier lab has held before.
The honest caveats are just as important as the thesis. The 1.5T parameter count is self-reported. The “beats Opus” line is an in-house eval with no independent verification. The Cursor data arrived through supplemental training, not from scratch. The monthly-model cadence is a plan from a company that slips its own dates. And Grok 4.5 itself is unavailable to anyone who is not SpaceX or Tesla. The right posture is skeptical patience: watch the flywheel, do not buy the scoreboard.
For your own team, the takeaway is not about Grok at all. It is that the tool many developers reach for by default is now owned by one of the model vendors it used to treat neutrally. Nothing has broken yet — but the incentive to erode that neutrality is new, and the cheapest insurance is to keep your stack model-agnostic and your switching cost low while you still can.