The AI Realist

The Backstop Has a Name Now - part 2

Julien Simon — Mon, 06 Jul 2026 09:05:17 GMT

Ask why Nvidia took equity in the neoclouds it works with — CoreWeave, Nebius, even Firmus, its own revenue-share launch partner — and took none in Sharon AI, and the answer is in the instrument itself.

Start with what Nvidia does for the clouds that made it. CoreWeave has been public since March 2025; it carries more than $20 billion of debt and once looked reckless to venture investors, but it borrows against investment-grade, asset-backed paper — an $8.5 billion facility rated A3, secured by the chips and the customer contracts, at roughly SOFR plus 2.25 percent.[1] Nebius, the old Yandex reconstituted on Nasdaq, posted positive adjusted EBITDA in the first quarter of 2026, raised more than $6 billion this year, and ended the quarter with $9.3 billion in cash.[2] Both are anchored by hyperscalers whose contracts pay in advance: CoreWeave by Microsoft, Nebius by Meta and Microsoft, on deals worth tens of billions.[3] And into both, Nvidia put equity — $2 billion into CoreWeave in January, $2 billion into Nebius in March, the same instrument it used that season in Lumentum and Coherent.[4]

Equity is the tell. It is a bet on enterprise value: a junior claim that pays only if the company becomes worth something, which it has for these two. Nvidia takes equity where there is value to own. It did not offer Sharon AI equity, and it would not, because the thing a revenue-share does that equity cannot is sit senior to the shareholders, a claim on revenue paid off the top, ahead of a stake that may end up worthless. How Nvidia chooses to finance a cloud is a readout of what it thinks the cloud is worth. See value, and it buys in. See doubt, and it keeps its distance: a claim on the revenue, a loan to produce it, and no share of the company.

The counterparty side complicates the neat version. CoreWeave and Nebius fund their buildouts with investment-grade debt and hyperscaler prepayments, so they never needed Nvidia’s financing. Firmus and Sharon AI did take it, and Firmus is worth $5.5 billion, so a revenue share is not, by itself, a mark of distress. It is how a buildout gets financed quickly, as Nvidia’s own chief financial officer frames it: a way to serve companies with demand but who cannot secure financing quickly enough.[5] The instrument opens a new recurring revenue line for Nvidia. What separates the two who took it is whether Nvidia also wanted to own them.

The debt market already sorts AI borrowers by distance from cash: Amazon borrows unsecured, Oracle against backlog, SoftBank could not borrow against a private mark at all.[6] The revenue-share program is what sits below SoftBank’s rung, where the debt market says no and Nvidia says yes, through an instrument no bank would offer. It is a familiar move under a new name. An earlier piece called it the Overbuild Put: a company financing a buildout it might not fill names its own backstop, and the backstop is the giveaway.[7] Meta named its exit, a cloud business it might have to start if it overbuilt. Sharon AI’s exit is named for it, by Nvidia; the credit support is the backstop that lets a buildout proceed whose demand is unproven. The difference is who writes the text. Meta wrote one on its own capacity. Nvidia writes one on Sharon AI’s, the supplier backstopping the demand of a customer it also sells to.

The February COMECON piece argued Nvidia holds the independent cloud tier captive; Part 1 named the productized version.[8] The line is not equity versus revenue-share, since Firmus took both; it is ownership versus none. Nvidia holds a stake in CoreWeave, Nebius, and Firmus. In Sharon AI, it holds only a claim.

What “fragile enough to sign” looks like

Sharon AI is the exhibit, the one launch partner Nvidia financed but would not own. Firmus, the other, raised $505 million at a $5.5 billion valuation with Nvidia among its backers;[4] whatever else it is, it is not the bottom of the ladder. Sharon AI’s own filings tell most of the story before any short seller does.

At the end of March, the company held $164 million in cash and no revenue; its filings do not expect revenue to begin until September, and operating cash flow is negative, with a capital-expenditure requirement of about $720 million to support its lead contract.[9] Against the cash are two customer contracts totaling $1.26 billion and a second worth roughly $950 million.[10] Its reported quarterly loss of $20 million is mostly an accounting artifact: the company carries its convertible notes at fair value, so remeasuring them drove a $70 million loss through the income statement, offset by a $66 million gain on the sale of half of a Texas data-center joint venture.[11] A company whose reported profit swings with the value of its own debt and whose cash comes from an IPO and asset sales rather than operations is being valued on a chain of announcements, not cash flow. There is none yet.

The build is financed by a stack of commitments, each conditioned on the next. A $200 million facility from an investor called Digital Alpha and a $500 million facility from USD.AI, a roughly one-year-old decentralized-finance protocol, were both announced as “up to” and, per a short report, remained unexecuted months later.[12] In May, the company closed $350 million of convertible notes led by Oaktree. Per the same report, those notes were funded only if Sharon AI first signed a binding contract for at least 4,068 additional GPUs, its lender declining to treat the announced $1.26 billion in demand as sufficient collateral.[13] In June, it raised an additional $1.6 billion.[14] And on June 12 came the keystone, the six-year Nvidia deal, whose filing language says it is “structured so that Sharon AI can commit to large-scale NVIDIA infrastructure” through the revenue-share and credit-support.[15] Nvidia’s credit line is the piece that lets the rest of the tower stand.

Then the customer, where the short case overreaches, and the real point is sharper. Sharon AI’s forward revenue rests on the $1.26 billion contract with ESDS Software Solution, and ESDS is not a shell. It is an established Indian cloud provider with roughly ₹361 crore (about $43 million) in revenue last fiscal year, up 27 percent, profitable, lightly indebted, and preparing an IPO of its own.[16] The problem is scale, not solvency. The contract calls for average annual payments of roughly $250 million and $140 million in letters of credit; the annual figure alone is six times ESDS’s entire revenue.[17] A real $43 million company can be a real counterparty to a contract its own size; whether it can perform one thirty times larger is the open question, and it is the same question Sharon AI’s own lender asked. The chief executive brings his own history. Manning’s prior public company, Mawson Infrastructure Group, alleges in court filings that he directed roughly A$11.5 million to a shipping firm he controlled without disclosing his interest to the board, allegations he contests and that remain unadjudicated. That same firm, Flynt, is now a disclosed related-party vendor of Sharon AI, according to the company’s own prospectus.[18] None of this has stopped the stock, which has risen sharply; the market has not endorsed the short thesis.[19] But it is the profile of an operator that could not fund this build on ordinary terms, which is why the revenue-share was there to be signed.

One detail seals the instrument argument. Nvidia holds no equity in Sharon AI; its annual report called Nvidia a “strategic shareholder” and the company filed a correction stating Nvidia owns none.[20] It invested $2 billion in equity in each of CoreWeave and Nebius, and took a stake in Firmus, its own revenue-share partner. Only in Sharon AI did it take a revenue-share, a credit claim, and no ownership at all, because what it is underwriting here is not an asset it wants to hold. It is a demand that needs to be kept alive.

What comes next

The rule generalizes, and it is the thing to take from Sharon AI. When a supplier finances its own customer, the instrument it chooses is a private credit rating, better informed than the market’s, because the supplier sits inside the relationship. Equity is the vote of confidence. The revenue share is how the buildout gets paid for, and Firmus, worth $5.5 billion, took one too. What sets Sharon AI apart is the equity Nvidia declined to take. Call it the Instrument Test: read what a vendor takes, not what it announces. It travels beyond Nvidia: whenever a vendor lends to the customers who buy its product, the instrument encodes what the vendor privately believes, and usually before the tape does.

Which makes the program itself an indicator. Nvidia split its partners by what it was willing to own: it took a stake in CoreWeave, Nebius, and Firmus, and in Sharon AI it took only a claim. As the program spreads, watch which companies get a revenue share with no stake besides it. That pairing, not the revenue-share alone, is the readout of how far down the counterparty-quality curve Nvidia is reaching, and it moves before the market does, because it is Nvidia’s own hand showing. When the next partner arrives with a credit line and no equity cheque, Nvidia is saying, in the one language it cannot fake, that it is a company it will finance but will not own. And note the second edge. When a chip vendor’s credit support helps fund the purchase of its own chips and then collects a share of the revenue those chips generate, the revenue is partly its own money coming home — the same round-trip that ran at the hyperscaler layer, now reaching the bottom of the neocloud tier.[21]

A fair objection remains: a revenue-share cuts both ways. If an operator’s utilization falls, so does Nvidia’s cut, so this is exposure to Sharon AI’s success, not merely a claim on it. But exposure to the success of the operators least likely to deliver it is either deep conviction or the position of a supplier that has run out of stronger customers to sell to.

CoreWeave was fragile once, too, and became a company worth billions; one of these operators may do the same, and Oaktree and Goldman are betting on it. But read the instrument. In CoreWeave, Nebius, and Firmus, Nvidia is a shareholder, betting the company wins. In Sharon AI it is a creditor, arranging to be paid whether it wins or not. Which seat a supplier takes says more than any forecast it offers. And at the bottom of the ladder, Nvidia took the creditor’s.

Notes

[1] CoreWeave (Nasdaq: CRWV) listed March 28, 2025; total debt now exceeds $21 billion. Its March 2026 $8.5 billion facility is rated A3/A(low), secured by substantially all assets of the borrower group, at SOFR + 2.25% floating or ~5.9% fixed, maturing March 2032, and is the first investment-grade-rated GPU-backed financing: Sacra; Quartz. Its credit agreement requires contracts with “large and creditworthy” customers covering future debt repayments.

[2] Nebius Group (Nasdaq: NBIS), formerly Yandex N.V., resumed Nasdaq trading October 2024. Q1 2026: revenue $399M (+684% YoY); positive adjusted EBITDA (~45% AI-cloud margin) and net income of $621.2M (inclusive of non-operating items); more than $6 billion raised in 2026 ($4.3B convertible notes plus Nvidia’s equity), ending the quarter with $9.3B in cash: TIKR; Simply Wall St.

[3] CoreWeave’s anchor customer is Microsoft; Nebius holds a five-year ~$27B Meta agreement and a ~$17–19.4B Microsoft agreement, with hyperscaler prepayments helping fund capex: Forbes; Morningstar.

[4] Nvidia made a $2 billion private placement in CoreWeave in January 2026 (22,935,780 Class A shares at $87.20) as part of an expanded collaboration targeting more than 5 GW by 2030, and agreed to purchase CoreWeave’s unsold capacity through 2032: Sacra; Forbes. Its $2 billion equity investment in Nebius (March 11, 2026) mirrored $2 billion investments in Lumentum and Coherent the same month: MLQ News. Nvidia also holds equity in Firmus, having joined a 2025 round and participated in Firmus’s April 2026 US$505 million round at a US$5.5 billion valuation led by Coatue; the April participation was reported subject to closing conditions. Firmus separately arranged a US$10 billion debt facility. Verify against Firmus’s April 2026 funding release before publication.

[5] Nvidia CFO Colette Kress, framing the program as serving companies with demand that cannot secure financing quickly enough: NVIDIA blog, July 1, 2026.

[6] “Cash Flow Lends. Valuation Doesn’t.,” The AI Realist, June 12, 2026.

[7] “The Overbuild Put,” The AI Realist, June 1, 2026 — reading a fallback-monetisation or backstop remark as a credit signal on a debt-financed buildout whose demand is unproven.

[8] “Jensen’s COMECON: How Nvidia Built an Empire of Captive Clouds,” The AI Realist, Feb. 14, 2026; “The Backstop Has a Name Now (Part 1),” The AI Realist, July 2026 (confirm published slug before linking).

[9] SharonAI Holdings Inc., Form 10-Q for the quarter ended March 31, 2026 (cash $164,288,288; negative operating cash flow; revenue commencement not expected until approximately September 2026), attached to the Company’s Form 424B3, SEC EDGAR (CIK 0002068385). Approximately $720 million of capital expenditure is tied to the lead customer arrangement per the same filing.

[10] ESDS master services agreement of $1,260,000,000 and a second customer contract of approximately $950,000,000, per the Form 424B3 referenced in [9].

[11] Fair-value option elected on the convertible notes (Level 3); Q1 2026 included a $70.2 million loss on remeasurement and a $65,919,712 gain on the sale of a 50% interest in the Texas Critical Data Centers joint venture; convertible-note fair value of $199,358,226 as of March 31, 2026 (Form 424B3, [9]).

[12] Announcements of a Digital Alpha facility of up to $200 million (Jan. 19, 2026, subject to definitive documentation) and a USD.AI facility of up to $500 million (Jan. 22, 2026). Characterisations of USD.AI’s on-chain capacity and the unexecuted status of the Digital Alpha facility are from Bleecker Street Research, “SharonAI (SHAZ),” April 30, 2026 (a short seller with a disclosed position).

[13] SharonAI Form 8-K, Exhibit 99.1 (closing of $350 million of convertible senior notes due 2031, led by Oaktree), May 2026, SEC EDGAR. The 4,068-GPU closing condition is as characterised in the Bleecker Street report ([12]); confirm against the note purchase agreement before publication.

[14] SharonAI’s oversubscribed $1.6 billion private placement (approximately $900 million equity and $700 million of 4.75% convertible notes due 2032), June 2026, with Goldman Sachs as lead placement agent: Benzinga.

[15] Six-year Nvidia collaboration (72 MW, up to 40,000 Grace Blackwell GB300), per the Company’s June 12, 2026 announcement reproduced in the Form 424B3 ([9]).

[16] ESDS Software Solution Limited, FY2025 (year ended March 31, 2025): total revenue ₹361 crore (~$43 million), up 27% year over year; profit before tax up nearly fourfold; debt-to-equity of 0.15 and current ratio of 2.32; DRHP filed March 30, 2025 for a ₹600 crore IPO on the BSE and NSE: unlistedzone; ESDS financials via WWIPL. Verify against ESDS’s audited DRHP financials before publication.

[17] The $250 million average annual payment and the $140 million letter-of-credit obligation are per the Bleecker Street report ([12]); confirm against the master services agreement before publication.

[18] Mawson Infrastructure Group Inc. (Nasdaq: MIGI) alleges, in its January 10, 2025 court filing, that James Manning, its former chief executive, caused the company to pay over A$11.4 million to Flynt International Cargo Solutions (a Vertua subsidiary) for services it “did not need,” without disclosing his interest or seeking board approval: Mawson Form 8-K, Exhibit 99.1, SEC EDGAR. The allegations are unadjudicated and contested. SharonAI’s own prospectus discloses Flynt ICS as a related-party vendor: “Flynt is a subsidiary of Vertua Limited and affiliated to the Group through common ownership by James Manning,” and the Group paid Flynt $167,638 in services expenses for the year ended December 31, 2024.

[19] Public market data as of early July 2026; SHAZ has risen sharply since the April 30 short report, and the market has not validated the short thesis.

[20] SharonAI corrected its FY2025 Form 10-K, which had described NVIDIA as a “strategic shareholder,” to state that NVIDIA holds no equity securities of the Company: SharonAI Form 8-K correction (SEC EDGAR).

[21] “The Round Trip,” The AI Realist, May 4, 2026.

The Backstop Has a Name Now - part 1

Julien Simon — Sat, 04 Jul 2026 06:09:31 GMT

In its most recent quarter, Nvidia returned a record $20 billion to shareholders. In May, its board authorized another $80 billion in buybacks; in June, it raised $25 billion in the bond market — the balance sheet of a company with no financing problem of its own.[1] On July 1, it announced a program to help companies that cannot afford its chips buy them anyway, in exchange for a share of the profits.[2]

The two sit oddly together. A company that hands out that much to shareholders does not usually need to lend to its own customers.

In February, I called this the COMECON model: Nvidia keeps the independent GPU clouds, the neoclouds, captive through four instruments: GPU allocation, equity stakes, credit enhancement, and demand backstops.[3] On July 1, it took two of them, the credit enhancement and the backstop, and gave them a product name.

The program is a “revenue-sharing and credit-support model.” A cloud puts Nvidia GPUs on the floor without carrying the full capital cost, draws token credits against future capacity today, and hands Nvidia its standard hardware margin plus a recurring, usage-linked share of the cloud revenue that capacity generates.[2] How large that share is, Nvidia has not said. The first named partners are Sharon AI, deploying up to 40,000 Grace Blackwell GB300s in Australia, and Firmus, building toward 360 megawatts and 170,000 GPUs in Batam, Indonesia.[4] The arrangement is not wholly new; Nvidia ran a demand backstop with CoreWeave and took an equity stake in OpenAI. What is new is the packaging: the credit support and a revenue-share leg under one name, one template, one marketing page.[5]

The bullish read wrote itself within hours: recurring revenue, a widening moat, and a supplier tying itself to customer usage rather than a single sale. The bearish read wrote itself too: circular financing, vendor-funded demand, the fiber and telecom buildouts in period costume. Both are on offer. Neither is the question that matters.

The question that matters is why the market's strongest supplier is doing this now. The answer isn’t on Nvidia’s blog; it’s on its customers’ earnings calls.

On April 29, Google said it would begin delivering its TPUs to a select group of customers for their own data centers, its formal move into the merchant-silicon market Nvidia has long dominated.[6] Weeks later, AWS confirmed it is exploring selling Trainium to outside data centers.[7] For a decade, the hyperscalers built custom chips and kept them in-house; in 2026, both began selling them. Nvidia’s largest customers are becoming its competitors, from the top of the stack down. (I argued in May that Google is the one to watch: its chip runs its own frontier models, while Amazon’s mostly runs rented workloads — the line that separates a silicon business from a silicon cost center.[8])

Take Nvidia’s case at its strongest. The financing gap is real; lenders have been wary of hardware whose resale value no one can yet model, so builders with genuine demand still can’t get compute funding fast enough. The inference tenants Nvidia names alongside the program (Baseten, Fireworks AI, Together AI) are well-capitalized companies, not strays. And a revenue-share cuts both ways: if a partner’s utilization falls, so does Nvidia’s cut. That is exposure to the customer’s success, not a lien on it. On its own terms, the program clears a bottleneck and aligns incentives, and that reading is not wrong.

It is incomplete because of when it arrived. A supplier that spent a decade as the only game in town does not wander into neocloud finance in the same quarter that its two largest customers start selling their own chips. The gap is real; the timing is no coincidence; the calendar tips the scales toward defense. Whatever else it does, the program buys the loyalty of the layer beneath the hyperscalers, the independent clouds with no silicon of their own; at the moment, the layer above turns competitive. The revenue-share ties their economics to Nvidia’s; the credit-support makes Nvidia the reason some of them can exist at all.

There is a sharper edge, and it is the subject of the next piece. Nvidia’s CFO frames the program as serving companies that have demand but cannot secure financing fast enough; even long-term commitments haven’t unlocked the capital.[9] That describes the borrowers that conventional lenders turn away.

How far down the collateral ladder the model reaches is the open question.

Firmus is a large greenfield campus and is not obviously a distressed borrower. Sharon AI, the other named partner, is another matter: it was listed on Nasdaq in February and carries a market capitalization above a billion dollars on almost no revenue, and its financing stack and headline contracts are already the subject of a detailed short-seller report.[10] I’ll take those allegations through the primary filings next. Nvidia, for the record, holds no equity in it; the hold runs entirely through the revenue-share and the credit line.[11]

The backstop was always there; on July 1, it got a name, which is what usually happens to an improvisation just before it becomes a system. Whether it is mostly defense or mostly reach, the next deals will say. If the template stays with capital-constrained clouds, it is the base-reinforcement it looks like; if Nvidia extends it to well-funded clouds that could finance the GPUs themselves, the defensive read was wrong, and this is Nvidia annexing cloud economics wherever it can. Either way, the neocloud it piloted on is where the risk is hiding, and the filings are where the next piece goes.

Notes

[1] NVIDIA returned approximately $20 billion to shareholders in Q1 FY2027, and its board authorized an additional $80 billion in repurchases on May 18, 2026: NVIDIA Q1 FY2027 results (Form 8-K), SEC EDGAR. The $25 billion multi-tranche notes offering priced on June 18, 2026: NVIDIA Form 8-K, June 18, 2026, SEC EDGAR.

[2] NVIDIA Unlocks AI Compute at Scale, NVIDIA blog, July 1, 2026 (co-authored by CFO Colette Kress).

[3] Jensen’s COMECON: How Nvidia Built an Empire of Captive Clouds, The AI Realist, Feb. 14, 2026.

[4] Firmus scale from NVIDIA blog [2]; Sharon AI terms (six-year collaboration, 72MW of new Australian capacity, up to 40,000 Grace Blackwell GB300) from SharonAI Holdings, “Six Year Strategic Compute Collaboration with NVIDIA,” June 12, 2026 (BusinessWire; corresponds to the Company’s Form 8-K filed June 12, 2026).

[5] The credit-support and backstop model packages arrangements Nvidia previously ran case by case — a demand backstop with CoreWeave and a reported ~$30 billion equity investment in OpenAI (a separate data-center lease guarantee was reported to be under discussion, not executed, as of mid-2026): AI Weekly, July 2026.

[6] Sundar Pichai, Alphabet Q1 FY2026 earnings call, April 29, 2026: Google to sell TPUs to a “select group of customers,” Data Center Dynamics.

[7] AWS signaled external Trainium sales in Andy Jassy’s April 2026 shareholder letter (The Next Web); AWS’s Peter DeSantis confirmed exploratory talks in June 2026 (reported by Bloomberg; summary).

[8] Two Chips, One Decade, One Winner, The AI Realist, May 27, 2026.

[9] CFO framing that the program targets companies with demand but insufficient access to financing: Nvidia launches revenue-sharing model, Yahoo Finance, July 2026; see also NVIDIA blog [2].

[10] Bleecker Street Research, “SharonAI (SHAZ),” April 30, 2026. Report authored by a short seller with a disclosed position; allegations to be examined against primary filings in the follow-up piece.

[11] SharonAI corrected its FY2025 Form 10-K, which had described NVIDIA as a “strategic shareholder,” to state that NVIDIA holds no equity securities of the company: SharonAI Holdings Form 8-K correction.

The Models That Learned Physics

Julien Simon — Tue, 30 Jun 2026 09:27:03 GMT

Disclosure: Simcon, used here as a worked example, is a portfolio company of Fortino Capital, where I am an AI operating partner. I’ve kept this piece to what Simcon has already made public, and the live demo is open to anyone.

The conversation about generative AI has narrowed to two things it does extremely well: write text and write code. But that is a small slice of what the underlying machine turned out to be good at.

The transformer was built for language. Then it generalized. The same architecture that predicts the next word learned to generate images, then to model protein structures, then to forecast time series and weather. Each jump landed in a domain that looked nothing like the last, and each time the lesson repeated: feed a transformer enough diverse examples of a thing, and it learns a representation of that thing general enough to handle inputs it never saw.

So here is the question that follows naturally and almost nobody is asking out loud: what happens when you point it at industrial engineering data and complex 3D physics? Not text, not pixels: the airflow over a car, the heat moving through a turbine, the way molten plastic fills a mold. The data that engineers generate daily by the terabyte, which never touches the public internet.

Subscribe now

The part of AI that doesn’t read

Everything the frontier labs compete on is trained on the internet: text, code, images, and video. That data is effectively a global commons: everyone scrapes the same web, so no one owns the input.

The physical world is the opposite. The data that describes it lives inside the companies that build things, and it never gets posted anywhere. Decades of crash tests, wind-tunnel runs, thermal cycles, material trials, and — the unsung hero of modern engineering — simulation results. Before a car, a phone case, or a medical device gets built, engineers simulate it. Simulations are used to predict and optimize the quality and cost issues that will arise during manufacturing and how the parts will behave in the real world. The result is fewer costly defects in the real world. To predict the physics, they run numerical solvers that chew through large systems of partial differential equations, which is computationally intensive. These runs are slow, expensive, and have been the backbone of industrial design for decades.

They are also a training set. Every solver run is a labeled example: this target geometry, these conditions, this outcome. A company that has run millions of them is sitting on something no web scraper can ever reach.

The bet behind Physics AI is that you can train a model on that simulation output, the way a language model is trained on text, and get a network that has, in effect, learned the physics. Not the equations. The behavior. Feed it a shape it has never seen, and it predicts the result, in seconds, without solving anything.

The category now has a name: Large Engineering Models. The label is deliberate. It claims the same lineage as Large Language Models — the same transformer architecture underneath, the same idea that scale and diverse data produce something that generalizes — but is trained on the physical world rather than language.

Why this didn’t work before

Engineers have wanted fast simulation forever, and the idea of replacing a slow solver with a fast approximation is old. The approximations are called surrogate models, and until recently, they came with a catch that made them nearly useless for real design work.

A classical surrogate is fitted to a specific problem. Train it on one family of parts, and it interpolates nicely within that family. It also falls apart the moment you hand it a geometry it hasn't seen before. It learned the answers, not the physics. Engineers got a tool that was fast exactly where they didn’t need help and unreliable everywhere they did.

The numerical solvers had the opposite profile: accurate and trustworthy across any geometry, but far too slow to run inside a design loop. So the trade-off stood. Fast or general: pick one.

What changed is the architecture. The transformer — the same design that made language models work — turns out to be good at consuming large, diverse collections of physical examples and learning a representation that holds up on inputs it was never trained on. The published method these systems draw on, Universal Physics Transformers, was demonstrated on automotive aerodynamics in a 2025 peer-reviewed paper. [2] The detail that matters for a non-specialist is simple: it was built to generalize across shapes, not memorize a few. That is the wall the old surrogates hit, and it is the wall the new models are designed to go through.

Fast and general, at the same time. That is the whole claim. Everything else is engineering.

A real-life LEM you can run in a browser

Abstractions are easy to oversell, so here is a concrete one: plastic injection molding, in a corner of manufacturing that most people never think about. It is how a staggering share of the plastic object parts around you were made: caps, casings, connectors, dashboards. A mold costs six or seven figures and takes months to design. Get the design wrong, and the plastic will not fill the cavity properly. And you will only find out after the steel is cut.

So engineers simulate the fill first. Historically, that meant a numerical solver and a wait of minutes to hours per design, which in practice limits how many variations you can reasonably try.

This is where domain expertise and data decide everything, and it is Simcon’s. The German company has developed injection-molding simulation software for over 35 years, used by manufacturing world leaders such as Bosch, Continental, Roche, and Arburg. And now they’ve built, trained, and deployed a Transformer-based model for 3D physics.

Their Cadmould AI Solver is trained on millions of simulation runs generated by their own numerical solvers — the proprietary ground truth I described earlier, accumulated over decades and turned into a training corpus no one else has. The architecture came from a research collaboration; the physics, the data, and the validation are Simcon’s, and the company now owns the model outright, trains it in-house, and runs the cloud infrastructure that hosts it for customers. [3] Simcon bills it as the first Large Engineering Model for injection molding. Results in seconds instead of hours, a speedup the company puts in the range of 200 to 1,000 times, across part shapes the model was never trained on. [4]

You don’t have to take the number on faith. Simcon put a research preview on the open web. It runs in a browser, on a mid-tier cloud GPU, and the geometries it ships with were explicitly not in the training data, to show it generalizes rather than parrots. [5] You change a parameter, you watch the fill pattern redraw, you change it again. The hours-long loop becomes a conversation. Anyone reading this can try it online, and you can also schedule a demo with the Simcon team.

The current model covers the filling stage of the molding process. The cooling, shrinkage, and warpage steps, which are decisive for many real-world outcomes, are on the roadmap. Accuracy is reported within a few percent of the numerical solver and improves as the training set grows. [6] The AI is a fast compass for exploration, and you can still validate the chosen design with the classical solver before cutting steel. That framing — AI to explore, trusted solver to confirm — is the sober version of the technology, and it is more convincing than a claim of replacement.

It is also more than a division of labor. The two engines feed each other. The model lets engineers explore thousands of designs in the time it would take the solver to check one; the solver then verifies the chosen design at full accuracy — and every such verification run is a fresh, high-fidelity training example for the next version of the model. Fast exploration surfaces the designs worth checking; precise validation turns the checks into new data; the new data makes the next model better at exploring. The loop closes in favor of whoever owns both engines. A company with only a fast model has a clever demo. A company with only a solver has what the industry already had. The advantage goes to the one running both, because each loop around widens the lead.

Why Europe, for once, is well positioned

The reflex in any AI story is that the US trains the biggest models and Europe writes the rules. In language, that is broadly true. Large Engineering Models invert one piece of it.

The scarce input here is not compute or web text. It is high-fidelity physical data from real industrial processes that lives disproportionately within European industry. Europe’s manufacturing sector holds potentially a century or more of accumulated knowledge and data: how materials behave, how processes fail, how a good part differs from a bad one, captured across generations of engineers and now sitting in solver archives, test logs, and process records.

A model is only as good as the data and the domain knowledge behind it, and those don’t transfer in a deal. They sit with the companies that have spent decades generating high-fidelity physical data — most of them industrial firms, many of them European, none of them frontier labs.

That is the moat. It is the one input a frontier lab cannot buy, scrape, or out-compute, and it is the thing that looks like a legacy liability in the software era right up until it becomes the training corpus for an entire category. Germany’s machine builders, the automotive supply chain, the molders and tool shops: each is a reservoir of exactly the data these models need, and the web does not contain. A simulation company with 35 years of its own solver output turning into a defensible AI asset is not a fluke. It is the shape of the whole opportunity.

This is also why the European Commission, in its Apply AI Strategy last October, named manufacturing a strategic sector and tied sectoral AI adoption to reducing Europe’s dependence on non-EU technology. [7] The advantage is not guaranteed — owning the data is not the same as building the models or the businesses on top of them, and that gap is where most of the value will be won or lost. But the raw material sits on the right side of the Atlantic, which is not something you can say about most of the AI race.

Why synthetic data isn’t enough

A common objection is that synthetic data dissolves the moat: if you can generate training data on demand, the proprietary corpora stop being scarce, and the advantage migrates back to whoever has the most compute.

However, this objection is weaker than it looks. Valuable synthetic data is not random data. It is data that captures the rare failure, the edge case, the point where the physics turns nonlinear, and a part that looked fine starts to warp during cooling. Knowing which scenarios are worth generating and whether a generated sample is physically trustworthy or quietly wrong is itself domain expertise. You cannot synthesize your way past not knowing what matters. Synthetic generation doesn’t remove the need for decades of accumulated know-how; it raises the price of admission to a layer where that know-how is even scarcer.

A model you can access in a browser is predicting the 3D physics of parts it never saw, in seconds, and a company that knows the cost of getting it wrong is putting it in front of customers — alongside, not instead of, the solver they already trust. The first models to read and write the world got the headlines. The ones learning to predict it may turn out to matter more to the people who build things, and Europe is holding more of the raw material than it has in any other part of this race.

The machines are starting to learn physics. The question worth asking is, who can you trust to teach them, and on whose data?

Notes

[2] Benedikt Alkin et al., “AB-UPT: Scaling Neural CFD Surrogates for High-Fidelity Automotive Aerodynamics Simulations via Anchored-Branched Universal Physics Transformers,” Transactions on Machine Learning Research, accepted October 2025 (arXiv:2502.09692). The published architecture targets automotive aerodynamics computational fluid dynamics; its application to injection molding is a separate, domain-specific implementation. Code released by Emmi AI on GitHub; paper on arXiv.

[3] Simcon GmbH, “SIMCON Unveils World’s First Large Engineering Model for Plastic Injection Moulding,” BusinessWire, March 18, 2026. The Cadmould AI Solver is described by Simcon as co-developed with Emmi AI on the model architecture; the training data, domain validation, and commercialization are Simcon’s per the company’s own product and scientific pages. CEO quote and product framing from the same release. BusinessWire.

[4] Speed range and “trained on over a million simulation trajectories” per Simcon’s product and technical pages; the 200–1,000× and “up to 1000×” figures are vendor-claimed and not independently reproduced. Trade-press coverage (Plastics Today, Plastics Technology, MoldMaking Technology, March 2026) repeats the “up to 1,000×” figure sourced to Simcon. Simcon.

[5] Research preview runs in-browser on a cloud GPU; Simcon states the demo geometries are not part of the training data. Live at simcon.ai. Simcon demo.

[6] Filling-stage scope, roadmap to packing/cooling/shrinkage-and-warpage, accuracy reported “within 2–5% of numerical methods,” and the explicit “explore with AI, validate with classical solver” workflow are all per Simcon’s public materials and CEO statements. Accuracy figures are vendor-claimed. Simcon scientific page.

[7] European Commission, “Apply AI Strategy,” COM(2025) 723, published 8 October 2025. The strategy names manufacturing among its strategic sectoral flagships (deploying “agentic” AI to optimise production lines, targeted Q4 2026) and frames sectoral AI adoption as part of strengthening European digital sovereignty and reducing dependence on non-EU technology providers. EUR-Lex.

Too Dangerous for You, Free for Everyone

Julien Simon — Sun, 28 Jun 2026 14:56:19 GMT

Update, July 1, 2026: On June 30, Washington threw the switch again. Commerce withdrew its own June 12 export-control letter and cleared both Mythos 5 and Fable 5 for release. "Diversion risk," Secretary Lutnick's own term, decided the fate of both models in one sitting. Fable 5, Anthropic's flagship, is returning under the safeguards Washington requested. This is not a loose end. It is the American door doing what this piece describes: one government, on its own clock, deciding which model the world's leading AI lab is allowed to ship.

On the morning of June 26, 2026, OpenAI released its most capable model and refused to let most people use it. GPT-5.6 Sol, the new flagship, went out as a limited preview to a short list of partners whose names OpenAI had shared with the US government, with general availability promised within weeks. In the same announcement, the company objected to the arrangement it was complying with, writing that it did not believe “this kind of government access process should become the long-term default.”[1]

That afternoon, the Commerce Department signed a letter restoring a competitor’s restricted model to the hands of more than 100 vetted American organizations: Anthropic’s Claude Mythos 5, which had been dark for two weeks after the same government forced it offline.[2]

Same day. Two labs. The frontier of artificial intelligence moved behind a government desk, and one of the firms that walked through the door used its launch to complain about the door.

None of this existed 80 days ago. It is now how the frontier ships.

Three Doors, and None of Them Opens Outward

For two years, the question about frontier models was which one is best. That question is now close to useless, because the three blocs that produce and govern these models have each turned access to the best of them into a matter of state policy, in three opposite directions.

The United States gates its most powerful models to government vetting. The European Union writes rules for a frontier it does not lead, aimed mostly at models built elsewhere. China does the opposite of both: it ships its best models as free downloads, and those models now account for the majority of the world’s open-model traffic.

The question is no longer which model is best. It is which government’s hand you can tolerate on the switch, and whether any of these doors is open in the way it appears to be.

The American case is the loudest. Anthropic spent the spring restricting Claude Mythos, an unreleased model it said could find decades-old security flaws on its own, to a short list of trusted partners.[3] On June 2, the White House signed an order asking developers to grant the government up to 30 days of access to “covered frontier models” before release.[4] Ten days later, a Commerce directive pulled two Anthropic models offline worldwide in roughly 90 minutes, including one commercial product serving hundreds of millions of users.[5] Two weeks after that, the same agency let the more dangerous of the two back in, for the vetted few. OpenAI’s GPT-5.6 followed the identical pattern on the identical day.

The European case is the quietest and, on paper, the most powerful. On August 2, 2026, five weeks from now, the EU’s AI Office gains the power to demand information from frontier developers, order changes, levy fines, and recall models from the market.[6] The largest models in scope are American.[7] Europe’s own frontier model, a publicly funded open-source effort, won the right to be built six days before this writing.[8] It does not exist yet.

The Chinese case is the one nobody is regulating, and everybody is using. At the end of 2024, Chinese open-weight models accounted for about 2 percent of the tokens flowing through the largest neutral model router. By the middle of 2026, they carried roughly 60 percent of them while that router quadrupled in size.[9]

The three doors look like a menu of safety, regulation, and freedom. They are nothing of the kind, and the door that looks free is the one quietly moving the switch.

The American Door: From Secure-and-Release to Ask-Permission

To see how far the United States has moved, start with the moment it set the opposite precedent.

In February 2019, OpenAI announced a language model called GPT-2 and declined to release it, citing the risk of fake news and impersonation. The decision split the field: some read it as responsible caution, others as a marketing performance that withheld a research artifact while implying it was a weapon. Nine months later, OpenAI released the full model and reported it had seen no strong evidence of misuse.[10] The harms had not arrived.

The industry took a lesson from the episode, and it was not “withhold.” It was the opposite: secure, then ship. Red-team the model, write a system card, publish a responsible-scaling policy, and release through an interface you control. For seven years, that habit held on a single assumption: that the lab decides when a model goes out.[11]

2026 broke the assumption, and not because the labs changed their minds. The state intervened in the decision.

The opening move was Anthropic’s. In April, it launched Project Glasswing around Claude Mythos, a model it kept out of public release and handed to roughly a dozen launch partners and a few dozen more organizations under usage credits. Anthropic backed the restriction with findings, not just adjectives: it said Mythos had autonomously surfaced vulnerabilities that had survived decades of human review, including a 27-year-old flaw in OpenBSD’s networking code and a 16-year-old one in one of the most widely used media libraries in the world.[12]

Those specific findings hold up. The patches exist. The advisories are public. But the framing around them deserves the scrutiny that the access restriction prevents. When independent researchers got hold of cheaper, openly available models, several of the showcase bugs fell to them too, one to a model costing a fraction of a cent per query.[13] A widely respected security commentator who is hard on AI hype judged the danger credible, and noted in the same breath that calling your model too dangerous to release is an excellent way to build buzz around it.[14] Both things are true at once. A safety claim and a capability advertisement are not mutually exclusive, and when the model is locked away, the advertisement cannot be checked.

You cannot benchmark what you cannot run.

The second move was the government’s. On June 2, the White House issued an order creating a voluntary path for developers to give the government up to 30 days of pre-release access to the most capable models, paired with a classified, NSA-led process to define which models qualify based on their cyber capabilities. The order is careful to bar any mandatory licensing scheme.[15] It is an invitation, not a law.

The third move showed what the invitation is worth when the government decides not to wait for it. On June 9, Anthropic launched Claude Fable 5, a commercial model it described as a Mythos-class system made safe for general use, with sensitive requests routed to a tamer model.[16] Three days later, at 5:21 p.m. Eastern, a Commerce export-control letter required a validated license before either model could reach any foreign national, including Anthropic’s own foreign-national staff. Unable to filter users by citizenship in real time, the company took Fable 5 and Mythos 5 down everywhere. A model serving hundreds of millions of people went dark in about 90 minutes, over a jailbreak Anthropic said was narrow and reproducible on other public models.[17]

Read those three moves in sequence. The decision about whether the public can use the best American model has migrated from the lab to Washington. OpenAI says the broad release of GPT-5.6 is only weeks away, and it may be; staging is not the same as exclusion. But the most capable tier, in the window that decides who gets the edge first, is gated by government vetting, and “weeks away” is a promise, not a shipped product. The public gets the safe-for-general-use version, or it waits. GPT-5.6 on June 26 was not a new direction. It was the second lab arriving at the same door.

OpenAI was candid that Sol had not crossed its own threshold for critical cyber risk. The model was gated not because the lab judged it too dangerous to ship, but because the government asked and the lab agreed, while the two worked out the access framework that the June 2 order set in motion. What governs release now is not the model’s capability. It is who gets to say yes.

This is not a tidy story of state capture. The June 2 order is voluntary and forbids licensing. OpenAI publicly protested the very vetting it submitted to. And Anthropic is suing the administration that gates its models, after the Defense Department tried to brand it a supply-chain risk for refusing to drop two narrow limits on its product: no mass domestic surveillance, no fully autonomous weapons.[18] The contradiction runs deep enough to be comic: the United States government simultaneously treats Anthropic as a national-security risk and as the only frontier model it has cleared for use up to the Secret level.[19] Read charitably, that is two arms of government disagreeing in good faith about a real tradeoff. Read at the level of what happened on the ground, with a model pulled, a company branded, and a competitor handed the contract, it looks less like a safety policy than a fight over who holds the switch.

There is one more piece, and it belongs to the man who built the model that got pulled. Two days before Commerce pulled it, Anthropic’s chief executive published an essay calling for binding rules on frontier AI modeled on the FAA and aircraft: testing, auditing, and a government power to block a release it judges unsafe.[20] The authority that hit him two days later was not that one. A pre-release safety review is not an export-control recall, but the through-line is the same, and it is the uncomfortable part.

The labs that built the frontier are now, in their different ways, asking the state to hold the switch they once held themselves. They may not like the hand that takes it.

And here is the loop that makes the American door self-defeating. Each turn of the gate raises the cost and the political risk of depending on a controlled American model. Every enterprise that feels that cost starts looking for an alternative, the United States cannot reach. There is one. It is open, cheap, and Chinese. And the action that pulled Mythos was, by the government’s own reported concern, about keeping that very model away from China, which means Washington’s defense against Chinese AI is quietly herding the market into China’s arms. The tighter Washington shuts its door, the more of the world’s usage walks out the back.

The European Door: A Customs House on a Road It Doesn’t Own

Europe’s posture is the strangest of the three, because it is built around a gap.

The EU AI Act sorts general-purpose models by the compute used to train them. Cross a threshold of ten-to-the-25th operations and a model is presumed to carry “systemic risk,” which brings obligations to test it adversarially, assess and reduce its dangers, report serious incidents, and secure it.[21] These rules have been on the books since August 2025. What arrives on August 2, 2026, is the enforcement: from that date, the AI Office can compel information, mandate changes, fine a provider up to 3 percent of global revenue, and order a model pulled from the European market.[22]

The trouble is what that threshold now catches. 10^25 operations was the size of GPT-4 in 2023; the frontier has since run more than an order of magnitude past it, and the largest training run on record sits some fifty times above the line.[23] Dozens of models from a dozen labs now clear it. So the tier that the EU polices is not just the frontier. It is a rung below the leaders, who are American. Europe does have a lab on the other side: Mistral signed the same code. But it trails the models the danger conversation is about, and the continent has no model at the frontier that the rules were written to govern. Its answer to that gap is a consortium, selected on June 19, that won the right to build an open-source frontier model in all 24 official languages on European supercomputers, running on Nvidia silicon.[24] The model is a plan. The regulator is operational.

The European door mostly governs models built in America, which run on infrastructure largely owned by Americans. It is a customs house on a road it does not own. Yet, the rules have teeth, the fines are large, and governing the compliant is not nothing. Europe’s deeper power is the market itself: the threat of exclusion from 450 million consumers has bent more than one American product to Brussels rules before. But a recall and a market ban are both switches on someone else’s model.

When the EU pulls a frontier model, it removes a product from its market that isn't made by a European company, and that will keep selling it everywhere else.

The bloc that talks most about digital sovereignty has arranged to hold the off-switch for everything except a model it controls.

The Chinese Door: Why “Open” Doesn’t Mean Unlocked

China runs the opposite play, and on the numbers, it is winning.

While the United States restricts and Europe regulates, Chinese labs ship. DeepSeek, Alibaba’s Qwen, Zhipu’s GLM, Moonshot’s Kimi: a steady cadence of frontier-adjacent models released as free downloads under permissive licenses. The usage curve is the whole argument.

Chinese open-weight models went from about 2 percent of the tokens on the largest neutral model router at the end of 2024 to roughly 60 percent by the middle of 2026.

That router measures where developers send cost-sensitive work, not a census of all AI use, and over the same stretch, it grew fourfold, with coding rising to more than half of all traffic.[25] A separate count agrees from a different angle: on the world’s main model hub, Chinese developers accounted for roughly 41 percent of downloads over the trailing year, overtaking the United States.[26] The silicon underneath is increasingly China’s own, too; the leading open labs now train and serve on Huawei’s Ascend chips rather than Nvidia’s, so the diffusion no longer runs on hardware Washington controls. China did not just take a share. It took the majority of a market that quadrupled.

This is where the obvious objection arises. An open-weight model on your own machines has no off-switch. Nobody can revoke a file you have already downloaded. If that is true, then China’s door is not a door at all. It is an open field, and the symmetry of this whole piece collapses.

It does not collapse, because open weights are not open access.

Consider the model at the top of the open leaderboard. Zhipu’s GLM-5.2 has 744 billion parameters: about 1.5 terabytes of weights at full precision, roughly half that at the compressed precision most deployments use, every byte of which must sit in graphics memory at once. The reassuring figure you will hear, that only 40 billion parameters are active at a time, is a statement about speed, not memory: the whole model still has to be resident to run. In practice, that means a multi-node cluster of high-end accelerators, not a workstation or a laptop.[27] That is why the usage the router measures is hosted usage: these models are reached through an endpoint, DeepSeek’s own or a Western reseller’s, not run on the premises of the firms using them.

The switch, then, does not disappear. It relocates. It moves to the hosted endpoint, which can be suspended, rate-limited, geo-blocked, or repriced. And it moves to the data, because every prompt and every output now travels to whoever runs the endpoint: a provider under Chinese law if you call DeepSeek directly, or a Western intermediary with its own logs if you route through one. Calling a Chinese model through Azure removes the question of Chinese jurisdiction over your data while preserving the cost advantage; calling it directly does not.[28] The only path that escapes the endpoint entirely is self-hosting, and self-hosting the frontier is gated by capital, which puts it within reach of roughly the same set of organizations that could afford to buy into an American-vetted tier.

The freedom is real at the license and illusory at the rack.

None of this shows up in the price comparison that pulls enterprises toward the Chinese door in the first place. The headline is real: the leading open models run at roughly a sixth of the per-token cost of the American frontier. But the rate card flatters the invoice. These models reason at length before they answer, spending tens of thousands of tokens on a single task, so the gap on the bill comes out narrower than the gap on the price list.[27] And the firm that tries to escape the endpoint by self-hosting trades the API bill for a six- to seven-figure cluster and a team to operate it. Most do the rational thing and stay on the hosted endpoint, which means staying on the switch. The cost advantage that makes the door attractive is the same force that keeps the buyer renting access instead of owning it. Cheap is the lure. The endpoint is the hook.

There is a second lock most analyses miss, and ordinary use does not pick it. The content controls are baked into the weights. Independent testing finds that Chinese open models, including DeepSeek and Qwen, refuse or steer away from Taiwan, Tiananmen, and Xinjiang, and that this steering persists in the weights even in locally run copies. Standard fine-tuning does not remove it. It can be stripped: the abliteration methods that tear the safety scaffolding out of open models also work here. But doing so takes deliberate effort, costs capability, and never fully succeeds, and the fact that you must operate on the weights at all to get a neutral answer is itself the tell. An open-weight model is not neutral. It ships with a foreign government’s preferences embedded in its parameters, and the zero price that makes it spread carries those preferences along.[29] That split is the whole posture. At home, China runs one of the tightest content systems in the world, every public-facing model registered and assessed by the state; abroad, it gives the models away.

Control where it governs, diffusion where it competes.

One last item belongs here, and it weighs against the American gate, not the Chinese door. On June 10, Anthropic told the Senate Banking Committee that operators tied to Alibaba and its Qwen lab had run roughly 25,000 fraudulent accounts and 28.8 million exchanges against Claude to copy its abilities by extraction rather than training.[30] Treat it as an interested party’s allegation, because it is one: it comes from the company with the most to gain from the gating system, filed the same week its chief executive called for government power to block model releases. But if it is true, it lands on the gate, not the open door. You cannot lock up a capability that walks out through your own interface, 28.8 million exchanges at a time.

The Same Switch, Installed Three Ways

Readers of this publication have seen this shape before. An earlier piece mapped a three-layer off-switch over any AI dependency (the chips, the cloud, and the model) and asked what happens when someone throws it on purpose rather than by accident.[31] What 2026 added was the installation of that switch at the national policy level across three countries at once, by three governments that agree on almost nothing.

The seven-year settlement that followed GPT-2 rested on one quiet premise: the lab decides when a model ships. All three blocs have now broken that premise from different directions. The United States moved the decision to Washington and made the best models a government-vetted tier. Europe claimed a veto, the recall, over models it did not build. China dissolved the decision at the license layer and reinstalled it twice, at the endpoint and inside the weights.

What the three share is not motive. The United States is keeping its frontier from China and fighting itself over who holds the gate, Europe is compensating for an industry it lacks, and China is doing what a challenger does when it cannot win the top tier outright: giving away the layer below it, whether by design or by the plain logic of competition, until the incumbent’s moat is a commodity. What they share is the result, and the result lands on the same person every time: the enterprise downstream of all three now depends on a switch it does not hold, and on a body of law it did not write.

What Would Have to Break

On the only number that compounds, usage, the open door is winning. “Too dangerous for you” is losing to “free for everyone” in the market by a wide and widening margin.

But winning hides the trap. The enterprise that routes to the Chinese stack to get out from under the American switch lands on the Chinese endpoint’s switch and under Chinese data law, carrying a model with Beijing’s editorial line inside it. It did not escape control. It swapped Washington’s switch for Beijing’s, and took on Chinese data law in the bargain. Most of the firms making the move have not priced that.

Three developments would break this read, and each is worth watching.

A frontier-parity model small enough to self-host cheaply (a step change in compression, or a model under 100 billion parameters that matches the leaders) would open a switch-free door for real, and the argument that there is no such door would fail.
A US public tier that ships at full capability, with no detuned version held back, would end the two-class frontier and turn the vetting into a formality.
A government-vetted model that visibly stops harm and openly available models that would go on to cause harm would be the first evidence that the gate does safety work rather than turf-holding.

None of the three has happened yet. Until one does, the pattern holds.

The lesson for anyone allocating capital or choosing a stack is not a recommendation for one door over another. It is that the doors were never the choice they appeared to be.

You are not picking the best model. You are picking which government’s hand rests on the switch, and whose law your prompts live under.

So price the switch as what it is: not an outage risk to be solved with a second region, but a control risk that earns its own line in the vendor register, with a tested path to a second model and the standing assumption that the vetted tier and the open tier each carry a different hand, not no hand.

The one door that looks like freedom only moved the switch to a place you were not watching, and the bill for not watching comes due the first time someone decides to throw it.

Notes

[1] OpenAI, “Previewing GPT-5.6 Sol: a next-generation model”, June 26, 2026. The GPT-5.6 series (Sol, the flagship, plus Terra and Luna) launched as a limited preview to a small group of partners whose participation OpenAI said it had shared with the US government, with general availability planned within weeks. OpenAI objected to government-gated access as a long-term default and tied the step to its work with the Administration on the cyber Executive Order framework. System card: GPT-5.6 Preview. OpenAI states the model does not cross its critical cyber-risk threshold under its Preparedness Framework; the gating reflects caution and government request rather than a declared red line.

[2] US Department of Commerce letter from Secretary Howard Lutnick to Anthropic chief compute officer Tom Brown, Friday June 26, 2026, lifting the export-control license requirement for Claude Mythos 5 for entities named in the letter’s Annex A and their foreign-national employees. Mythos 5 only; the letter is silent on Fable 5, which remained restricted, with talks reportedly moving toward its release on an unclear timeline. Lutnick wrote that “appropriate safeguards are in place to permit certain trusted partners” to access the model. Reported by Semafor (Reed Albergotti and Ben Smith), “US releases powerful Anthropic model Mythos to some US companies”, June 26, 2026. The move came the same day as OpenAI’s GPT-5.6 limited release.

[3] Anthropic, “Project Glasswing: Securing critical software for the AI era”, April 7, 2026 — Claude Mythos Preview restricted to 12 launch partners (AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks, with Anthropic) plus roughly 40 additional critical-infrastructure organizations under $100M in usage credits; later expanded to about 150 more.

[4] The White House, Executive Order 14409, “Promoting Advanced Artificial Intelligence Innovation and Security”, June 2, 2026 — Section 3 creates a voluntary framework for up to 30 days of pre-release government access to “covered frontier models,” designated through a classified, NSA-led benchmarking process; the order expressly bars any mandatory licensing, preclearance, or permitting requirement.

[5] See [17].

[6] European Commission, “Regulatory framework on AI”; AI Office enforcement powers (information requests, mandated mitigations, fines, model recalls) apply from 2 August 2026.

[7] The largest in-scope models by training compute are American (OpenAI, Google, Anthropic, xAI); see [23].

[8] European Commission, “Commission selects EUROPA consortium as the winner of the Frontier AI Grand Challenge”, 19 June 2026. The Domyn-led EUROPA consortium will build an open-source frontier model (400+ billion parameters, Mixture-of-Experts) in all 24 official EU languages, on EuroHPC supercomputers (up to 2.5% of capacity for one year) plus a reported 6,000-chip NVIDIA Blackwell cluster. The model does not yet exist.

[9] OpenRouter token-share data, corroborated by an OpenRouter–Andreessen Horowitz study of ~100 trillion tokens (relayed by South China Morning Post, December 8, 2025 — note SCMP is owned by Alibaba; cite the underlying study) and by Data Gravity, “China’s Open-Weight Takeover”, May–June 2026. Figures are hosted-API traffic on a developer-skewed router, not enterprise deployment.

[10] OpenAI, “GPT-2: 1.5B Release”, November 5, 2019 — full model released after a staged rollout, with no strong evidence of misuse reported.

[11] the-decoder, “From GPT-2 to Claude Mythos: the return of AI models deemed ‘too dangerous to release’” — on the industry’s shift to “secure-then-release.”

[12] Anthropic Frontier Red Team, “Assessing Claude Mythos Preview’s cybersecurity capabilities”, April 7, 2026 — autonomous discovery of zero-day vulnerabilities including a 27-year-old OpenBSD TCP SACK remote-code-execution flaw and a 16-year-old flaw in a widely used media library (FFmpeg, H.264), among thousands across major operating systems and browsers; a related 17-year-old FreeBSD NFS RCE was assigned CVE-2026-4747.

[13] AISLE (Stanislav Fort, founder), “AI Cybersecurity After Mythos: The Jagged Frontier”, April 7, 2026, with the full prompts and model responses published on GitHub. AISLE isolated the code behind Anthropic’s showcase vulnerabilities and ran it through small, cheap, open-weight models: eight of eight tested models detected the flagship FreeBSD bug, including a 3.6-billion-active-parameter model at $0.11 per million tokens, and a 5.1-billion-active open model recovered the core chain of the 27-year-old OpenBSD flaw. Corroborated by VentureBeat and CNBC, which note other firms (watchTowr, Vidoc) likewise reproduced Mythos results with public models. AISLE’s thesis: the moat is the system, not the model.

[14] Simon Willison, commentary on the Mythos restriction and on “too dangerous to release” as a buzz-building move, April 2026.

[15] See [4].

[16] Anthropic, “Claude Fable 5 and Claude Mythos 5”, June 9, 2026 — Fable 5 is the generally available Mythos-class model (a tier above the Opus class), carrying cybersecurity, biology, chemistry, and distillation safeguards that defer flagged queries to Claude Opus 4.8 (triggering in under 5% of sessions); Mythos 5 is the same model with those safeguards lifted, restricted to Project Glasswing partners. Both priced at $10/$50 per million tokens.

[17] On June 12, 2026 (5:21 p.m. ET), Commerce’s Bureau of Industry and Security issued an “Is Informed” letter to Anthropic under the Export Control Reform Act of 2018 (50 U.S.C. § 4817(b)(1)) and EAR § 744.22(b), requiring an individually validated export license before either model could reach any foreign national worldwide, including Anthropic’s own foreign-national staff (a “deemed export”). This is a license requirement under existing export-control authority, distinct from the June 2 executive order and not a finalized EAR rule. Unable to filter users by nationality in real time, Anthropic disabled Fable 5 and Mythos 5 globally and characterized the cited jailbreak as narrow and reproducible on other public models. Per Semafor’s reporting, the underlying US concern was that Mythos had reached partners seen as too closely linked to China (reportedly a South Korean telecom). As of publication, Fable 5 remained restricted. Reporting: Semafor (June 13 and June 26, 2026); The Conversation, “Why the US government shut down Anthropic’s latest Claude AI model”; Greenberg Traurig client alert, June 2026.

[18] Congressional Research Service, “Federal Government and Anthropic: Considerations for AI Innovation and Competition”; NPR, “OpenAI announces Pentagon deal after Trump bans Anthropic”, February 27, 2026. Dispute centered on Anthropic’s refusal to permit mass domestic surveillance and fully autonomous weapons use; DoD moved to designate Anthropic a supply-chain risk; a federal court blocked most of the designation as punitive.

[19] Center for American Progress, “The Trump Administration Is Trying To Make an Example of the AI Giant Anthropic”, March 4, 2026 — Claude described as the only frontier model cleared for US government use up to the Secret level.

[20] Dario Amodei, “Policy on the AI Exponential”, June 10, 2026 — proposing FAA-style mandatory third-party testing and auditing of frontier models, with government authority to block or reverse a release that fails safety standards. A pre-release certification proposal, distinct in kind from the June 12 export-control action.

[21] EU AI Act, Articles 51 and 55; presumption of systemic risk above 10^25 training FLOP; obligations include model evaluation, adversarial testing, systemic-risk mitigation, serious-incident reporting, and cybersecurity.

[22] European Commission, AI Act enforcement timeline; from 2 August 2026 the AI Office may issue information requests, require corrective measures, levy fines up to 3% of global turnover or €15M (whichever is higher), and ultimately restrict or withdraw a model from the EU market. “Recall” is used in the body as a plain-language gloss for that withdrawal/restriction power.

[23] The EU AI Act presumes systemic risk above ten-to-the-25th training FLOP, a level calibrated to GPT-4-class compute in 2023. By mid-2025, Epoch AI’s database identified 30+ models from roughly a dozen developers over that threshold — OpenAI, Google, Anthropic, Meta, xAI, and Mistral among them, alongside Chinese labs — with the count rising through 2026. The largest known training run, xAI’s Grok 4, is estimated at ~5×10^26 FLOP, roughly fifty times the line, and Epoch notes monitoring thresholds “may need to rise correspondingly over time” to stay focused on frontier capability. The systemic-risk tier is therefore a wide field below the capability frontier, not a roster of the most advanced models.

[24] See [8].

[25] See [9]. Router growth from roughly 5 trillion tokens per week (April 2025) to over 20 trillion (April 2026); coding rose from about 11% of usage to more than 50% over the period.

[26] Hugging Face, “State of Open Source on Hugging Face: Spring 2026”, March 17, 2026, reporting Chinese developers at roughly 41% of Hub downloads over the trailing year, overtaking US developers; grounded in the study “Economies of Open Intelligence” (851,000 models; 2.2 billion downloads). Disclosure: the author served as Chief Evangelist at Hugging Face through 2023; the Hub download figures are cited from Hugging Face’s own published report and corroborate, rather than originate, the OpenRouter traffic trend.

[27] GLM-5.2 specifications and self-hosting requirements: Z.ai model card; Simon Willison, “GLM-5.2”, June 17, 2026; Artificial Analysis. 744B total parameters (40B active, Mixture-of-Experts), ~1.5 TB of weights at BF16, all of which must reside in GPU memory; serving guides converge on multi-node accelerator clusters. Pricing runs roughly one-sixth of US frontier per token, but heavy reasoning-token usage (tens of thousands of tokens per task) narrows the real cost gap (Artificial Analysis). The capital constraint applies to frontier-parity open models; smaller self-hostable models (e.g., Qwen 3.5’s 0.8B–9B line) are not at the frontier.

[28] Data-jurisdiction handling for hosted Chinese models: calls to Chinese-operated endpoints route through Chinese-jurisdiction servers; Western intermediaries (e.g., Azure) eliminate that exposure while preserving cost. Independent provider documentation and analysis, 2026.

[29] Content controls in Chinese open models. Independent studies document that Chinese open-weight models (Qwen, DeepSeek, and MiniMax among them) are trained to refuse, deflect, or assert falsehoods on PRC-sensitive topics: Taiwan, Tibet, Xinjiang, the 1989 Tiananmen Square protests, and Falun Gong. Researchers describe this as “embedded local censorship” that sits in the base weights and persists even when the model is run locally. See, e.g., “Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation” (March 2026), and “R1dacted: Investigating Local Censorship in DeepSeek’s R1”. Standard fine-tuning raises truthful-response rates only partially; weight-level intervention (abliteration / logit suppression, cf. arXiv:2505.23848) can reduce the bias at a cost in capability and never fully succeeds. The behavior tracks China’s requirement that public-facing generative-AI services be registered and security-assessed by the state (Interim Measures for the Management of Generative AI Services, effective August 2023, governing services with “public opinion attributes”). See also my earlier piece, “Open From Both Sides”, The AI Realist.

[30] Anthropic letter to US Senate Banking Committee Chairman Tim Scott and Ranking Member Elizabeth Warren, dated June 10, 2026, alleging the largest known distillation campaign on Claude — roughly 25,000 fraudulent accounts and 28.8 million exchanges between April 22 and June 5, attributed to operators affiliated with Alibaba and its Qwen AI lab, targeting agentic reasoning, software engineering, and long-horizon planning. First reported by Bloomberg (June 24); confirmed by CNBC and Reuters. Interested-party allegation; treat accordingly.

[31] “Access, Disable, Destroy”, The AI Realist — the three-layer coercion model over chips, cloud, and models.

Two Laws, One Dependence

Julien Simon — Mon, 22 Jun 2026 07:23:56 GMT

In October 2025, a single fault in one Amazon data center in northern Virginia took down Signal, Snapchat, Epic Games, and much of the internet for most of a day.[1] There was a second hyperscale outage that week: an Azure Front Door failure that hit Heathrow and the Scottish Parliament. Seven months later, these outages have produced a piece of European law.

In the week of 22 June 2026, the European Commission is expected to find, provisionally, that Amazon Web Services and Microsoft Azure are “gatekeepers” under the Digital Markets Act, a designation no cloud provider has carried before.[2] A gatekeeper is a platform so entrenched that its customers cannot practically avoid it, and the designation imposes on it obligations the rest of the market does not bear. The remedies here aim to improve interoperability, ensure cleaner data portability, and reduce exit fees.[3]

Europe’s newest answer to its dependence on American cloud is a rule that makes it easier to switch between two American clouds.

Read the room before you read the regulation. The outage was an accident, and the cure is written for accidents. The thing that makes the cloud a question of sovereignty rather than reliability is not the accident. It is the day someone reaches for the off switch on purpose, and that switch is not on the menu.

Two laws, one month

The designation does not arrive alone. On 3 June, the Commission published the Cloud and AI Development Act, the centerpiece of its Tech Sovereignty Package and the most serious attempt yet to legislate European independence in cloud and AI.[4] Two instruments, weeks apart, aimed at the same dependence. One is a competition tool. The other calls itself sovereignty. Neither touches the two things that decide whether a cloud is sovereign: who operates the service, and who controls the chips it runs on.

The competition tool runs into a wall of arithmetic. Three American firms, Amazon, Microsoft, and Google, hold 70 percent of the European cloud market; the largest single European provider, whether SAP or Deutsche Telekom, holds 2 percent.[5] Against that backdrop, “easier to switch providers” means something specific and unhelpful: easier to switch from Amazon to Microsoft, and vice versa.

Portability between two firms under the same foreign jurisdiction is not an exit. It is a more comfortable form of dependence.

None of this makes the remedy worthless. The Digital Markets Act will lower egress bills and make multi-cloud architectures less painful, and contestability is a real good, whether or not it touches sovereignty. The point is narrower: a competition instrument is being read, in the political register, as a sovereignty win. It is not built to be one, and it cannot accidentally become one. Where the designation cannot reach, and where the sovereignty law chooses not to look, is the rest of this piece.

The off switch reaches the mundane

Where the off switch sits is settled, and I will not re-litigate it here. The law follows the company, not the server: a provider under United States jurisdiction can be ordered to produce data in its possession, custody, or control wherever that data physically sits. I traced that statutory chain in full in Two Sovereign Clouds, One Legal Wall.

What is new is the evidence that the switch reaches past the obvious targets. On 22 May 2026, the Dutch outlet Vrij Nederland reported that Microsoft had handed the US House Judiciary Committee the internal emails, meeting notes, and calendar entries of named officials at two Dutch regulators, the competition authority and the data protection authority, without redacting their names.[6]

No sanctions, no court order against the individuals, no Russia nexus.

Ordinary European civil servants, doing ordinary European regulatory work, have their correspondence produced to a foreign legislature because the company holding it answers to that legislature’s law. called it "extremely worrying" and raised it with the US ambassador.[7]

That matters because, until now, the switch had mostly been thrown against the conspicuous: an ICC prosecutor under US sanctions and a Rosneft-linked refiner under EU sanctions, once under American law, once under European, the customer’s own location irrelevant in both. The objection writes itself: those were sanctioned parties. But the Dutch case is different. The exposure is not a property of being sanctioned. It is a property of whose jurisdiction your operator holds, and the Digital Markets Act’s portability remedy does nothing to change that, because it moves you between two operators who answer to the same one.

The sovereignty law that de-chipped itself

Set the competition remedy aside and read the Commission's proposed Cloud and AI Development Act on its own terms. It is the most serious sovereignty framework Europe has produced, and its seriousness is the problem. The Act defines four assurance levels, weakest to strongest.[8] Level 1 is data residency. Level 2 adds independence from third-country interference and transparency in the software supply chain. Level 3 requires the provider to be owned and controlled from within the EU, with criteria that extend to personnel's nationality. Level 4 demands full command of the software supply chain.

Read as a ladder, it climbs towards independence. Read against the market, each rung lands on a segment that already exists.

The Commission’s own impact assessment aligns the levels with current supply and is candid that most public-sector workloads will sit at Levels 1 and 2, which the American hyperscalers reach through their “sovereign” offerings.[9] Only a narrow band will require Levels 3 and 4. There is a sharper irony one rung up: Level 2 asks a provider to demonstrate independence from third-country interference. A US hyperscaler “sovereign” tier claiming Level 2 is certifying, on paper, an independence its own counsel told a parliament it does not have.[10] And Level 3, the rung that is supposed to signify European ownership, contains a clause that allows the Commission to recognize third-country providers.[11] The most European tier has a door in its back wall.

Then there is the part the proposal leaves out. The Commission already had a sovereignty yardstick that required a full EU supply chain, including chips.[12] When it put €180 million of its own sensitive workloads out to tender in April, no bidder reached that tier; the cleanest qualifiers cleared the rung below it, on a commodity stack.[13] Yet when the legislative text arrived in June, CADA did not carry the chip across. Its assurance levels grade the software supply chain and stop there; Annex II places hardware "outside of the scope" of the sovereignty assessment.[14]

Call it what it is: de-chipping.

The yardstick that scored the chip was one that the Commission could publish alone; the proposed law dropped the one rung nobody could meet. A sovereignty framework that cannot describe a sovereign chip has decided not to try. The money agrees with the edit. The Act’s financial statement carries roughly €25 million across 2028 to 2034, against a build-out it says needs three to four billion euros per gigawatt and tens of gigawatts of new capacity.[15] That is not a budget. It is a signature.

The switch nobody scores

The de-chipping matters because the hardware is where the off switch is most absolute, and it is the layer that assurance levels now refuse to look at. Not one of the four scores the silicon. A workload can sit at Level 4 and run end to end on Intel processors and Nvidia accelerators; the software supply chain can be wholly European while the chips answer to Washington. I have written separately about how far that dependence runs below the silicon; the point here is only that CADA’s own top tier now stops precisely where that dependence begins.

It is not hypothetical. In January 2025, the outgoing US administration’s AI Diffusion Rule sorted the world into tiers for access to advanced AI chips and placed much of the EU, including Poland, Portugal, and most of the bloc’s east, in the second tier with capped access; the rule was rescinded that May, two days before it took effect, in a decision as unilateral as its drafting.[16] Congress, meanwhile, is advancing the Chip Security Act, which would require location verification for exported AI chips. It cleared the House Foreign Affairs Committee unanimously in March 2026, and while it is not law, the fact that chip-tracking is on the table in Washington tells you where hardware sovereignty is decided.[17]

A “sovereign” European cloud whose chips can be tiered, traced, or capped by a foreign legislature is sovereign in the way a house with someone else’s lock on the door is private.

What honesty would look like

None of this means the pragmatism is wrong. Given the state of European supply, no law could wall the public sector off from American providers without being unenforceable on contact, and removing a sovereign-chip tier that nobody can meet is more honest than pretending otherwise. A version of the Act that said plainly, strict sovereignty over a critical core and pragmatism on the rest, would be defensible. The Commission’s own Cloud III procurement showed in April that a real requirement pulls a real response, when two clean European providers, Scaleway and STACKIT, qualified for sensitive workloads on a commodity stack.[18]

The problem is not the pragmatism. It is the packaging. A competition remedy that makes the American duopoly easier to move within is being sold as a step towards sovereignty. A sovereignty law whose own grades never reach the chip is being sold as the thing that will end the dependence. Present either as what it is, and both are defensible; present them together as a sovereignty agenda, and you install the ambiguity in which sovereignty-washing lives, a term the European Parliament’s own research service now uses in print.[19] Call the hyperscalers’ improved offerings what they are: trusted cloud, resilient cloud, real operational progress. Sovereign, no.[20]

Presenting the package on 3 June, the Commissioner responsible, Henna Virkkunen, said the aim was to ensure no provider of critical services holds a "kill switch" over Europe.[21]

She named the risk precisely, then presented a law that reaches neither the operator nor the chip, the two places the switch sits.

The EU's record on targets like this is not encouraging: the 2023 Chips Act aimed to double Europe's share of global semiconductor production to 20 percent by 2030, attracted more than €52 billion, and left the bloc below 10 percent.[22] The number of cloud laws built to move is the same kind; the share of the European cloud market held by European providers is stuck near 15 percent, while three American firms hold 70 percent.

If it has not turned by 2030, Europe will have regulated its dependence twice, relabelled it once, and changed it not at all, and the off switch will sit where it sits today, in the one place no assurance level dares to score.

Notes

[1] On 20 October 2025, a DNS race condition in Amazon Web Services’ US-EAST-1 region (northern Virginia) cascaded across dependent services for roughly fifteen hours, affecting Signal, Snapchat, Epic Games and more than a thousand others; AWS published its post-mortem three days later. The outage was the proximate trigger for the EU’s cloud market investigation. ThousandEyes outage analysis, 20 October 2025.

[2] The Commission is reported to be preparing preliminary findings, expected the week of 22 June 2026, that AWS and Microsoft Azure meet the requirements for gatekeeper designation under the Digital Markets Act, with a final decision expected by end-2026. The Next Web, citing Bloomberg.

[3] Reported obligations under discussion include interoperability, data portability and curbs on customer lock-in such as egress fees. The Next Web.

[4] Cloud and AI Development Act, European Commission, published 3 June 2026 as the centrepiece of the European Technological Sovereignty Package. European Commission; Covington, Inside Global Tech, 11 June 2026.

[5] Synergy Research Group: Amazon, Microsoft and Google together hold about 70 per cent of the European cloud infrastructure services market (IaaS, PaaS, hosted private cloud); among European providers, SAP and Deutsche Telekom lead with roughly 2 per cent each (2024 data). Synergy Research Group.

[6] Vrij Nederland, 22 May 2026, reporting that Microsoft transmitted unredacted internal emails, meeting minutes and calendar entries of named civil servants at the Authority for Consumers and Markets (ACM) and the Data Protection Authority (AP) to the US House Judiciary Committee. NL Times.

[7] The named officials include staff at both regulators and a University of Amsterdam researcher; the Dutch cabinet called the episode “extremely worrying,” noting the named individuals could face travel bans or sanctions, and State Secretary for Digital Economy and Sovereignty Willemijn Aerdts raised the matter with US Ambassador Joe Popolo during her introductory meeting with him. Built In EU; DutchNews.nl.

[8] CADA defines four “Union assurance levels” for public-sector cloud and AI procurement. Level 1 (data residency) is the floor for all providers serving the public sector; Levels 2–4 add an independent third-party audit examining EU-located staff, whether provider data is used to train AI models, software supply-chain security, and a European cybersecurity certificate rated at least “substantial.” “Control” is defined by reference to the European Defence Fund “decisive influence” test (Art. 2(21)). European Commission; Wilson Sonsini, June 2026.

[9] CADA impact assessment, Part 1, p. 41, maps each level onto an existing market segment in near-verbatim terms (Level 1: “US hyperscalers generally all have offerings that would allow them to qualify”; Level 4: “some emerging EU offerings”). The assessment twice instructs that risk assessments “consider the reality of the supply market to avoid… mandating the use of services that don’t exist (yet)” (pp. 40, 49). Its demand model splits public-sector use cases 70/20/9/1 across Levels 1–4 and sizes the exclusively EU-addressable market at roughly €4.48 billion by 2030 (pp. 47, 72–73).

[10] CADA, Level 2 criterion requiring providers to demonstrate independence from third-country interference (Art. 2(g)(ii)). The Commission’s own impact assessment (Part 1, pp. 14–15) recounts the Microsoft France testimony before the French Senate procurement inquiry, June 2025: “Non, je ne peux pas le garantir, mais, encore une fois, cela ne s’est encore jamais produit.” Sénat, compte rendu.

[11] CADA, Level 3 requires the provider to be owned and controlled from within the EU, with criteria such as personnel citizenship; the Commission’s own summary states it “can recognise third-country providers” under the framework. European Commission; see also techUK, June 2026, which calls the third-country recognition mechanism “one of the most important aspects” of the legislation.

[12] The Commission’s Cloud Sovereignty Framework (Version 1.2.1, published 20 October 2025) defines a five-rung scale of Sovereignty Effectiveness Assurance Levels (SEAL), from SEAL-0 (no sovereignty) to SEAL-4 (Full Digital Sovereignty: complete EU control across the supply chain, hardware included). CADA codifies a four-level version of this framework as binding “assurance levels,” and, as Annex II makes explicit, drops hardware from scope in the process. European Commission, Cloud Sovereignty Framework v1.2.1; analysis in More Sovereign, Different Stack: The Builder Tax.

[13] European Commission, Cloud III procurement, awarded 17 April 2026: a Dynamic Purchasing System worth up to €180 million over six years for sensitive EU institutional workloads. SEAL-2 was the minimum threshold; the cleanest prequalified consortia, Scaleway and STACKIT, cleared SEAL-3 on commodity-stack architectures, and none reached SEAL-4. European Commission; see Ten Percent Sovereign and The Builder Tax.

[14] CADA, Annex II, scope paragraph, excludes hardware verbatim: “’Hardware’ within the meaning of Regulation (EU) 2024/2847, Article 3, point (5) is outside of the scope.” Hardware survives only as a contract-award criterion that is “ancillary and not decisive” (Art. 32(2)(d)), feasibility-qualified, and capped by Recital 67 at a suggested maximum of 15 of 120 points. European Commission; Covington, Inside Global Tech, 11 June 2026 (confirming the 15-of-120 weighting).

[15] CADA legislative financial statement: total appropriations of €25.228 million across 2028–2034, fee-financed, supporting roughly 25 full-time staff. The impact assessment’s central scenario calls for a tripling of EU data-centre capacity against an estimated 19 GW gap, at a build-out cost of roughly €3–4 billion per gigawatt. European Commission.

[16] The AI Diffusion Rule (interim final rule, 15 January 2025) established a tiered framework for access to advanced AI chips, placing NATO members including Poland and Portugal in the second tier with capped access; BIS rescinded it on 13 May 2025, two days before its scheduled 15 May effective date. United States Studies Centre; U.S. Department of Commerce, BIS.

[17] The Chip Security Act (H.R. 3447) passed the House Foreign Affairs Committee 42–0 on 26 March 2026 and proceeds to the full House; it has not been enacted. Nvidia opposes a tracking mandate, stating its products contain “no backdoors” and “no kill switches,” while since December 2025 offering optional software to trace its GPUs’ location. House Foreign Affairs Committee; Geo News, citing Reuters.

[18] Commission Cloud III procurement, awarded 17 April 2026: under a genuine sovereignty requirement, the cleanest prequalified consortia, Scaleway and STACKIT, cleared SEAL-3 (one tier below the unmet SEAL-4). European Commission; see The Builder Tax.

[19] The term “sovereignty-washing” appears in the European Parliament’s research output describing hyperscaler “sovereign cloud” offerings. European Parliament, “European Software and Cyber Dependencies,” PE 780.413/778576, December 2025.

[20] Disclosure: the author is an AI Operating Partner at Fortino Capital, a European private equity firm whose portfolio includes companies whose cloud architecture decisions fall within the scope of this analysis; he previously spent six years at AWS and was Chief Evangelist at Hugging Face. The disclosure names the interest; it is not an endorsement of any provider or procurement choice.

[21] Henna Virkkunen, Executive Vice-President for Tech Sovereignty, Security and Democracy, at the press conference presenting the European Technological Sovereignty Package, Brussels, 3 June 2026: the Commission wants to ensure no cloud provider of critical workloads holds a “kill switch” over essential European services, and observed that the US CLOUD Act makes it “difficult” for US companies to reach the highest sovereignty levels. CNBC, 3 June 2026.

[22] The 2023 EU Chips Act mobilised more than €52 billion toward a target of doubling the EU’s share of global semiconductor production to 20 per cent by 2030; the EU’s share remains below 10 per cent, prompting the demand-focused Chips Act 2.0 in the June 2026 package. TechPolicy.Press, June 2026.

Independent or Current

Julien Simon — Thu, 18 Jun 2026 11:01:42 GMT

In 2024,, the European Commission went looking for someone to evaluate the world’s most powerful AI models. The post was the lead scientific adviser to the AI Office, the person who would sit across from OpenAI, Anthropic, and Google and judge whether their frontier systems were fit for placement on the European market. The application window opened, then closed in December. Months later, the chair was still empty.[1]

The pay explains some of it. The Office’s technical roles top out near $120,000, and even its senior posts sit on fixed civil-service scales the labs beat several times over for the same skills, sometimes with seven-figure packages. The European pool for this work is thin enough that the strongest candidates already sit in San Francisco or London. The Office has since hired a small, capable safety team, with people from Oxford, Google, and the UK’s AI Security Institute.[2] But the seat reserved for the scientist who would lead the judging of a frontier model stayed empty as the start date approached. The authority is real. The question is the capacity to use it.

Step back from the hiring, though, and the opposite is just as true. On paper, the European Union has just built the most serious AI enforcer anywhere. The Digital Omnibus, voted through Parliament on 16 June, consolidated oversight of the largest models and AI across the largest platforms into a single office and gave that office the power to vet the highest-risk products before they ship.[3] A Scientific Panel of 60 independent experts was sworn in on 1 June to give it technical muscle.[4] A 174-member Advisory Forum sits alongside.[5] The obligations for general-purpose models have been in effect since August 2025.[6] By any measure of ambition, Brussels has done what Washington spent years declining to do: it built a standing regulator with the authority to test the frontier.

That empty chair is a small sign of a large problem. Europe’s frontier-AI evaluator can be independent or current, but not both, and the same shortage of methods, access, and people produces both limits. To judge the largest models as they exist, the Office has to borrow the labs’ tools, access, and talent, at which point it is grading the labs’ own homework. To build its own capacity instead and stop borrowing, it has to move at the speed of public hiring and public standard-setting, at which point whatever it certifies is a snapshot of a live model it has already outrun. The competence to evaluate a frontier model in real time exists almost entirely inside, or priced by, the companies being evaluated.

The homework problem

Start with what the law asks of a frontier developer, because it is less than most readers assume. A provider of a general-purpose model with systemic risk must run its own state-of-the-art evaluations, assess and reduce the risks it identifies, document all of this in a Safety and Security Model Report, and submit that report to the AI Office before the model reaches the market.[7]

The lab runs the assessment, writes it up, and sends the file; the Office reads it.

External evaluation exists, but its shape tells you who is in charge. The Code of Practice that fills in the details requires a model developer to give independent, external evaluators access to its most advanced versions and to publish the standards by which it selects them. The provider grants the access, and the provider sets the bar for who qualifies. The requirement was nearly cut from the Code during drafting and survived only as mandatory “in most circumstances,” with an exemption allowing a model developer to claim it is as safe as one already cleared.[8] The outside check runs through a door the developer holds open. And as of mid-2026, the Office had not settled what qualifies someone to serve as an outside evaluator: it called a workshop for 15 July, weeks before its enforcement powers switched on, to take expert input on the question.[9]

This is not a flaw the Office can out-hire, because the people who would do the hiring face the wall the empty adviser’s chair revealed. And it is not solved by leaning on the independent evaluators who have made their names doing this work, because they hit the same door. The respected outside shops — METR, Britain’s AI Security Institute, Apollo, FAR.AI — operate under voluntary access agreements granted by the labs, and the labs can withdraw those agreements. When a lab has shared a model before release, the sharing has been thin: in the clearest recent case, a model developer handed an evaluator a safety-tuned version with no ability to fine-tune it, and one of the best-known evaluators appears to have had no special pre-release access since its early work in 2023.[10] Their independence is real on the org chart and thin where it counts: in what the lab lets them see.

What the outside world mostly gets to see is behavior: the model’s outputs, not its internals.

So, where is Europe’s own capacity? It is real, and it sits one level up from the test. Between late 2024 and mid-2025, the Commission’s Joint Research Center ran a global expert pool to develop methods for sorting models into risk categories, and co-authored a paper with the AI Office, published in Science, on how to keep evaluation proportionate to risk. This is serious work. But it is rubric-writing: how to classify a model, where to set the compute thresholds, how to think about reach. The JRC’s own review of AI benchmarks catalogs how immature the field is, and its categorization work measures capability using the benchmarks that already exist, the ones the research community and the labs have built.[11] Europe can decide which models deserve scrutiny and to what extent. It cannot, on its own, put a frontier model through its paces.

Put the pieces together, and the first half of the trap closes on itself. To bind the frontier, the Office needs to evaluate the largest models close to real time. Real-time evaluation needs methods and access that live within the labs and the lab-adjacent shops at the labs’ gate. The Office cannot build that capacity fast enough, because it cannot pay for the people who have it. So the binding step falls back on the provider’s own evaluations, the provider’s chosen outside reviewers, and the provider’s report. No independent capacity accumulates, so next year the Office is no closer to building its own, and it borrows again. The Panel can raise a formal alert when it suspects a model carries serious risk, and that lever is real.[12] But an alert is a flag raised over a document that the model developer wrote.

The yardstick that never arrives

In principle, there is a way out of the borrowing: building an independent measure of its own. Europe tried. The result is the second half of the trap.

The AI Act’s binding requirements for high-risk systems were meant to rest on harmonized technical standards, the detailed yardsticks against which a system is judged. The Commission asked the European standards bodies to write them in 2023. They missed the 2025 deadline, the work continues, and the Commission has said the delay puts the timetable at risk. The first relevant standard reached public consultation in late 2025, months behind schedule, and standards of this kind typically take 2 to 4 years to complete. The Omnibus that Parliament just passed pushed the high-risk obligations out to the end of 2027 and tied their start to the readiness of those standards, conceding that it cannot set its own clock.[13]

The model layer repeats the problem in another form. The general-purpose rules are written to apply across a model’s life, including after updates, and a model developer is told to set their own trigger points for when fresh evaluation is due. But the binding re-assessment only fires when a change is large enough to count as a new model, and the indicative bar for that is a modification using more than a third of the original training compute, a threshold the Commission expects few to cross.[14]

A model can drift a long way through a stream of smaller updates without ever tripping a formal review, and the triggers that might catch it are the model developer’s to set and judge.

Now both halves are visible at once, and they lock together. Every move the Office makes toward being current — lifecycle monitoring, live evaluation, judging the model as it is — runs through the developer, because only the developer has the access and the tools to do it at speed. And the one move toward an independent measure of its own — the standards — keeps slipping past its deadline. Currency by borrowing, independence by waiting: the Office cannot have both, because the thing that would let it be current without borrowing, a deep bench of frontier evaluators paid at market rates working from methods it owns, is the thing the empty adviser’s chair says it cannot afford to build.

What the Office can do

The safety unit is staffed with credible people, drawn from places that do this work.[2] The Scientific Panel is no roster of industry placemen: most of its 60 members are academics, a sixth come from the European machine-learning research network, and they sit in their personal capacity under conflict-of-interest rules.[4] These are serious researchers, several of whom would be at home in any frontier lab. The JRC’s methodological work is real and is being read.[11] The alert the Panel can raise is a true lever, and the Office now has the formal authority over the largest models and the platform-integrated systems that, until this year, were scattered across national capitals.[3] The enforcement numbers are not trivial: breaches of the general-purpose obligations carry fines up to the greater of €15 million or 3% of worldwide turnover.[15]

The law does not limit the Office to reading what it is sent. It can demand access to a model, through an interface or the source code itself, and run its own evaluation, with fines for a model developer who refuses.[16] But look at when it applies: to check compliance only once the developer’s own documentation is found wanting, or to investigate a serious risk after the panel raises a flag. It is the escalation, not the routine, and the default stays the model developer’s report. The rules for how such an evaluation would run have not yet been written. And a right to demand access is worth only as much as the capacity to use it: the methods, the compute, and the people it has already shown it lacks. A right of entry that the regulator cannot staff is a right on paper.

So the Office can decide which models matter. It can demand documentation. It can read a model developer’s report with expert eyes, push back, and escalate. It can fine a model developer who lies or hides. What it cannot do is the thing the public imagines a safety regulator does: take the live model, probe it deeply and repeatedly on its own terms, and certify the result against a yardstick it built and controls. Everything it does sits on top of artifacts that the model developer generated and access that the developer granted.

The fork the labs just got

There is one more actor, and it changed the board three weeks ago.

On 2 June, President Trump signed an order on advanced AI that does almost the opposite of what Brussels did. It establishes a framework for model developers to grant the federal government up to 30 days of access to their most powerful models before release, on a voluntary basis, with qualifying models selected by a classified benchmark, and with no mandatory licensing, no pre-clearance, and no right for anyone to sue to enforce.[19] Collaboration, the order says, not command. The administration had pulled an earlier draft in May for fear it would slow American competitiveness, and signed this softer version instead.[20] The same administration has stood up a task force to challenge state AI laws it considers too heavy.[21] The contrast with Europe could not be sharper: a voluntary American framework on one side, and on the other, binding European obligations carrying statutory fines that take effect this August.[22]

This matters to Europe’s evaluator because it hands the labs something they did not have before: a venue they prefer. A developer cannot lawfully step outside the AI Act for a model it places on the European market; the obligations attach on sale, wherever the model was built.[23] But “comply” and “comply fully, first, and openly” are different things.

A lab can ship its strongest model in the United States first under the friendly arrangement, stage or delay the European release, send a capped or filtered version into the European market, and meet the Office’s deeper requests for access with the least it can defend, while pointing to the American process as its real oversight.

This is not hypothetical: in 2024, one large model developer withheld its multimodal models from the European Union over what it called regulatory unpredictability, and a major device maker delayed its flagship AI features in the bloc on similar grounds.[24] The incentive to route cooperation toward the venue that creates no liability only sharpened the week of the order: the day before it, Anthropic filed a confidential draft registration with the SEC at a valuation near $965 billion, the kind of public-market stake that turns a European enforcement action into a disclosed risk on the prospectus.[25]

None of this loosens the bind; it pulls it tighter. The evaluator was already leaning on the access the labs chose to grant, and now the labs have a venue they prefer and a reason to give Brussels less of it. The access that was thin becomes contested ground, and the jurisdiction that wins it is the one the developers like better. Europe gets the companies willing to be examined, on their own terms; Washington offers those same companies an examiner who asks nothing it can enforce.

Europe has run this play before

If this reads like a forecast, it is not. Europe ran the experiment one regulatory generation back, on the platforms, using the same institutions it is now reaching out to. The AI Office and its Panel are built on the template of the European Center for Algorithmic Transparency: set up in 2023, housed in the same Joint Research Center that now serves the Office, and tasked with giving the Commission the in-house expertise to police the largest online platforms under the Digital Services Act.[26] Swap “platforms” for “models,” and the shape repeats: exclusive Commission supervision of the biggest players, backed by a technical body meant to do the evaluating. Its record is the closest thing we have to a forward look, and it is thin.

Two and a half years in, the platform rules have produced a single fine: €120 million against X in December 2025, in part for barring researchers from its data, with 60 days to fix rather than pay.[27] The open cases against Meta and TikTok turn on the same failure: researchers shut out of platform data.[28] The rules grant researchers a right to that data, and the platforms have spent years making it slow, conditional, or barred outright.

Slow enforcement and a permanent fight to see inside the thing it polices: that is the template now aimed at frontier models.

The pattern predates the platforms, and the AI lineage is direct. Europe’s chemicals agency checks only a fraction of the dossiers industry files on its own substances; the legal minimum rose from 5% to 20% in 2019, and when it does check, most fail: 61% of one early cohort fell short of what the law required.[29] Even a mature agency with real capacity can only spot-check a self-reported base, finding much of it wanting. The body that wrote Europe’s first AI principles fits the same shape: the 2018 High-Level Expert Group, heavy with industry seats and with only four ethicists among more than 50 members, produced ethics guidelines and a self-assessment checklist that one of its own called ethics-washing.[30] The Scientific Panel is its more independent, more technical heir, and still a body reading what developers choose to show it.

What would have to break

Europe escapes the trap if the Office can retain frontier evaluators at something near market pay, build a battery of tests it designs rather than borrows, win a right of deep access to live models instead of the access a developer grants, and track models as they change instead of certifying a version and moving on. Each of those is conceivable. None is close on the current path: the pay bands are public-sector, the standards keep slipping, the access is voluntary, and the talent the Office needs is being bid away by the firms it would evaluate and is now being courted by a friendlier government across the Atlantic. The honest probability, on today’s trajectory, is low.

None of this argues for no regulator; a borrowed evaluation still beats none. The point is narrower: borrowing carries a cost the borrower keeps paying.

Which leaves a verdict sharper than “the regulator is underpowered.” For anyone running diligence on an AI vendor, the practical reading is blunt: treat “AI Office-supervised” or “evaluated under the AI Act” as a provider-generated artifact, not an independent clearance, and price the difference. Europe has built a real enforcer that can read the labs’ homework, raise a flag over it, and fine a lab that hides the truth. What it cannot do is grade the frontier itself. And whichever way it leans, the only models it truly examines are the ones whose makers agree to sit the exam.

The systems that should worry you most are built to skip it: a model fine-tuned to strip its safety training and re-released by someone who never files with Brussels, a capable model served from a jurisdiction that ignores the Act (I’m looking at you, China), a model trained on smuggled compute by an actor with no European address to fine.

Once again, the cop that Europe built is aimed at the population that was already willing to be policed.

Notes

[1] The AI Office’s lead scientific adviser post — application deadline December 2024 — remained unfilled in mid-2026; the head-of-safety-unit role, vacant since the Office was set up, was filled in December 2025 (Matthieu Delescluse). Transformer News; MLex.

[2] Reported AI Office salaries: technical and contract-agent roles roughly $55,000–$120,000, the lead scientific-adviser post at grade AD13 (about €13,500–15,000 per month); EU staff also receive allowances and favourable tax. The figures still fall well below frontier-lab compensation, which can run to seven figures. Transformer News.

[3] Digital Omnibus on AI, consolidated text (Council doc ST 9247/2026 INIT); European Parliament plenary vote 16 June 2026. The AI Office gains centralized supervision over systems built on a same-provider general-purpose model and over AI integrated into very large online platforms, with Commission pre-market assessment for such high-risk systems; carve-outs leave certain products and uses to national authorities. European Commission, Regulatory framework on AI.

[4] AI Act Scientific Panel of 60 independent experts, established 1 June 2026 under Article 68 and Commission Implementing Regulation (EU) 2025/454 (7 March 2025); members serve in a personal capacity under confidentiality and conflict-of-interest declarations; most from academia, roughly a sixth from the ELLIS network. Implementing Regulation (EU) 2025/454; European Commission, AI Scientific Panel.

[5] AI Act Advisory Forum of 174 members under Article 67. European Commission, AI Advisory Forum.

[6] General-purpose AI model obligations have applied since 2 August 2025 under Regulation (EU) 2024/1689; the Office’s power to impose penalties for breaches applies from 2 August 2026. DLA Piper.

[7] Article 55 and the GPAI Code of Practice require providers of general-purpose models with systemic risk to conduct state-of-the-art model evaluations, assess and mitigate systemic risks, and submit a Safety and Security Model Report to the AI Office before placing the model on the market. Article 55.

[8] Under the GPAI Code of Practice (Safety and Security chapter), signatories must give independent external evaluators access to their most advanced models before deployment and publish their evaluator-selection criteria; the requirement applies “in most circumstances,” with proportionality and exemptions, for example where a model is no more capable than an existing open-weight one. Analysis of the Code.

[9] European Commission, European AI Office: “Call for participants: Workshop — qualification requirements for external evaluators of GPAI models with systemic risk,” 15 July 2026. European AI Office.

[10] Independent evaluators (UK AISI, METR, Apollo, FAR.AI) work under voluntary 2024–2025 access agreements the labs grant and can withdraw; pre-deployment sharing has been thin (in one case a safety-tuned model with no fine-tuning access; a leading evaluator with no special pre-release access since around 2023), and the evidence base is overwhelmingly behavioural. Seth & Sankarapu, arXiv:2605.15164 (Lexsi Labs, May 2026); AI Lab Watch; access-level taxonomy in arXiv:2601.11916 (Jan 2026), finding external evaluators are typically restricted to black-box access and cannot examine model internals.

[11] A paper co-authored by the AI Office and the Joint Research Centre, “The science and practice of proportionality in AI risk evaluations,” appeared in Science (vol. 391, 6 March 2026), translating the legal proportionality test into criteria for how demanding a model evaluation must be; the JRC also runs an expert pool on GPAI risk categorisation and has catalogued the immaturity of current AI safety benchmarks. Knowledge4Policy; Science; JRC, AI safety benchmarks report.

[12] The Scientific Panel may issue a qualified alert to the AI Office where it suspects a general-purpose model presents systemic risk; the alert is a trigger the Office may act on, not an automatic investigation. Article 90.

[13] Harmonised standards under standardisation request M/593 were requested of CEN-CENELEC in 2023, missed the 2025 deadline and remain in development; the first reached public enquiry in late 2025, and such standards typically take two to four years, with acceleration measures adopted in October 2025 targeting completion by late 2026. The Digital Omnibus ties the start of the high-risk obligations to their availability. TechPolicy.Press.

[14] General-purpose obligations apply across the model lifecycle, including post-market modifications, with provider-set evaluation triggers; a downstream modification is treated as a new model chiefly where it exceeds roughly one-third of the original training compute (indicative), a bar the Commission expects few to cross. European Commission, GPAI Q&A.

[15] Breaches of the general-purpose obligations carry fines up to the greater of €15 million or 3% of worldwide annual turnover; high-risk obligations are deferred under the Omnibus to 2 December 2027. Article 101.

[16] Article 92 empowers the AI Office, after consulting the Board, to conduct evaluations of a general-purpose model — to check compliance where information requested under Article 91 is insufficient, or to investigate systemic risk, in particular after a scientific-panel alert — and to appoint independent experts, including from the panel. It may request access via APIs or other means, including source code, with Article 101 fines for refusal; detailed arrangements await implementing acts not yet adopted. Article 92; European Commission, GPAI Q&A.

[19] Executive Order, “Promoting Advanced Artificial Intelligence Innovation and Security,” 2 June 2026: directs a framework for developers to voluntarily grant the federal government up to 30 days of pre-deployment access to “covered frontier models,” with covered models set by a classified NSA benchmark, no mandatory licensing or pre-clearance, and no enforceable private right. The White House; Crowell & Moring; Morrison & Foerster.

[20] The administration pulled an earlier draft of the order in May 2026 over concerns it would hinder US competitiveness, signing a softer version on 2 June. Crowell & Moring.

[21] The order establishes a Department of Justice task force to challenge state AI laws. Paul Hastings.

[22] US/EU divergence: a voluntary US framework versus binding EU general-purpose obligations with statutory penalties effective 2 August 2026. ComplianceHub.

[23] AI Act obligations attach when a model or system is placed on the EU market, regardless of where it was developed. Regulation (EU) 2024/1689.

[24] In 2024, Meta declined to release its multimodal Llama models in the EU, citing “the unpredictable nature of the European regulatory environment” (a text-only Llama shipped), and Apple delayed several Apple Intelligence features in the EU, citing Digital Markets Act uncertainty. Axios; 9to5Mac.

[25] On 1 June 2026, the day before the executive order, Anthropic confidentially submitted a draft S-1 to the SEC at a valuation near $965 billion (following a $65 billion round the prior week). Anthropic; Fortune; CNBC.

[26] The European Centre for Algorithmic Transparency (ECAT), established 2023 within the Joint Research Centre, gives the Commission in-house technical expertise to support its exclusive supervision of very large online platforms under the Digital Services Act. ECAT.

[27] First DSA non-compliance fine: €120 million on X, 5 December 2025, partly for barring researchers from effective access to its public data, with deadlines of 60 to 90 working days to remedy rather than pay. IAPP; Euronews.

[28] The Commission preliminarily found Meta and TikTok in breach of their obligation to give researchers access to public data (24 October 2025); across the DSA cases the recurring breach is denial of access, even though the law grants researchers a right to platform data. European Commission, preliminary findings; Science.

[29] Under REACH, the European Chemicals Agency checks only a fraction of industry-submitted registration dossiers: the legal minimum rose from 5% to 20% in 2019, and ECHA examined about 21% of full registrations (≈15,000) between 2009 and 2023; of 928 evaluations concluded in 2013, 61% were non-compliant with one or more information requirements, and in 2024, 313 compliance checks produced 208 data requests. ECHA.

[30] The High-Level Expert Group on AI (convened 2018) produced the Ethics Guidelines for Trustworthy AI (April 2019) and the ALTAI self-assessment checklist (July 2020); a member, Thomas Metzinger, publicly called the exercise “ethics-washing,” noting roughly four ethicists among more than 50 members and the absence of red lines. European Commission, High-Level Expert Group on AI.

Anthropic's Model Got Pulled. The Dangerous Ones Didn't.

Julien Simon — Mon, 15 Jun 2026 12:20:10 GMT

As of June 15, 2026: Fable 5 and Mythos 5 remain suspended; Anthropic and the administration are in active talks, with officials suggesting access could be restored “in the next few weeks.” No reinstatement and no published rule or Federal Register notice as of filing.

On Friday, June 12, at 5:21 pm Eastern, Anthropic received a letter from the U.S. Commerce Department. By that evening, Fable 5 and Mythos 5 — the company’s two most capable models, launched three days earlier — were gone. Not throttled. Not region-locked. Gone, for every customer Anthropic has, including the banks and agencies that had been using Mythos-class capability for vulnerability discovery. [1]

The mechanism is worth getting right because most of the coverage has it slightly wrong. Nobody flipped a remote kill switch. The government issued an export-control directive citing national security authorities, ordering Anthropic to suspend access for any foreign national, whether outside the United States or within it, including Anthropic’s own foreign national employees. [2] That last reach is the sharp part: under the deemed-export doctrine, giving a controlled technology to a foreign national on US soil counts as an export to their home country, so the order swept in Anthropic’s own non-citizen staff alongside every user abroad. A company cannot filter foreign nationals from US citizens in real time across a consumer product, so the only way to comply was to disable the models for everyone. The order's reach did the work. The global takedown was the side effect.

In April, in an internal briefing for the CEOs of portfolio companies I help oversee, I argued that the defining feature of the AI security landscape is an asymmetry: defenders are governed, and attackers are not. Procurement rules, compliance regimes, and data-sovereignty constraints dictate which models a defender may run. Attackers self-host abliterated open-weight models — models with the refusal behavior surgically removed from the weights — and answer to none of it. June 12 is that asymmetry rendered in a single week.

Here is the sharpened version. The week the most heavily safeguarded frontier model on the market was withdrawn from the entire planet over a narrow, non-universal jailbreak, a bypass that unlocks one sliver of capability in one circumstance. The abliterated open-weight models on Hugging Face stayed exactly where they were. The platform’s own “obliterated” tag now returns over 7,000 of them. [3] No export order reaches those. There is no account to suspend, no API to revoke, and no US-jurisdiction entity in the chain to serve. The governed model can be removed from hundreds of millions of users by one letter. The ungoverned model cannot be removed from anyone by anything.

Abliteration is not a jailbreak, and the difference is the whole argument. A jailbreak is an input attack — a crafted prompt that tricks an aligned model into complying — and the provider can patch it, filter it, or ban the account that sent it. Abliteration is surgery on the model itself. In 2024, researchers showed that a chat model’s willingness to refuse is mediated by a single direction in its internal activations: erase that direction from the weights, and the model loses the ability to say no while keeping nearly all its other capabilities. [4] The edit is baked into the file. Once those weights are on a hard drive, there is no refusal left to bypass and nothing for a vendor to fix. A jailbreak is a lock that can be picked. Abliteration removes the door.

What changed since 2024 is not the idea but the cost. The original technique required a researcher who understood transformer internals; the current tooling takes a command line — one openly published tool decensors a small model in under an hour on a single consumer GPU, no expertise required — and a single registry now hosts more than 200 ablated models. [5] The tempo asymmetry is the part defenders underweight: a frontier lab spends months red-teaming a release — Anthropic says thousands of hours on Fable — while the community publishes the de-safetied counterpart of a comparable open-weight release within a day or two of launch.

The honest objection is that some models try to resist this, but it does not hold. Published defenses — circuit breakers, extended-refusal training — work in the lab. But the labs don’t ship them inside the frontier open weights that actually get downloaded, the strongest results are on small models, and by 2026, public tooling already claims to defeat them, driving even Google’s hardened Gemma 4 to single-digit refusal rates. [6] Hardening raises the price of the operation. It does not close it.

The readers of this newsletter will see this as the activation of the coercion stack described in Access, Disable, Destroy, but through a route the original map didn’t draw. The off switch was not held by the model provider, nor by a sanctioning authority pointed at a foreign adversary. According to the Wall Street Journal and The Information, the finding that triggered the order came from Amazon — Anthropic’s largest investor and its primary cloud partner, on whose Bedrock platform the model most likely ran. Amazon’s researchers found the bypass; CEO Andy Jassy raised it with senior officials, including Treasury Secretary Scott Bessent; Commerce Secretary Howard Lutnick sent the letter. [7]

Reporting a live cyber bypass is defensible: a researcher who finds one should disclose it, whoever signs their checks. The hazard is not Amazon’s motive. It is that one firm now bankrolls the vendor, hosts its models, and supplies the findings behind the federal action against it: three chairs, one occupant, a governance problem, whether anyone acted in bad faith. That row did not exist on the original coercion-stack table. It does now.

Concede the government’s strongest case: Mythos is a genuinely dangerous capability. Anthropic itself withheld it from public release and lobbied for Mythos-class models to be treated as cyberweapons. A jailbreak that unlocks even a sliver of that capability on a model deployed to hundreds of millions of people is a real finding, not a clerical one. David Sacks, the administration’s former AI czar, put the case bluntly: a bypass “allowing operability of a cyber weapon” is hard to call anything but serious, and Anthropic’s minimizing language was “not consistent with Anthropic’s brand as the AI safety company.” [8] That last clause is the hinge, and it is where the story turns from a regulatory dispute into something closer to a Greek tragedy.

Anthropic supplied the moral framing that took its model down. On or around June 10 — a day after Fable 5 launched, two days before the letter arrived — Dario Amodei published an essay titled Policy on the AI Exponential. In it, in his own boldface, he argued that frontier models “should be required to go through technical testing and auditing, and their release should be blocked or reversed as a threat to public safety if they do not meet high standards of safety.” He named the analogy himself: the FAA grounding an unsafe aircraft. He called, in the same essay, for export controls on the AI supply chain to be “expanded, tightened, and coordinated.” [9] Two days later the government blocked his release, citing safety, using an export control.

Here is the cruelty in it. Amodei did not receive the apparatus he requested. He proposed a deliberative, statutory process: a third-party technical evaluation and explicit protection against “political favoritism or arbitrary decisions.” What arrived was a verbal directive on roughly ninety minutes’ notice, with no specific national-security detail and no third-party finding shared, the opposite of the process he described. Anthropic’s own statement says so: “This action does not adhere to those principles.” [10] But the principle the administration did use — that government may block or reverse a frontier release on safety grounds — is the one Anthropic spent years legitimizing. Sacks then turned the company’s safety branding back on it, arguing that it should have complied without argument. The lab supplied the moral framework; the government supplied a blunter instrument than the lab wanted; and the safety reputation Anthropic built became the lever its own funder’s disclosure pried.

The asymmetry is what makes something load-bearing, and it should change how you price a dependency. The durability of a closed frontier model is not a function of the vendor's uptime or balance sheet. It is a function of a regulatory surface the vendor does not control and cannot predict — and that surface, as of June 12, can be activated by the firm that funds and hosts the vendor. The substitute sitting one tier down — the abliterated open-weight model an attacker runs on a few GPUs at most — lacks such a surface. The best open-weight models now trail the closed frontier by about four months, not the chasm that gap used to be; abliteration is a separate operation layered on top, stripping refusals from a model that is already near-peer. [11] So the substitute is not as capable as Mythos, but it is close, it is permanent, and it is — in the only sense that matters to whoever is probing your systems — more reliably available than the safety-first product it imitates.

The model built to refuse can be withdrawn from everyone. The model built to refuse nothing cannot be withdrawn from anyone.

Notes

[1] Anthropic, “Statement on the US government directive to suspend access to Fable 5 and Mythos 5”, June 12, 2026; launch date (June 9) and enterprise-customer impact per MarkTechPost, June 13, 2026, and MLQ News, June 13, 2026.

[2] Anthropic statement, June 12, 2026 (full text of the scope language, including foreign-national employees); corroborated by Axios, “Trump admin blocks foreign access to Anthropic’s most powerful AI”, June 12, 2026.

[3] Hugging Face’s abliterated tag filter returned roughly 7,500 model repos as of mid-June 2026. The “over six thousand... against roughly six hundred two years ago” figure is from NPR, citing University of Nebraska Omaha (NCITE) research, “These AI models are free, private, and will never say ‘no,’” May 31, 2026. The tag count includes quantizations and mirrors, not solely unique base-model abliterations, and is rising — retrieve a current figure at publication.

[4] Andy Arditi et al., “Refusal in Language Models Is Mediated by a Single Direction”, arXiv 2406.11717 (submitted June 2024; NeurIPS 2024). The paper demonstrates across 13 open chat models up to 72B that refusal is mediated by a one-dimensional subspace; ablating that direction from the weights (weight orthogonalization) removes refusal while preserving other capabilities. The permanence-once-distributed characterization follows from the edit being to the weights themselves.

[5] Heretic (Philipp Emanuel Weidmann, “p-e-w”), released late 2025 (PyPI heretic-llm), automates directional ablation via an Optuna/TPE optimizer; its README reports ~45 minutes to decensor Llama-3.1-8B-Instruct on an RTX 3090 (20–30 minutes for Qwen3-4B), corroborated by NPR (n.3) for the “few minutes,” no-expertise characterization. Registry scale: huihui-ai hosted 235 models with 7,406 followers as of June 15, 2026. Speed-of-appearance (abliterated builds within ~24–72 hours of a major release) per huihui-ai’s published Gemma 4 / Qwen abliterations and activity feed. Anthropic’s “thousands of hours” of Fable red-teaming is from its launch posture as summarized in its June 12 statement (n.2).

[6] Defenses and the offense’s response: circuit breakers / Representation Rerouting per Zou et al., “Improving Alignment and Robustness with Circuit Breakers”, arXiv 2406.04313 (NeurIPS 2024); extended-refusal training per Abu Shairah et al. (KAUST), “An Embarrassingly Simple Defense Against LLM Abliteration Attacks”, arXiv 2505.19056 (May 2025), reporting treated models retaining >90% refusal under abliteration versus 70–80% drops for baselines — demonstrated on small/older models. Offense keeping pace: Abliterix, a Heretic derivative, claims to defeat circuit breakers and to reach a ~7% refusal rate on Google’s Gemma 4 (E4B) via direct weight editing. These refusal-rate figures are self-reported by the abliterating parties and are highly method-dependent; treated as directional, not measured.

[7] Wall Street Journal, “Amazon CEO’s talks with U.S. officials triggered crackdown on Anthropic models”, June 13, 2026 (WSJ names Bessent as one of several officials Jassy contacted); The Information, “Amazon’s Jassy raised concerns about Anthropic model, Trump crackdown”, June 13, 2026. Lutnick (Commerce) sent the directive per Anthropic’s statement and Axios. Amazon’s investment ($8B deployed to date, plus an up-to-$25B commitment agreed April 2026) and AWS cloud partnership per CNBC, Nov 22, 2024 and April 20, 2026; the same CNBC reporting confirms “Amazon does not have a seat on Anthropic’s board.” Bedrock as the likely test surface is inference, flagged as such.

[8] David Sacks, post on X, June 13, 2026.

[9] Dario Amodei, “Policy on the AI Exponential”, dated “June 2026” on the primary page; secondary trackers place publication on or around June 10, 2026. The “blocked or reversed” sentence and the FAA analogy are in Section 1 (Regulation and public safety); the “expanded, tightened, and coordinated” export-control language is in Section 5 (Securing leadership by democracies) and refers to chips and semiconductor manufacturing equipment. The essay’s separate “off switch” phrasing appears in Section 4 and refers to autonomous-weapons oversight, not model release — not conflated here.

[10] Sacks, X, June 13, 2026 (as n.8).

[11] Epoch AI, "Open models lag state-of-the-art closed models by 4 months" (Jack Edwards and Luke Emberson), data covering Jan 1–May 28, 2026: the most capable open-weight models trailed frontier closed models by an average of four months, or 8 points on Epoch's composite Capabilities Index — up slightly from the ~3-month average Epoch measured for Jan 2023–Oct 2025. This is a capability gap (open vs. closed frontier); abliteration is a distinct operation that removes refusals without adding capability, applied on top of an already-near-frontier open model. The two are not the same axis and are not conflated here.

Cash Flow Lends. Valuation Doesn’t.

Julien Simon — Fri, 12 Jun 2026 07:09:24 GMT

On June 8, Amazon signed a $17.5 billion loan it can draw at will, with no financial covenants attached. Two days later, Oracle told investors it would raise roughly $40 billion more and unveiled a new accounting measure to explain why its capex is not quite its capex. The same day, Bloomberg reported that SoftBank could not borrow $6 billion against its OpenAI stake.

The volume itself is no longer news. Morgan Stanley estimates nearly $236 billion of AI-linked debt has been issued globally through May, four times last year’s pace, and expects close to $570 billion for the full year [1]. UBS estimates that hyperscaler capex will consume close to 100% of operating cash flow in 2026, compared with a ten-year average of 40% [2]. Everyone now knows the buildout runs on debt. What last week revealed is who gets to borrow it, on what terms, and against what.

Start with the cheapest money. Amazon’s facility is a senior unsecured delayed-draw term loan led by Citibank: the company can pull funds as needed through September 30, each draw repayable over three years, at 0.625% to 0.875% over SOFR, the floating benchmark rate, depending on ratings [3]. It landed days after Amazon priced the largest Canadian-dollar corporate bond in history, a C$14 billion deal that drew more than C$28 billion in orders [4]. This is for a company whose trailing free cash flow has collapsed to $1.2 billion from $25.9 billion a year earlier, on the way to roughly $200 billion of capex this year [5]. The banks looked straight past the AI bet. Unsecured borrowing with no financial covenants is routine at Amazon’s rating — and that is the point. What stands behind the loan is the retail engine and AWS’s operating cash flow: businesses already earning, whatever the GPUs return tomorrow.

Oracle got the conditional money. Its fiscal Q4, reported June 10, beat the headline estimates: $19.2 billion in revenue, $2.11 in adjusted EPS, and remaining performance obligations — contracted future revenue — of $638 billion, up 363% in a year and up $85 billion in a single quarter [6]. The stock fell anyway, not the first time this fiscal year, a headline beat has been sold [7]. The funding side explains it. Free cash flow for the year was negative $23.7 billion. Oracle raised $43 billion in debt and $5 billion in equity in fiscal 2026, and plans roughly $40 billion more this year, including a previously announced $20 billion at-the-market equity program (new shares sold directly into the market) [8]. The buildout has crossed a line worth stating plainly: reported capex of $90–95 billion next fiscal year, against $90 billion of guided total revenue for the same year [8]. Oracle plans to spend its entire revenue, roughly, on capital expenditures.

And it introduced a new number. “Net cash outlay for capital expenditures” is guided around $70 billion for fiscal 2027, against the same $90–95 billion in reported capex; the gap is due to customers prepaying for GPUs or supplying their own [9]. Those arrangements now total $75 billion across Oracle’s large AI contracts, and the company says they substantially reduce the capital it must raise [10]. Read that twice. Oracle is publishing a table showing lenders which parts of its capex are really someone else’s. The arrangement is genuine — prepaid hardware does reduce Oracle’s funding needs — but it shifts risk rather than removing it. The $75 billion is banked; the rest of the backlog, largely anchored to OpenAI through Stargate [11], still depends on customers being able to pay, quarter after quarter, for years. When a borrower starts inventing measures to reassure the market, the market has started asking questions.

What did Amazon’s lenders see that SoftBank’s couldn’t? The answer is a hierarchy worth understanding before the second half of Morgan Stanley’s $570 billion arrives.

SoftBank asked for the third kind of money and didn’t get it. In May, it sought a $10 billion margin loan backed by its OpenAI stake; lender hesitation cut the target to $6 billion; on June 10, the talks stalled outright [12]. SoftBank may yet revive them. About $5 billion had been lined up, though it was unclear whether those commitments were verbal or written [13]. The sticking point was not OpenAI’s prospects. It was the collateral itself. OpenAI is private; its valuation is set by funding rounds rather than a liquid market, and a margin lender needs collateral that it can price daily and sell quickly. A stake last marked inside a $122 billion round at an $852 billion post-money valuation [14] turned out to be worth, for borrowing purposes, nothing yet. The stall came even though OpenAI had confirmed, two days earlier, a confidential filing for a US listing that could debut as soon as the fall [15]. Some of the same prospective lenders had said the IPO news made the loan more attractive. They still walked. The clock, meanwhile, is real: a $40 billion bridge loan taken on to fund SoftBank’s OpenAI commitments comes due in March 2027 [16]. Shares fell as much as 9.7% on the news, nine days after the company had overtaken Toyota as Japan’s most valuable [17].

Strip away the deal terms, and the three answers reduce to one question: what gets the lender repaid? Amazon’s creditors are repaid from businesses that predate the AI bet and would survive its disappointment. Oracle’s creditors are repaid from backlog: promises from customers whose own funding remains unproven. SoftBank’s would have been repaid from a mark: a number set by the last buyer in a private round, untested by any open market. The week’s pricing followed that gradient without sentiment. Cheapest against yesterday’s cash. Conditional, and increasingly explained, against tomorrow’s contracts. Refused against a number on a page.

None of this says the lending stops. Morgan Stanley expects issuance to accelerate in the second half [1]. It says the lending discriminates, and what it discriminates against is distance from cash. Each rung down the ladder, the borrower pays more, explains more, and pledges more. At the bottom rung, the market said no to collateral with a listing already in motion. A ticker will do what the mark could not; the prospect of one was not enough. This reading would be wrong if SoftBank closes its loan at or near $6 billion before the IPO prices, or if Oracle’s $40 billion raise comes as ordinary investment-grade debt without leaning on equity. Either outcome would mean the market lends against marks and backlog as readily as against cash flow after all.

Until then, the hierarchy stands. Cash flow lends. Backlog negotiates. Valuation waits for its ticker.

Notes

[1] Morgan Stanley research note, June 10, 2026, as reported by Reuters via Tech Startups: ~$236 billion in AI-linked global debt issuance through May 31, 2026, roughly four times the prior-year pace; full-year 2026 forecast of nearly $570 billion. Analyst estimate, not a measured total.

[2] UBS estimate, as reported by TechTimes: 2026 hyperscaler capital spending on pace to consume close to 100% of operating cash flows, versus a 10-year average of about 40%. Analyst estimate.

[3] Amazon Form 8-K, filed June 10, 2026: term loan agreement dated June 8, 2026, Citibank N.A. as administrative agent; $17.5 billion senior unsecured delayed draw term loan facility; commitments expire September 30, 2026; three-year maturity per draw. SOFR margin of 0.625–0.875% depending on ratings and the absence of financial covenants per the filing as reported by Yahoo Finance; the agreement retains customary covenants and events of default. Joint lead arrangers: Citibank, JPMorgan, BofA Securities, HSBC, Wells Fargo.

[4] Bloomberg: C$14 billion (~$10 billion) priced June 8, 2026, the largest corporate debt offering on record in Canadian dollars, with more than C$28 billion in orders. Reuters confirms from the final pricing term sheet filed with the SEC: five tranches, maturities 2029–2056, surpassing Alphabet’s C$8.5 billion record set a month earlier.

[5] Yahoo Finance, per Amazon’s Q1 2026 results: trailing-twelve-month free cash flow of $1.2 billion versus $25.9 billion a year earlier; Q1 2026 capex of $44.2 billion; ~$200 billion full-year 2026 capex plan disclosed with Q4 2025 earnings.

[6] Oracle Q4 FY2026 earnings press release (Form 8-K exhibit, June 10, 2026): RPO of $638 billion, up 363% year-over-year and up $85 billion sequentially. Revenue and EPS beats per Sherwood News: revenue $19.2 billion vs. $19.1 billion expected; adjusted EPS $2.11 vs. $1.96 expected ($2.03 excluding one-time net investment gains). One miss beneath the headlines, per Yahoo Finance: total cloud revenue of $9.91 billion came in below the $9.99 billion consensus, with cloud applications light and cloud infrastructure ahead.

[7] Sherwood News and TheStreet. Shares fell in after-hours trading on June 10 despite the beats. Precedent: in Q2 FY2026, a 32.4% EPS beat was followed by a -10.8% day-of move (24/7 Wall St.). Note the broader tape: May CPI printed at a three-year high the same day, and all three major US indices fell, so the decline was not purely company-specific.

[8] Oracle Q4 FY2026 earnings press release: fiscal 2026 free cash flow of negative $23.7 billion; $43 billion raised in debt financing and $5 billion in equity financing in fiscal 2026; approximately $40 billion in combined debt and equity financing planned for fiscal 2027, including the previously announced $20 billion at-the-market equity issuance; fiscal 2027 total revenue guidance confirmed at $90 billion. The release adds that Oracle “does not expect to issue additional debt in calendar year 2026” — making the near-term portion of the raise equity-led by the company’s own statement. The $90–95 billion reported-capex figure for fiscal 2027 is from the earnings call (see [9]).

[9] Oracle Q4 FY2026 earnings call, June 10, 2026: expected net cash outlay for capital expenditures of around $70 billion in fiscal 2027, with customer prepayments and timing impacts of $20–25 billion raising reported capex above that figure; the press release includes a reconciliation table for the new measure, and CFO Hilary Maxson detailed it on the call (CNBC). Fiscal 2026 net cash outlay was $48 billion after ~$8 billion of prepayment and timing impacts, against $55.7 billion of reported capex — up 162% year-over-year, with depreciation nearly doubling to $7.62 billion.

[10] Oracle Q4 FY2026 earnings press release: prepaid and customer-supplied hardware portions of large AI contracts total $75 billion, which the company states “substantially reduces the amount of capital Oracle must raise” for its AI datacenter buildout.

[11] Sherwood News: the RPO balance is largely anchored by Oracle’s OpenAI partnership under the $500 billion Stargate initiative. Bank of America analysts estimate that over 50% of the remaining performance obligation comes from OpenAI (CNBC); analyst estimate, not a company disclosure.

[12] Bloomberg, June 10, 2026: talks to raise at least $6 billion via a margin loan backed by SoftBank’s OpenAI stake have stalled; the initial $10 billion target was cut by 40% in May after lender hesitation. SoftBank may resume the margin loan later and is considering other fundraising options. A fair objection: SoftBank itself is rated BB+ with a negative outlook from S&P (revised March 3, 2026, on the additional $30 billion OpenAI commitment), so borrower quality may have weighed on the talks. But margin lending looks to the collateral first, and the concern lenders voiced, per Bloomberg’s reporting, was the difficulty of valuing an unlisted company — not SoftBank’s own credit.

[13] Bloomberg via Yahoo Finance: approximately $5 billion had been secured before talks stalled, though it was unclear whether commitments were verbal or written.

[14] OpenAI announcement, March 31, 2026: the round closed with $122 billion in committed capital at a post-money valuation of $852 billion; confirmed by Bloomberg. SoftBank contributed $30 billion of the round. The figure is a private-round valuation, not a market price — which is the point.

[15] Bloomberg via The Edge Singapore: OpenAI said on Monday, June 8, that it filed confidentially for a US IPO and is working with Goldman Sachs and Morgan Stanley on a potential listing as soon as the fall. Some prospective lenders on the margin loan had said they viewed it more favorably after news of the IPO preparation; the talks stalled regardless.

[16] Bloomberg via Yahoo Finance and Quartz: a $40 billion bridge loan taken on to fund SoftBank’s OpenAI commitments comes due in March 2027; SoftBank has indicated it intends to cover it from existing assets plus additional financing. Counterpoint on severity from Hua Cheng, head of Asia credit research at AllianceBernstein, who called the stalled margin loan one piece of a larger puzzle and not a standalone red flag.

[17] Bloomberg via Yahoo Finance: shares declined as much as 9.7% on June 10; SoftBank had overtaken Toyota as Japan’s most valuable company by market capitalization on June 1 (Bloomberg, Nikkei Asia) — the first time in more than two decades. SoftBank’s credit default swaps had narrowed to about 307 basis points from a May 20 peak above 367 — the credit market was already charging for the OpenAI concentration before the loan stalled.

Nvidia Won the Cloud. Now It Wants the Laptop.

Julien Simon — Thu, 11 Jun 2026 11:26:51 GMT

Jensen Huang stood at the Taipei Music Center on the last day of May and announced that Nvidia intended to “reinvent the single most important tool of humanity.”[1] The tool in question is the personal computer, and the product behind the sentence is a laptop chip: RTX Spark, a 20-core Arm processor fused to a Blackwell GPU around a 128-gigabyte pool of memory, co-announced with Microsoft.[2] It ships this fall in machines from Asus, Dell, HP, Lenovo, and MSI, with a Surface Laptop Ultra as the flagship. Shares of AMD, Intel, and Qualcomm fell on the news.[3] The market read the announcement as a land grab in the PC business.

Eight weeks earlier, this publication described the single move that could pull local AI back into Nvidia’s orbit. In “Your Parents Paid,” we documented how Nvidia’s own product segmentation had handed the fastest-growing consumer AI workload to Apple and AMD, and we listed three conditions under which that would reverse. The third: “the CUDA moat extends into inference. If NVIDIA ships inference-specific optimizations — through TensorRT-LLM, NIM, or a CUDA-exclusive quantization format — that make the performance gap too large to ignore, practitioners return to NVIDIA hardware regardless of memory capacity.”[4]

RTX Spark is condition three, shipped as a product line. But it arrived with a twist we didn’t predict: Nvidia isn’t closing the performance gap. It’s making the gap irrelevant.

The spec sheet and the missing number

Start with what Nvidia published. The RTX Spark product page lists up to 6,144 CUDA cores on the Blackwell GPU, up to 20 CPU cores, up to 1 petaflop of FP4 AI performance, and up to 128 gigabytes of unified memory.[2] On stage, Huang claimed the chip runs 120-billion-parameter models locally.[5] That claim deserves a moment of respect. In April, we showed that the 120B model class needed 60-70 gigabytes at usable quantization and therefore did not fit on any consumer Nvidia product. The 32-gigabyte ceiling on the RTX 5090 was the centerpiece of Nvidia’s segmentation, the design choice that pushed private-inference buyers toward a $3,699 Mac Studio.[4] RTX Spark removes that ceiling. The capacity objection is gone.

Now look for the number that isn’t there. The product page lists cores, petaflops, and gigabytes. It does not list memory bandwidth.[6] For local language models, bandwidth is the most important metric: token generation reads the entire working set of model weights from memory for every token, making decode speed a near-linear function of memory throughput. Capacity decides whether a model loads. Bandwidth decides whether you can stand to use it. Nvidia headlined the first number and buried the second, the same disclosure pattern it used for the DGX Spark desktop, whose 273 GB/s figure appeared in technical documentation rather than marketing.[7]

Launch coverage and pre-launch leaks fill the blank and explain the silence. The full-spec N1X silicon inside RTX Spark is, by all accounts, the same configuration as the DGX Spark’s GB10: a 256-bit interface of LPDDR5X (laptop-class memory) delivering roughly 273 GB/s, with launch-day spec coverage citing up to 300.[8] Apple’s M4 Max delivers 546 GB/s. The M5 Max delivers 614. The M3 Ultra delivers 819.[9] On the dimension that determines how fast a local model actually runs, the machine Nvidia just announced trails the machines it was announced to displace by a factor of 2 to 3.

RTX Spark’s memory bandwidth — absent from its product page — sits in Apple M5 Pro territory, well below the Apple machines that hold 128GB.

So the puzzle is real. Nvidia came back for local AI without closing the gap that lost it the market in the first place. Why would a company that just reported $215.9 billion in annual revenue enter a fight it has already measured itself losing on the merits?

Because the fight isn’t on the merits. RTX Spark is not a bid for the PC market. It is the recapture of the one AI workload that was escaping Nvidia’s orbit, and the recapture runs on defaults, not on benchmarks. The machinery has four layers, and only one of them is silicon.

Four layers of default

The first layer is the hardware itself, and the most important word on the product page is “natively.” Nvidia’s copy reads: “CUDA, the software that accelerates the world’s AI, runs natively on RTX Spark.”[10] Every prior path to large-model local AI on a thin-and-light Windows machine ran through someone else’s silicon and someone else’s runtime: Qualcomm’s NPU through ONNX, AMD’s iGPU through Vulkan, Apple’s unified memory through Metal. Each of those paths was hardware-agnostic by necessity, which is precisely what made local inference the first workload to slip Nvidia’s gravity. RTX Spark ends the necessity. For the first time, the premium Windows laptop tier ships with the same CUDA stack that runs the datacenter, and 128 gigabytes to feed it.

The escape route and the orbit now share a machine.

The second layer is the runtime, where the recapture stops being a hardware story. On RTX AI PCs, Nvidia’s NIM microservices run as containers in Windows Subsystem for Linux, with CUDA acceleration, and package models with everything needed to run them.[11] The quieter announcement is the one that matters: Microsoft’s Windows ML inference stack now automatically routes to Nvidia’s TensorRT for RTX whenever it detects RTX hardware.[12] Read that sentence again at the level of incentives. A Windows application developer who calls the operating system’s standard AI interface does not have to select an inference backend. The operating system selects it, and on this machine, the selection is CUDA. The developer didn’t choose Nvidia. Windows chose it for them. Jensen’s defense writes itself: developers begged for this. Local CUDA parity with the datacenter was among the loudest requests in Nvidia’s developer ecosystem, and the convenience is not an illusion. But convenience is how every default gets built.

The trap and the gift are one and the same.

That shift, from chosen dependency to ambient dependency, is the difference between the lock-in we described in “Open Source, Closed Orbit” and the lock-in being assembled now.[13] The original Black Hole worked on practitioners: researchers and engineers who chose CUDA because the tools were better, then found the exit priced in switching costs. The new layer works on people who never make a choice at all. The mainstream Windows developer building an AI feature in 2027 will write to Windows ML, ship to machines that route to TensorRT, and acquire a CUDA dependency the way one acquires an accent. Nobody decides to have one.

The third layer is the agent platform, and it explains the timing. RTX Spark’s marketing mentions chatbots only briefly. The page promises a PC where “agents work alongside you — running tasks, generating assets, writing code, on demand,” and pitches the desktop variants as machines “built to run personal AI agents 24/7 right at your desk.”[14] The plumbing has a name: NVIDIA OpenShell, an agent framework coming to Windows on top of Microsoft’s new security primitives, packaging local autonomous agents with guardrails gating what they can touch — alongside NIM containers as local agent endpoints and native NIM support arriving in Azure AI Foundry in July.[15] The agent era re-platforms the PC around continuous local inference, the bandwidth-hungry, always-on workload pattern that decides hardware defaults for a decade. Whoever owns the default runtime when that re-platforming happens owns the next ten years of Windows AI development.

The Windows re-platforming is being co-authored by Nvidia.

The fourth layer is the funnel, and it is the oldest trick in the catalog. NIM’s developer tier is free and genuinely useful: unlimited endpoints for prototyping, hosted on DGX Cloud. Production is a different conversation. Nvidia’s own product page walks the path in two sentences: prototype freely, then “talk to an NVIDIA product specialist about moving from pilot to production with the security, API stability, and support that comes with NVIDIA AI Enterprise.”[16] The same NIM container that runs on the laptop runs in the datacenter and the cloud, which Nvidia presents as portability and which functions as a ratchet. A team prototypes an agent on a Spark laptop, the prototype works, and scaling it means an enterprise agreement that cross-sells the rest of the stack.

It is the catalog-and-contract structure we have documented across eight Nvidia infrastructure domains.[13] The laptop makes it nine, and it sits at the top of the funnel, where developers form habits.

Put the four layers together, and the design is legible. Nvidia fixed the capacity problem, kept the bandwidth problem, and wrapped both in the Windows default. It can concede the benchmark because it is buying the path. A 2x decode deficit against a Mac Studio matters to the practitioner who measures tokens per second. It matters not at all to the Windows developer whose operating system, container catalog, agent framework, and cloud funnel have already agreed on the answer before the question was asked.

The four-layer recapture — hardware, runtime, agents, funnel — each level routing to the same stack. None of them is a benchmark.

There is a precedent for this, and Jensen named it himself two months before the announcement. “GeForce is NVIDIA’s greatest marketing campaign,” he told the GTC crowd in March. “Your parents paid for you to be an NVIDIA customer... until someday you became an amazing computer scientist and became a proper customer.”[17] GeForce recruited the CUDA generation: gamers who became graduate students, who became the engineers who made CUDA the default in the datacenter. The pipeline aged. Gaming now accounts for 7% of Nvidia’s revenue, and the recruits were buying Macs.[18] Spark is that pipeline rebuilt for the agent era. The laptop is the new GeForce, except this time the product being marketed isn’t a graphics card a teenager will outgrow. It’s a default that a developer will never notice.

What the machine actually does

Honesty about the product, because it’s good. RTX Spark’s compute is real: the full configuration matches the 6,144 CUDA cores of a desktop RTX 5070, and on the GB10 silicon it shares with the DGX Spark, prefill — the compute-bound phase where a long prompt is ingested — runs at roughly 2,000 tokens per second on a 20B model.[19] For workloads that are mostly ingestion (summarizing long documents, retrieval over a document base, batch classification), that is a serious machine in a laptop chassis. The 45-to-80-watt envelope, the all-day-battery claim, and the full RTX gaming stack make it the most credible Windows-on-Arm product ever shipped, where Qualcomm’s Snapdragon X struggled to give mainstream buyers a reason to switch.[20] And 128 gigabytes of addressable memory on a Windows laptop is a first. None of this is vaporware.

The decode numbers are equally real, and they cut the other way. On the same GB10 silicon, LMSYS measured the DGX Spark generating just under 50 tokens per second on a 20B model at 4-bit precision — against 215 for an RTX Pro 6000 and 205 for an RTX 5090, a gap of roughly 4x that the reviewers attributed directly to the LPDDR5X memory interface.[19] Decode speed doesn't improve with the laptop’s power envelope because memory bandwidth doesn’t scale with wattage; the laptop will generate tokens at desktop-GB10 speed, which is to say at half to a third the speed of the Apple Silicon machines it shares a price bracket with.[21]

A reasoning model that thinks for ten thousand tokens before answering will make a Spark user wait three to four minutes per answer. The “agents running 24/7 at your desk” pitch quietly depends on the user not watching them work.

A note on a number you will see misquoted. Nvidia’s specifications include a 600 GB/s figure, and parts of the trade press have already printed it as the memory bandwidth.[22] It isn’t. The 600 GB/s is NVLink-C2C, the interconnect between the CPU and GPU complexes on the package. The memory interface feeding both remains LPDDR5X at roughly 273 to 300 GB/s. Bandwidth between two processors and bandwidth to the memory that holds the model are different numbers, and conflating them doubles the product’s apparent throughput. The confusion is not an accident of complicated engineering. Retail listings may yet publish the figure; the DGX Spark precedent, where the number surfaced in technical documentation after launch, suggests the pattern is policy. A company that headlines its interconnect bandwidth and buries its memory bandwidth knows which comparison it would lose.

So the honest scorecard reads: best-in-class prefill for the form factor, true 120B capacity, decode bandwidth in M5 Pro territory at M5 Max prices, and a marketing sheet built to keep you from computing that last number. Our April verdict — no amount of software makes 273 GB/s faster than hardware with three times the bandwidth — survives contact with the new product. What changed is that Nvidia stopped trying to win the comparison and started making sure the buyer never runs it.

The practitioners who walk away

The recapture has a boundary, and the boundary is choice. Everything in the four-layer machinery operates on defaults: the default premium laptop, the default OS inference path, the default agent runtime, the default scaling story. Nothing in it binds the practitioner who actively chooses a stack. llama.cpp still runs everywhere. Vulkan still outruns vendor stacks on AMD silicon.[4] Apple’s MLX is becoming the default backend of Ollama, the most popular local-model tool, with measured decode gains of 93% on supported models.[23] The buyer who reads benchmarks before purchasing will keep buying the machine with 614 GB/s, and nothing Nvidia shipped last week changes that calculus.

But count the populations. The benchmark-reading tier is a niche. The Windows installed base is more than a billion machines, refreshed through OEM defaults and corporate procurement cycles that have shipped “the premium Intel laptop” for thirty years and will ship “the premium RTX Spark laptop” with equal indifference to memory-bus arithmetic. Distribution decides defaults, and defaults decide ecosystems. Qualcomm proved that distribution alone doesn’t move an ecosystem: two years of Snapdragon X laptops put Arm Windows machines in every retail channel without giving developers a reason to target them. Nvidia inherits the compatibility groundwork Qualcomm paid for and arrives with the developer reason pre-installed: a machine carrying the stack the world’s AI is already written on, an OS that routes to it silently, and an agent platform whose launch partner is the OS vendor itself.

The exception that proves the design: the people most likely to escape recapture are the people Nvidia’s original segmentation already pushed out. The medical practice that bought a Mac Studio in 2025 for private 70B inference has no reason to return; its stack is Metal, its tooling is MLX, and its data never left the building. Nvidia has written them off; the project is making sure the next ten million developers never become them.

The ninth domain

Readers of this publication have seen this architecture before. “Open Source, Closed Orbit” mapped how Nvidia replicated the open-source ecosystem’s eight critical infrastructure functions — model hosting, developer tooling, inference serving, fine-tuning, evaluation, and the rest — each replica routing back to Nvidia hardware.[13] The framework’s gap was geographic: the eight domains lived in the cloud, and the datacenter, and the device on the desk remained contested ground. That contest is what “Your Parents Paid” documented from the other side: at the device tier, where workloads run through llama.cpp and MLX rather than NIM and Triton, the pull was visibly loosening. Local inference wasn't escaping because someone built a better CUDA; it was because the workload didn’t need CUDA at all.[4]

RTX Spark closes the map. The device is the ninth domain, and the replication strategy is identical to the first eight: take a function the open ecosystem performs in a hardware-agnostic way, ship a vertically integrated version that is easier than the agnostic one, and let convenience do what compulsion couldn’t. The two pieces are mirror images. April’s story was segmentation pushing the workload out: a 32-gigabyte ceiling, a missing NVLink, a bandwidth-starved Spark desktop, each a deliberate gap that protected datacenter margins. June’s story is integration pulling the workload back: full capacity, native CUDA, OS-level routing, and an agent platform. Opposite moves. Same gravity. In both directions, the constant is that Nvidia designs the consumer product around what it does to the datacenter business, because the datacenter is 90% of revenue, and the consumer device is, in Jensen’s own framing, a marketing campaign with a motherboard.[17][18]

What would have to break

The recapture thesis is falsifiable, and we’ll state the conditions plainly.

First, it breaks if the default path opens up. If Windows ML’s hardware routing stays neutral in practice — if a developer writing to the standard Windows AI interface gets equivalent first-class treatment on Qualcomm NPUs and AMD silicon, and the TensorRT route confers no meaningful advantage — then the second layer of the machinery never engages, and RTX Spark is just a fast laptop. The incentive structure argues against this. Microsoft has reasons to keep Windows ML formally vendor-neutral; Nvidia has reasons to make the neutral interface perform best on its hardware; and “formally neutral, practically optimized” is how platform defaults in computing have historically worked. Watch the benchmark deltas between Windows ML on Spark and Windows ML on Snapdragon through 2027. If your AI feature had to run on a non-RTX machine tomorrow, would anything break? If you don’t know, the default has already been decided.

Second, it breaks if the agnostic stack holds the mainstream, not just the practitioners. Ollama’s MLX migration, llama.cpp’s ubiquity, and an M5 Ultra refresh give Apple every chance to keep the enthusiast tier and grow it; the M5 Ultra skipped this week’s WWDC and is now expected around October on reported memory-supply constraints, which puts Nvidia’s fall launch and Apple’s 128GB-class answer in the same quarter.[24] If local AI on Windows stalls — if the agent-PC pitch lands as this decade’s 3D TV — then Nvidia will have built a beautiful funnel over a dry riverbed. The third condition is the prosaic one: adoption. Windows-on-Arm carries a decade of compatibility scar tissue, fall launches slip, Morgan Stanley’s channel checks put N1X machines at $2,899 and up, and premium-priced first-generation platforms have a long history of underselling their keynotes.[25] If OEM sell-through disappoints by the end of 2027, the ninth domain stays open.

Here is why we doubt Nvidia loses even then. In September 2025, Nvidia agreed to buy $ 5 billion of Intel common stock, roughly 4% of the company whose RTX Spark franchise it is ostensibly built to attack; the purchase closed in December after antitrust clearance.[26] The equity is the smaller half of the deal. The same agreement commits Intel to build x86 system-on-chips for the PC market “that integrate NVIDIA RTX GPU chiplets” — Intel’s own filing language.[26] Read the two moves together. If Arm-based AI PCs win, Nvidia owns the chip. If x86 holds, the incumbent’s next-generation PC silicon will carry Nvidia’s GPU by contract. The instruction set is a coin flip Nvidia has hedged; the layer it refuses to share in either branch is the one this piece is about: the GPU, the runtime, and the default path between a Windows developer and a model. That hedge is the clearest evidence of the bet. Companies hedge the parts they consider interchangeable. They never hedge the moat.

In April, we ended by noting that the local inference market was growing despite Nvidia’s product line, not because of it, and that the gravity of the Black Hole was measurably weakening at the device tier. Eight weeks later, Nvidia shipped the correction, which tells you how seriously it took the leak. It did not ship more bandwidth. It shipped a default.

Local inference still doesn’t need CUDA. Nvidia just rebuilt the machine it runs on so that the path of least resistance does.

Notes

[1] Jensen Huang, GTC Taipei keynote at Computex 2026, Taipei Music Center, May 31–June 1, 2026 (June 1 local time). “Reinvent the single most important tool of humanity” quoted by Tom’s Hardware.

[2] NVIDIA RTX Spark product page (nvidia.com/en-us/products/rtx-spark, accessed June 10, 2026): up to 6,144-core Blackwell RTX GPU, up to 20-core CPU, up to 1 petaflop FP4, up to 128GB unified memory. Laptop partners: Asus ProArt P16, Dell XPS 16, HP OmniBook X 14, Lenovo Yoga Pro 9n, Microsoft Surface Laptop Ultra, MSI Prestige N16 Flip AI+; desktop partners include Acer and Gigabyte. Announced May 31, 2026 with Microsoft (NVIDIA Newsroom); availability fall 2026. CPU complex: 20 Arm cores (10x Cortex-X925 + 10x Cortex-A725), co-designed with MediaTek, per HotHardware. Nvidia also showed a two-year cadence roadmap with successor chips in 2028 and 2030.

[3] AMD, Intel, and Qualcomm share declines on the announcement: CNBC, June 2, 2026.

[4] “Your Parents Paid,” The AI Realist, April 3, 2026. The three reversal conditions appear in the closing section, “What would have to break.” Companion hardware guide: “What to Buy for Local LLMs (April 2026).”

[5] 120-billion-parameter local model claim: Nvidia keynote and product materials, reported by Notebookcheck. At Q4-class quantization a 120B dense model requires roughly 60–70GB of memory; 120B-class MoE models fit comfortably in 128GB. Vendor claim; independent throughput benchmarks on shipping hardware not yet available.

[6] NVIDIA RTX Spark product page, accessed June 10, 2026. The specifications section lists GPU cores, CPU cores, FP4 throughput, and memory capacity. No memory bandwidth figure appears anywhere on the page.

[7] NVIDIA DGX Spark: 128GB LPDDR5x, 273 GB/s, documented in the DGX Spark User Guide rather than launch marketing. See “Your Parents Paid,” note 18.

[8] N1X full-spec configuration matching DGX Spark’s GB10 (256-bit LPDDR5X-8533, ~273 GB/s): Tom’s Hardware pre-launch specification reporting. Tom’s Hardware’s launch article states “up to 300 GB/s of memory bandwidth” in its spec rundown, suggesting the ceiling figure was briefed to press; it appears nowhere on the product page (note 6). One analysis cites LPDDR5X-9400 (~301 GB/s). The GB10-lineage claim is consistent across sources but not officially confirmed.

[9] Apple memory bandwidth, manufacturer specifications: M4 Max 546 GB/s; M5 Max with 40-core GPU — the only configuration offering 128GB — 614 GB/s (the 32-core variant is 460 GB/s); M5 Pro 307 GB/s; M3 Ultra 819 GB/s (Apple tech specs; Apple Newsroom, M5 Pro and M5 Max). See “Your Parents Paid,” notes 32–34, for pricing at the 128GB tier.

[10] NVIDIA RTX Spark product page: “CUDA, the software that accelerates the world’s AI, runs natively on RTX Spark.” Developer section: “The same NVIDIA CUDA stack the world’s AI is built on, so you can develop and prototype on the same machine... prototype, fine-tune, and inference on the latest models locally.”

[11] NVIDIA NIM microservices on RTX AI PCs run through WSL2 with CUDA acceleration: NVIDIA Developer Blog. That deployment path was established on x86 RTX PCs; Arm-native NIM containers are already in production on the DGX Spark, which runs the same GB10-lineage silicon as RTX Spark.

[12] Windows ML, powered by ONNX Runtime, automatically uses the TensorRT for RTX inference library on GeForce RTX GPUs: NVIDIA blog, Microsoft Build coverage. TensorRT for RTX is natively supported by Windows ML.

[13] “Open Source, Closed Orbit: The Hardware Monopolist’s Guide to Owning Open Source,” The AI Realist. The eight-domain replication framework and the catalog-and-contract lock-in structure.

[14] NVIDIA RTX Spark product page: “Welcome to the PC where agents work alongside you — running tasks, generating assets, writing code, on demand... There’s intelligence on both sides of the keyboard now.” Desktop section: “Built to run personal AI agents 24/7 right at your desk.”

[15] “NVIDIA OpenShell is coming to Windows on top of Microsoft’s new security primitives, giving developers a single, easy-to-deploy package for running autonomous agents safely”: NVIDIA, Computex 2026 announcements, May 31, 2026; OpenShell appears in NVIDIA’s trademark list (NVIDIA Newsroom). NIM containers as local agent endpoints and native NIM support in Azure AI Foundry from July 2026: Microsoft Build 2026 coverage (Windows News); the Foundry date is per Build coverage, not yet confirmed in Microsoft primary documentation.

[16] NVIDIA NIM product page (nvidia.com, accessed June 10, 2026): “Get unlimited access to NIM API endpoints for prototyping, accelerated by DGX Cloud. When ready for production, download and self-host NIM on your preferred infrastructure... Talk to an NVIDIA product specialist about moving from pilot to production with the security, API stability, and support that comes with NVIDIA AI Enterprise.”

[17] Jensen Huang, GTC 2026 keynote, March 16, 2026: “GeForce is NVIDIA’s greatest marketing campaign... Your parents paid for you to be NVIDIA customers.” Full quote and sourcing in “Your Parents Paid,” note 1.

[18] NVIDIA Q4 FY2026 earnings (Form 8-K, filed February 25, 2026, SEC EDGAR): fiscal 2026 revenue $215.9B, of which Data Center $197.3B (91%) and Gaming $16.0B (7%).

[19] LMSYS, “NVIDIA DGX Spark In-Depth Review,” October 2025: GPT-OSS 20B (MXFP4) in Ollama at 2,053 tok/s prefill and 49.7 tok/s decode on DGX Spark, versus 10,108/215 on RTX Pro 6000 and 8,519/205 on RTX 5090. The reviewers attribute the decode gap to the unified LPDDR5x memory interface. Figures are for the GB10 desktop; RTX Spark shares the silicon per note 8 but laptop-specific benchmarks are not yet published.

[20] Power envelope 45–80W and integrated-GPU-only positioning (no discrete pairing planned): Engadget and Tom’s Hardware launch coverage. Qualcomm Windows-on-Arm context: Microsoft’s Windows-on-Arm exclusivity with Qualcomm expired in 2024, as Qualcomm executives publicly confirmed, opening the door to this product.

[21] Decode speed invariance with TDP: token generation is memory-bandwidth-bound, and LPDDR5X bandwidth does not change with the power envelope. Prefill, which is compute-bound, takes a 15–25% reduction at laptop wattage per independent analysis. Community analysis; consistent with the bandwidth-bound decode model established in “Your Parents Paid,” notes 18 and 36.

[22] 600 GB/s NVLink-C2C (CPU-to-GPU interconnect) listed by Nvidia and reported by VideoCardz; misreported as peak memory bandwidth by at least one major outlet (Notebookcheck: “With NVLink, its memory bandwidth peaks at 600 GB/s”).

[23] Ollama’s transition of its Apple Silicon backend from llama.cpp to MLX, with preview decode improvements of 93% on supported models: ollama.com/blog/mlx, March 2026. Methodological caveats in “Your Parents Paid,” note 38.

[24] Apple’s M5 Ultra Mac Studio, widely anticipated at WWDC (keynote June 8, 2026), did not appear; reporting attributes the slip to RAM supply constraints, with October 2026 viewed as the likely window (Macworld, June 8, 2026). Nvidia, for its part, says it does not expect RTX Spark laptop supply to be limited despite the same global memory shortage (Yahoo Finance; vendor claim). The rumored M5 Ultra retains the UltraFusion dual-die design, two M5 Max dies with interconnect bandwidth above 1,000 GB/s (TrendForce, citing Commercial Times) — rumored, not announced.

[25] Pricing per a Morgan Stanley report based on channel checks with PC brands at Computex: “AI PCs with N1X will need to price at US$2,899, while N1 models will be priced at US$1,799” (Wccftech; VideoCardz, June 2–3, 2026). Nvidia has not published pricing. Microsoft confirmed a fall release for the Surface Laptop Ultra while declining to discuss pricing (PCWorld, Build 2026).

[26] Securities Purchase Agreement dated September 15, 2025; announced September 18: NVIDIA purchased 214,776,632 Intel shares at $23.28, a $5.0 billion aggregate price (Intel Form 8-K, September 2025). The FTC, which had examined whether the roughly 4% stake raised antitrust concerns, cleared the deal December 18, 2025; the purchase closed December 26 (The Register; CNBC). The product commitment is in the same announcement: “For personal computing, Intel will build and offer to the market x86 system-on-chips (SOCs) that integrate NVIDIA RTX GPU chiplets” (Intel 8-K Exhibit 99.1). No ship dates for products under the agreement have been announced.

Macron Said Confirmed. SoftBank Said Up To.

Julien Simon — Tue, 09 Jun 2026 13:49:34 GMT

Update: 10 June 2026. Nine days after the summit, the strain is already visible. The margin loan SoftBank wanted against its OpenAI stake has stalled: cut from $10 billion to $6 billion on lender hesitation, then stuck at roughly $5 billion. A loan against the marquee asset is supposed to be the easy money. All of it confirms what the headline number was hiding. Read on.

On the morning of June 1, 2026, in the gilded halls of Versailles, Emmanuel Macron told the assembled chief executives that this year’s Choose France summit would “crystallize a record amount of 93 billion euros of confirmed investments.”[1] The word that mattered was confirmed — confirmés. It is the word that turns a press release into a balance sheet, an intention into a number a finance minister can book.

Strip the SoftBank pledge out of the total, and the record disappears. The Japanese conglomerate’s commitment — up to €75 billion to build five gigawatts of AI data-center capacity across France — would be four-fifths of the headline on its own; even the firm tranche France’s own press counted into the total, €45 billion, is roughly half of it.[2] It is also the reason the number is a record at all: this single edition of Choose France exceeded the announced investment promises of the eight previous summits combined, which together totaled around €87 billion.[3] One pledge, from one company, made one summit larger than eight.

And that pledge is not €75 billion of confirmed money. By SoftBank’s own announcement, issued the day before the summit, only the first phase — €45 billion to deliver 3.1 gigawatts — is a commitment. The remaining €30 billion describes “additional sites” the company plans to develop.[4] The language shift inside a single press release, from “commitment” and “investment” to “plans,” is the whole story compressed into one document.

The last time Masayoshi Son stood beside a head of state and named a number this large, it was $500 billion. Sixteen months later, a fraction of one of its seven sites was running.

The anatomy of a record

A Choose France headline is not a measurement. It is a sum of commitment tiers, each with a different probability of becoming a building, presented to the cameras as a single figure. Disaggregate the €93 billion and five tiers separate cleanly: one firm, one a ceiling, one smaller but real, one barely more than a letter of intent, and one recycled from a previous summit.

At the firm end sits SoftBank’s €45 billion first phase — named sites, a named industrial partner in Schneider Electric, a developer in SB Energy, and a 2031 horizon.[5] This is the most concrete pledge at the summit, and it deserves to be treated as a real intent. Below it sits the €30 billion remainder of the SoftBank ceiling, which exists only as “plans for additional sites.” Below that sits a layer of genuine but smaller data-center commitments: Brookfield’s pledge, Nebius’s €8 billion site on a former Bridgestone plant at Béthune, an Ardian-Verne campus in the Paris region.[6] And below that sits the softest tier — capacity that is announced but not yet sited or committed: the MGX–Bpifrance “imminent selection of a second site,” worth around €7.5 billion, and a Revolut commitment contingent on the fintech obtaining a French banking license.[7][8]

The recycling is not a footnote to this structure; it is part of how the record was assembled. Brookfield’s France AI total is now quoted at €30 billion — but €20 billion of that was announced at the February 2025 AI Action Summit, the same event that produced the €109 billion headline; only €10 billion is new to this summit.[6] The MGX–Bpifrance money is the expansion of Campus AI, the Bpifrance–Mistral–MGX–Nvidia joint venture first unveiled at Choose France 2025, whose flagship campus at Fouju is still in early construction.[7] The €7.5 billion “doubles” a commitment that was itself last year’s announcement. Macron’s own framing conceded the pattern: the summit, he said, represented “20 billion invested, and 20 billion of AI investments as a follow-up to the summit in February.”[9] That February summit — the €109 billion AI Action Summit whose figure France never reconciled to disbursement — is being folded back into June’s total as “follow-up.”[10] A material share of this year’s record is last year’s record, counted again.

This is the Commitment-versus-Spend Gap, the analytical move that separates an announced figure from the capital that actually moves. Summit pledges do convert — France has led European foreign direct investment for years running, and Choose France is not a fiction. But they convert at a rate and with a lag that the headline never discloses, and the disaggregation above is why the headline and the eventual deployment are different numbers. At hyperscaler and sovereign-summit scale, the gap is not an anomaly to be explained away; it is the default structure of the announcement. The headline is the ceiling of what could happen. The filing, eventually, shows the floor of what did. The distance between them is where the analysis lives — and at Versailles, the distance is most of the number.

This raises the real question. Why would the most active investor in artificial intelligence structure its largest-ever European commitment as an option it might never fully exercise? The answer is on its balance sheet.

What the same pledge looks like sixteen months later

To price a SoftBank infrastructure pledge at the moment of announcement, you do not need a forecast. You need the last one.

On January 21, 2025, Son stood in the White House alongside Donald Trump, Sam Altman, and Larry Ellison to announce Stargate: $500 billion over four years to build AI data centers across the United States, with $100 billion to be deployed “immediately.”[11] SoftBank took financial responsibility, and Son took the chairmanship. The structure was familiar to anyone who had watched Son work: of the $500 billion, only around $52 billion was committed equity — roughly $19 billion each from SoftBank and OpenAI, around $7 billion each from Oracle and MGX. The other ninety percent was to come from debt and vendor financing, not yet arranged.[12] A mega-pledge, in the Son method, does not deploy existing capital. It opens a financing campaign.

Sixteen months on, the campaign’s results are measurable. Independent satellite analysis put the flagship Abilene, Texas campus at roughly 0.3 to 0.6 gigawatts operational by April 2026 — four of its eight buildings live — against a site target of 1.2 gigawatts and an announced program of ten.[13] The other six US sites were foundations and steel on 2028 timelines. One site of seven was partially energized; the rest were under construction.

Some of that gap is just physics: gigawatt data centers take three to five years to build, and measuring a ten-year program at month sixteen will always show a low number. Abilene, taken alone, is arguably ahead of a normal curve. So the conversion rate, in itself, is not the indictment. The indictment is what happened around it.

The vehicle itself barely functioned. By early 2026, Stargate LLC — the entity unveiled with such ceremony — had reportedly hired no staff and was developing no data centers; OpenAI had bypassed it for bilateral deals with Oracle, Amazon, and Google, and had come to treat the word “Stargate” as, in one executive’s framing, an umbrella term for its compute strategy rather than a company.[14] The Abilene flagship’s planned expansion was canceled in March 2026; the UK Stargate site was paused in April due to energy costs.[15]

None of this means that nothing was built. This is the point at which the skeptical version of the story has to be disciplined, because the booster version is partly true. Abilene is a genuine, operational AI campus running Nvidia hardware; thousands of tradespeople built it; major lenders — JPMorgan, a Newmark-led syndicate — genuinely closed billions in project finance against it.[16] The accurate claim is not that the money was fake. It is that conversion was slow, partial, debt-heavy, and routed around the very vehicle that gave the announcement its name.

The $500 billion functioned as a frame. The deployed reality was a fraction of it, arriving years behind the rhetoric. That is the precedent now anchoring a French summit’s record.

The balance sheet behind the pledge

The deeper reason to discount the €75 billion is not SoftBank’s track record. It is SoftBank’s balance sheet, and specifically the distinction between the money SoftBank actually moves and the money it lends its name to.

SoftBank’s funded AI capital has gone almost entirely into one place: its equity position in OpenAI. It completed a $41 billion round in December 2025 for roughly an 11 percent stake, then in February 2026 agreed a further $30 billion that would bring the cumulative total to $64.6 billion and the stake to about 13 percent — a figure that is reached only when the follow-on completes, in tranches running to October 2026.[17] To fund it, SoftBank sold its entire Nvidia stake, shed T-Mobile shares, and drew on a $40 billion bridge facility, the first $10 billion of it borrowed in April 2026, with the facility’s fee structure deliberately escalating to punish slow repayment.[18]

By early 2026, the position had consequences a rating agency could not ignore. S&P revised SoftBank’s outlook to negative in March, affirming a BB+ rating already below investment grade and describing OpenAI as one of the group’s investments “with the weakest credit quality” — even as Moody’s held a stable view a notch lower, the disagreement itself is a measure of how contested the bet is.[19] Reported leverage was still inside SoftBank’s own 25 percent loan-to-value ceiling at the end of 2025, but the chief financial officer had publicly opened the door to exceeding it “temporarily,” and S&P warned the OpenAI follow-on could push leverage toward the 35 percent line that would trigger a downgrade. The shares fell roughly 45 percent from their October 2025 high; one bank labeled the company a “valuation trap”; and SoftBank paused a separate $50 billion acquisition to preserve capacity.[20]

The bull would correctly object that this is only the liability side. SoftBank also holds one of the most valuable single assets in technology — roughly 90 percent of Arm, a stake worth more than $150 billion at mid-2026 prices — plus some $45 billion in unrealized gains on the OpenAI position itself, and it has shown it can monetize on demand, having sold Nvidia and T-Mobile to raise cash. That is real, and it is the strongest case for SoftBank’s resilience. But it cuts toward the concentration problem, not away from it: by mid-2026, the Arm and OpenAI stakes together made up nearly two-thirds of SoftBank’s assets, and the Arm holding is already pledged — an $8.5 billion margin loan drawn against it, with room for more. The crown jewel is collateral. And the credit market noticed: SoftBank’s five-year credit-default swaps widened to an eleven-month high after the S&P action, the widest among major Japanese corporates — the cost of insuring its debt rising in step with the bet.[20]

There is a circularity worth naming. SoftBank is OpenAI’s largest outside backer and, through SB Energy, a builder of the data centers OpenAI will rent. In France, it would play both roles again: financing the anchor tenant and constructing the capacity that the tenant is expected to fill.

This is the round-trip structure that has become the default at the top of the AI market — the same shape as Oracle and OpenAI, as Nvidia and CoreWeave — where the investor, the builder, and the customer are versions of the same few balance sheets passing capacity back and forth among themselves. It works while the music plays. It concentrates the risk when it stops.

Here is what that balance sheet did not do: fund Stargate LLC. The roughly $19 billion equity tranche SoftBank pledged to the vehicle has no confirmation of ever having been wired, because the vehicle was bypassed.[21] The chief financial officer’s own description of the model is the tell: SoftBank makes an equity investment, but the project itself is “financed as project finance,” so its own commitment is “limited” and “should not be too huge.”[22] Stripped of the jargon: SoftBank lends its name and a sliver of equity, and someone else’s debt builds the thing. The capital that flowed to an actual Stargate site was a $500 million check into SB Energy for the Milam County build. The headline was $500 billion; SoftBank’s verified site-level equity was three orders of magnitude smaller.

This reframes what the French €45 billion actually is. It is not a promise that SoftBank will place €45 billion on its own books. It is a promise that SoftBank will supply catalytic equity and arrange project financing that does not yet exist — Son said as much at the podium, describing the venture as one SoftBank is “aggregating project financing” to fund, against demand from an anchor tenant not yet named, on a balance sheet already carrying the most concentrated single-name bet in the AI buildout.[22] The pledge’s deliverability is downstream of a financing structure that has to be assembled and of an OpenAI liquidity event — an IPO — that has to occur before the bridge facility is repaid. France controls none of those variables.

And here, the two halves of the story close together. Staged commitments and project finance are, on their own, unremarkable — every large data-center developer rings capex into phases and funds it with non-recourse debt, because that is cheaper than equity and isolates the risk. The question is never whether a pledge is staged; it is what the staging rests on. SoftBank will not put €75 billion of its own balance sheet behind this — the balance sheet just described could not absorb it on top of the OpenAI commitment — so it writes a €75 billion option instead: a headline ceiling, a smaller firm tranche, and project finance to be arranged later. What changes with leverage is not the structure but the margin for error within it. A cash-rich sponsor that stages a pledge can absorb a slipped tranche or a delayed financing; a sponsor whose crown jewel is already collateral, whose follow-on runs to October, and whose bridge presumes an IPO cannot.

The option produces the headline; the headline produces the political record; the record is what the summit needed.

The more strained the balance sheet, the larger and softer the number it can afford to announce, because softness is free and the announcement is the deliverable. To be precise about where the softness enters: not, mostly, with SoftBank. Its press release was scrupulous — “up to” €75 billion, a firm “€45 billion,” the rest explicitly “plans.” The recharacterization happened at the podium, when a phased pledge with one firm tranche became, in Macron’s telling, €93 billion “confirmed.” SoftBank disclosed an option. The summit booked it as cash.

What France actually brings, and what it doesn’t

The honest counterargument is that France is not Texas, and the difference favors the pledge. This deserves a fair hearing, because it is the strongest case for taking the €45 billion at close to face value.

France’s advantages are real and, unlike capital, not exportable. The grid is roughly 70 percent nuclear, France is, in most years, the world’s largest net electricity exporter, and industrial power prices sit well below those in much of Europe, with EDF long-term pricing around €70 per megawatt-hour from 2026.[23] For a buildout whose binding constraint is increasingly power rather than capital, that is a genuine structural edge, and it is why the first-phase sites cluster in Hauts-de-France near existing grid and nuclear infrastructure, including a former coal site at Bouchain where EDF is the named development partner.[24] It also matters that the prior SoftBank pledges failed precisely on the variable that France has solved: the Saudi solar plan had no offtaker, and the UK Stargate site was paused due to energy costs. France removes the constraint that killed those. If any SoftBank data-center pledge converts close to schedule, the case for this one is better than most — and that concession should be granted in full.

But cheap power is necessary, not sufficient, and it is not the variable that has stalled the buildout this year. What stalled Stargate was not the price of electricity; it was demand discipline and financing — a canceled expansion, a paused site, a vehicle that never funded. France solves the kilowatt-hour. It does not supply the anchor tenant, the assembled debt, or the balance-sheet capacity, and those are the three things the precedent says actually bind. Power is the one layer of the stack that France owns, and it is the bottom layer. Above the kilowatt-hour, the French buildout is foreign at every tier. The capital is Japanese. The chips are American — Nvidia silicon, subject to American export jurisdiction. The most likely offtaker of five gigawatts of French inference and training capacity is American, because the anchor tenant SoftBank builds for is OpenAI, and no European anchor of remotely comparable demand has been named.[25]

France is not building sovereign AI capacity. It is providing the land and the electricity for someone else’s intelligence layer, and calling the result French because the substations are.

Macron said the summit would make France “the leading country hosting data centers and computing capacity in Europe,” and that the country was “closing the gap we had in computing capacity.”[26] Both claims may even come true. But hosting capacity and owning intelligence are different sovereignties, and the gap that closes is the one measured in megawatts, not models. This is the substrate-state position, normally diagnosed in Southeast Asian economies that host hyperscaler data centers without owning any layer of the intelligence that runs on them. It is striking to find a G7 economy with a world-class research base occupying the same structural slot — providing the physical inputs and importing everything above it. The fair counter is that substrate can be a first rung rather than a ceiling: Taiwan and South Korea became chip powers partly by first hosting foreign firms’ manufacturing, and a country cannot build the intelligence layer on capacity it never built. Hosting compute you don’t yet own can be a deliberate developmental bet. But the bet only pays off if value accrues locally over time — if the substrate becomes a ladder.

The SoftBank pledge is built the other way: a foreign sponsor, foreign chips, and a most-likely-foreign tenant, with no disclosed mechanism for the intelligence layer to be handed over to French hands. It is a substrate as a destination, not a substrate as a rung.

There are two genuine exceptions inside the broader French buildout, and honesty requires naming both — because they sharpen the point rather than soften it. The first is Campus AI, the joint venture whose expansion supplied the €7.5 billion tier; its French AI champion, Mistral, secured up to 200 megawatts of capacity there, announced the same day as the summit.[7] But Mistral’s role in Campus AI is principally that of shareholder and board member; the project’s own coordinator described the startup as a “preferred” future client while conceding that, for now, “nothing has yet been done” on a binding tenancy.[7] Campus AI’s own president framed the stakes in terms that could serve as this article’s thesis: the test, he said, is that “every gigawatt must grow value in France, and not simply pass through it.”[7] The second exception, and the more real one, is Mistral’s own data center at Bruyères-le-Châtel — its first debt-financed build, totaling $830 million for roughly 13,800 Nvidia chips and about 44 megawatts of capacity.[27] That is the genuine article: a French company owning its own compute.

And its scale is the whole argument in one number. Forty-four megawatts of sovereign French compute, against SoftBank’s 3,100-megawatt first phase. The champion’s owned infrastructure is roughly 1% of the substrate that the country provides for someone else’s use. For the marquee number — five gigawatts — there is no French anchor. The grid is the moat, and nearly everything it powers belongs to someone else. And even the grid advantage is contingent on RTE, the French grid operator, actually delivering 3.1 gigawatts of new connection capacity to three specific sites by 2031 — an unprecedented load addition on a timeline that grid-connection history does not obviously support, and that no signed connection agreement has yet confirmed.[28]

What would have to be true

The thesis is falsifiable, and it is worth stating the conditions plainly, because they are also the things a serious investor should watch. And there is someone who should watch. An option-shaped pledge harms no one if everyone prices it as an option — but it is not being priced that way. It is being booked as a record by a government building industrial-policy narrative on it, cited by analysts pricing “France is Europe’s AI hub” into datacenter REITs and French-exposure allocations, and folded into the case for a SoftBank credit that already trades below investment grade. The reader who needs the disaggregation is the one about to treat €93 billion of intention as €93 billion of capital.

The skeptical reading is wrong if, within roughly twelve months, SoftBank secures binding project financing — not a memorandum — for at least the Dunkirk site; if a named anchor tenant or binding offtake agreement appears; if an executed lease replaces “preferred bidder” status at Bouchain; and if the OpenAI IPO closes cleanly enough to let SoftBank refinance the March 2027 bridge without forced asset sales. If those happen, the €45 billion converts, and the substrate-state critique becomes a quibble about who owns the value rather than whether the buildings exist.

The thesis is confirmed if the tells repeat: financing perpetually “being assembled,” capacity that “can scale to” rather than “will reach,” a first-operations date that slips past 2028, no anchor tenant disclosed by 2027, a further S&P action, or the same “pause” language that appeared over the UK site in April.

On sixteen years of SoftBank precedent — from the 2016 Trump Tower pledge that resolved substantially into the WeWork loss, to the 2018 Saudi solar plan shelved within six months of its announcement, to Stargate at one energized site of seven — the base case is not fabrication. It is conversion that runs well below the headline and well behind the clock.[29]

Which is the precise thing the word confirmés was chosen to obscure. Macron did not announce €93 billion of investment. He announced €93 billion of intention, of which the largest single component is a ceiling, two-fifths of that ceiling is merely a plan, and the firm remainder rests on a balance sheet betting its credit rating on a single American startup’s path to an IPO. The number is not false. It is an option — priced, and presented, as a certainty.

Notes

[1]: Emmanuel Macron, remarks at the Choose France summit, Versailles, June 1, 2026: “Cette édition de Choose France à elle seule va permettre de cristalliser un montant record de 93 milliards d’euros d’investissements confirmés.” Reported by franceinfo, June 1, 2026; quote also carried verbatim by Euronews FR. The €93 billion figure spans 71 projects and a French-government-stated ~15,600 jobs; it is an announcer-claimed forward figure, not an audited outcome.

[2]: SoftBank Group Corp., “SoftBank Group to Build 5 GW of AI Data Center Capacity in France,” press release, May 30, 2026. The €75 billion figure is stated as “up to.”

[3]: franceinfo, June 1, 2026, reporting that the single 2026 edition exceeded the cumulative announced totals of the prior eight Choose France editions (~€87 billion combined). Prior editions per Élysée/Business France press dossiers (2023 dossier, diplomatie.gouv.fr): 2023 ~€13B; 2024 ~€15B; 2025 stated variously as ~€20B (Macron, 2026 framing) and €40.8B (2025 press dossier) — the moving baseline is noted as itself indicative of headline elasticity.

[4]: SoftBank press release, May 30, 2026: the first phase is described as “an initial €45 billion investment to deliver 3.1 GW”; subsequent capacity is described as the company “also plans to develop additional sites across France.” The shift in verb from “investment/commitment” to “plans” is within the same document.

[5]: SoftBank press release, May 30, 2026. Named first-phase sites: Dunkirk (Loon-Plage), Bosquel, and Bouchain, all in Hauts-de-France; Schneider Electric named as strategic partner (robotized manufacturing at Dunkirk); SB Energy as developer; first operations targeted 2028, full phase by 2031. Per a separate SoftBank announcement (reported by TechRepublic, June 2026), the Bosquel ~1 GW site is structured as a majority-SoftBank joint venture with Sesterce — i.e. even within the “firm” first phase, the capital structure is partly third-party, not pure SoftBank balance sheet.

[6]: Smaller data-center tier, per Choose France 2026 reporting (Le Monde Informatique, Le Journal des Entreprises, Silicon.fr, June 1, 2026): Brookfield €10B at Escaudain (Nord), with Data4, for a ~1 GW datacenter, bringing its stated France AI total to €30B — of which €20B was announced at the February 2025 AI Action Summit (Brookfield press release, Feb 10, 2025: €15B via Data4 + €5B associated infrastructure, delivery by 2030), so only €10B is new to the 2026 summit. Nebius ~€8B / 240 MW on the former Bridgestone site at Béthune. Ardian/Verne ~€5B for a 500 MW Île-de-France campus, full 500 MW capacity targeted only 2035–2037, itself the first tranche of a broader ~€10B / 1 GW French consortium (Ardian, Iliad, EDF, Orange, Scaleway). Figures are announcer-claimed; several were pre-trailed by Les Echos and final terms may differ.

[7]: MGX–Bpifrance ~€7.5B is the national expansion of Campus AI, the joint venture of Bpifrance, Mistral AI, MGX (UAE), and Nvidia, first announced at Choose France 2025 (May 19, 2025) to build “Europe’s largest AI Campus” (flagship ~1.4 GW, Paris region). Per the Bpifrance press release (June 1, 2026), the expansion targets up to 3 GW nationally and the ~€7.5B second-site selection “doubles the consortium’s initial investment”; the second site selection is described as “imminent,” not yet committed. The flagship campus at Fouju (Seine-et-Marne) was reported still in early construction (”foundations laid, main site not yet begun”) as of April 2026; the flagship’s secured first tranche is reported at ~€8.5B (Le Figaro), a separate figure from the €7.5B second-site expansion. Campus AI is the one summit pledge with a French intelligence-layer anchor (Mistral); the substrate-state exception is noted in the body. The Campus AI president quoted in the body is Thibaud Desfossés (”chaque gigawatt doit faire fructifier la valeur en France, et non simplement la traverser”), per the Bpifrance press release.

[8]: Revolut’s ~€1B France commitment was reported as contingent on the firm obtaining a French/EU banking licence.

[9]: Emmanuel Macron, remarks reported by Reuters, June 1, 2026: characterizing the AI-related portion as “20 billion invested, and 20 billion of AI investments as a follow-up to the summit in February.” Verify verbatim French against the Élysée transcript before publication.

[10]: The February 2025 AI Action Summit in Paris produced a ~€109 billion headline; France published no public reconciliation of that figure to authorized, appropriated, or disbursed capital. See “The King’s New Datacenters” (The AI Realist, March 25, 2026), which audited the €109B pledge to an honest near-term figure of roughly €25B.

[11]: OpenAI, “Announcing The Stargate Project,” January 21, 2025; announced at the White House with President Trump, Sam Altman, Larry Ellison, and Masayoshi Son. Headline: “$500 billion over the next four years … We will begin deploying $100 billion immediately.” Son named chairman.

[12]: Reported equity structure (The Information; corroborated by Bloomberg, WSJ): ~$52B committed equity against the $500B headline — roughly $19B each SoftBank and OpenAI, ~$7B each Oracle and MGX — implying ~90% of the program was to be debt- and vendor-financed and not yet arranged at announcement. WSJ reported SoftBank’s equity share could be as low as ~10%.

[13]: Epoch AI, “OpenAI Stargate: where the US sites stand”, satellite-imagery analysis, April 17, 2026: Abilene operational at ~0.3 GW (April 17 reading; a later cached version of the same page shows ~0.6 GW), ~4 of 8 buildings live, against a 1.2 GW site target; Epoch projects the program to “exceed 9 gigawatts by 2029” versus the $500B/10 GW headline announced January 2025; six other US sites in early construction on ~Q4 2028 timelines. “Operational” capacity is satellite-verified (Airbus DS imagery); OpenAI’s “nearly 7 GW planned / $400B+ over three years” figures (five-new-sites announcement, Sept/Oct 2025: https://openai.com/index/five-new-stargate-sites/) are announcer-claimed. The Abilene ~600 MW expansion was redirected, with Microsoft taking the adjacent 900 MW Crusoe site.

[14]: Reporting by The Information, corroborated by Bloomberg and the Financial Times (early–April 2026): Stargate LLC had hired no staff and was developing no data centers; OpenAI pursued bilateral capacity deals (Oracle ~$300B/4.5 GW, plus AWS, Google Cloud) and treated “Stargate” as an umbrella term for its compute strategy. Bloomberg (Aug 7, 2025) earlier reported CFO Yoshimitsu Goto conceding the effort was “taking longer than anticipated.”

[15]: Abilene expansion (~600 MW) cancelled: Bloomberg, March 6, 2026 (Microsoft took the adjacent Crusoe capacity). Stargate UK paused: Bloomberg, April 9, 2026, citing energy costs.

[16]: Abilene construction: Crusoe/Oracle; JPMorgan project-finance facility (~$2.3B, May 2025) and a Newmark-led syndicate (~$7.1B); Nvidia GB200 racks installed from mid-2025; Ellison stated an ultimate target above 450,000 GB200 GPUs under a 15-year Oracle lease. These are real, closed commitments and are cited to discipline the “headline is empty” overclaim.

[17]: SoftBank completed a $41B OpenAI round in December 2025 for ~11% (comprising ~$30B from SoftBank Vision Fund 2 plus ~$11B syndicated co-investment); on February 27, 2026 it agreed a further $30B follow-on (SoftBank Group Corp. press release), funded through Vision Fund 2 as part of OpenAI’s ~$110B round (the largest private funding round on record, valuing OpenAI at ~$852B), bringing cumulative investment to an expected $64.6B and ~13% stake “upon completion,” subject to closing conditions. The follow-on is staged: first $10B tranche executed April 1, 2026; further $10B tranches scheduled July 1 and October 1, 2026 (SoftBank Group Corp. press release, April 1, 2026). As of the June 1 summit, the $64.6B figure is therefore expected-on-completion, not a settled position.

[18]: Funding via disposal of SoftBank’s entire Nvidia stake (~$5.83B, October 2025) and T-Mobile shares; $40B bridge facility signed March 27, 2026, with the first $10B drawn April 1, 2026 (SoftBank Group Corp. press release). The facility is unsecured and full recourse to SoftBank, with no OpenAI shares or Arm stake pledged as collateral; per IFR / loan syndication reporting, the margin starts at 250bp over SOFR and steps up by 17.5bp from July through end-September 2026, a structure designed to incentivise an early takeout via bonds or term loans ahead of an expected OpenAI IPO (widely reported as targeted for late 2026 / as early as Q4 2026; the 12-month tenor, maturing ~March 25–26, 2027, is read by lenders as a bet on that listing). A separate ~$10B margin loan (arranged by Goldman Sachs, JP Morgan, Mizuho; two-year facility with one-year extension, limited recourse) is distinct from the bridge; SoftBank subsequently scaled this facility back toward as little as ~$6B after creditor hesitation (Bloomberg, via Fortune) — a direct signal of the financing strain the body describes. MST Financial’s David Gibson, via the Financial Times, estimated SoftBank faces “[an estimated] $50bn ... of funding, between OpenAI, investments and refinancing” to arrange over the course of 2026; OpenAI is not expected to reach profitability until 2030.

[19]: S&P Global Ratings, action reported March 2026 (S&P statement via Bloomberg; B-tier link to wire coverage of the agency statement): outlook revised to negative, BB+ affirmed (below investment grade), OpenAI described as “one of its investments with the weakest credit quality”; S&P also flagged the unlisted-asset proportion rising above 50% (from 42%) and warned the $30B follow-on could push leverage toward the 35% level that would trigger a downgrade. Moody’s held SoftBank at Ba2/stable (2025 upgrade). The agency divergence is presented to avoid cherry-picking the bearish view; both keep SoftBank below investment grade.

[20]: SoftBank’s reported loan-to-value ratio was 20.6% at end-December 2025, within its stated financial policy (LTV managed below 25% in normal conditions, 35% emergency ceiling; SoftBank Group Corp. disclosure). CFO Yoshimitsu Goto told the Financial Times (March 2026) the group “does not rule out” temporarily exceeding 25%. ADR down ~45% from its October 2025 high by late March 2026; Jefferies downgraded to “Underperform,” calling the company a “valuation trap”; SoftBank paused a separate ~$50B acquisition (Switch). The piece does not claim the 25% ceiling was breached as of publication — only that the CFO opened the door and S&P flagged the trajectory.

[21]: No A-tier source confirms SoftBank’s ~$19B Stargate LLC equity tranche was wired; reporting (The Information, Bloomberg, FT) indicates the JV was bypassed in favor of bilateral deals. Bloomberg Intelligence estimated SoftBank’s actual Stargate cash requirement nearer ~$40B “given its less-active-than-expected participation” — an estimate, not a disclosure.

[22]: Yoshimitsu Goto, SoftBank Q3 FY2025 earnings call, February 12, 2026 (translated remarks): SoftBank makes an equity investment while the project itself is financed as project finance, so SoftBank’s own size is “limited” and the amount “should not be too huge.” Verify exact translated wording against the SoftBank transcript before publication. Son corroborated the same structure at the Choose France podium, stating SoftBank is “aggregating project financing” for the French venture and that the figure “balloons to roughly $750 billion once the broader system is factored in” (CNBC, June 1, 2026) — the announcer himself confirming both that the financing is not yet assembled and that the headline expands on a “broader-system” basis.

[23]: French grid: ~70% nuclear share of generation; France the largest net electricity exporter in Europe/globally in most years (RTE/IEA data — cite data year at fact-check). Note the 2022 exception: amid widespread reactor-corrosion outages France was briefly a net importer, which is why the body says “in most years.” EDF long-term industrial pricing ~€70/MWh from 2026 per the post-ARENH framework. Replace paraphrase with primary RTE/CRE figures and the specific data year before publication.

[24]: SoftBank press release, May 30, 2026; Bouchain former coal-plant site with EDF as named development partner (described at “preferred bidder/due diligence” stage). Grid-proximity rationale for the Hauts-de-France cluster per company and regional (CC2SO/RTE) materials; Bosquel reported ramping 240 MW → ~1 GW → 1.4 GW per regional authority citing RTE.

[25]: No anchor tenant was named in the SoftBank announcement. The substrate-state characterization (Japanese capital, US chips, likely-US offtake) is an analytical inference from SoftBank’s OpenAI relationship, not a stated offtake agreement; flagged as inference.

[26]: Emmanuel Macron, remarks from the Élysée, June 1, 2026, reported by regional French press (mesinfos/La Semaine de l’Île-de-France): aim to make France “le premier pays accueillant des centres de données et des capacités de calcul en Europe” and “Nous sommes clairement en train de combler le retard que nous avions en matière de capacités de calcul en Europe.” Verify against the Élysée transcript before publication.

[27]: Mistral AI raised $830M in debt financing (its first debt raise since founding) from a seven-bank consortium (incl. BNP Paribas, Crédit Agricole CIB, HSBC, MUFG) to acquire ~13,800 Nvidia GB300 chips for a data center at Bruyères-le-Châtel (Essonne), ~44 MW, operational expected Q2 2026 (TechCrunch, March 30, 2026; also Reuters, CNBC). Separately, on June 1, 2026, Bpifrance announced Mistral secured up to 200 MW of capacity with Campus AI (Bpifrance press release); the Campus AI project coordinator (L’Usine Nouvelle) described Mistral as a “preferred” future client and board member/shareholder while stating no binding tenancy was yet concluded — the distinction between equity partner and committed offtaker is preserved in the body. Scale contrast: ~44 MW of Mistral-owned compute vs. SoftBank’s 3,100 MW first phase.

[28]: RTE 3.1 GW connection feasibility to Dunkirk/Bosquel/Bouchain by 2031: no published binding confirmation as of publication. CRE fast-track connection regime (deliberation 2025-120) implies multi-year (≈3–4 year) connection timelines even when expedited. Press-release language on “abundant, decarbonised electricity” is political framing, not a signed connection agreement.

[29]: SoftBank pledge precedents: (a) December 2016 Trump Tower “$50B / 50,000 jobs,” drawn from the forming Vision Fund, with ~half of deployed capital flowing into WeWork (peak ~$47B valuation; 2023 bankruptcy) — Axios retrospective; (b) March 2018 Saudi PIF “$200B / 200 GW” solar MOU, shelved by ~September 2018 (WSJ); (c) Vision Fund 2 ($108B target, ultimately run largely on ~$38B of SoftBank’s own capital). Each: a head-of-state-adjacent headline converting to a fraction of announced, slower, and structurally different capital. France-specific conversion claims are forecasts based on this precedent, not observed outcomes.

The Amendments Were Whispered

Julien Simon — Wed, 03 Jun 2026 06:38:57 GMT

On May 29, a deputy from the president’s party named Éric Bothorel filed twelve amendments to a copyright bill. Three more came from his Renaissance colleague Prisca Thévenot. All fifteen landed the same Friday, three days before the Assembly’s culture commission was due to examine the text, and all fifteen failed the following Tuesday.[1] Asked where the measures came from, Bothorel told Le Point that some had been soufflés par Mistral — whispered by Mistral.[2]

That sentence is the one this newsletter spent four thousand words predicting last week.

“Lobby, Levy, Legislate” argued that Mistral’s moat is not sovereignty or model quality but access: the French president’s contact list, which Arthur Mensch is working to convert into formal law before the 2027 election changes who answers the phone.[3] That was an inference from the customer roster and the lobbying pattern. It did not name this bill; it named the move. A week later, a legislator in the governing party put the move on the public record.

The bill itself is narrow. Proposition de loi n° 2634, adopted by the Senate, would install a présomption d’utilisation: a presumption that an AI provider trained on protected cultural works, unless the provider can document otherwise.[4] It reverses the burden of proof. Today, an author has to prove their work was scraped; under the text, the company has to show what went into the model. The French government’s own civic portal describes it in one line: the bill reverses the burden of proof. [5]

The amendments tell you whom that threatens. One of Bothorel’s amendments inserts two words, de modèles, after “fournisseurs,” narrowing the bill so it binds only model-builders and exempts the French firms that merely deploy AI downstream — the corporates that fill Mistral’s customer list. Its justification is not commercial. It is sovereign: a broad scope, the amendment argues, would halt the sector’s growth and our digital sovereignty. [6] Another strikes the retroactivity clause, reciting the industry’s standard line that documenting training data demands complex technical adaptations. [7] A third went after the bill’s title.[8] Filing amendments against a bill is ordinary politics; a legislator admitting the affected company drafted them is not. The sovereignty argument, deployed to reshape a copyright statute, in the pen of the president’s party. This is not yet the procurement law the long-form predicted — it is the same access doing the simpler job first: shielding the customer list from a bill before writing the law that entrenches it.

Emmanuel Maurel, the deputy carrying the text, attributed the pressure to “certains anciens ministres du bloc central” — former central-bloc ministers now working the building.[9] Readers of the long-form will recognize the address. The piece built its second act on one such figure: Cédric O, the former secretary of state for digital affairs who became a Mistral shareholder and adviser.[3] Maurel, with no framework to grind, arrived at the same door.

The calendar rhymes, too. Yaël Braun-Pivet, the Assembly’s president, received Mensch on May 7. Five days later, the panel that sets the Assembly’s agenda drew up the lineup for a cross-party session and left the copyright bill off it.[10] The Assembly says the meeting was routine. The sequence stands regardless.

Last week’s piece closed on “a calendar that runs out in eighteen months.” This week sharpens why the calendar matters. The lobbying is not the behavior of a company that thinks it has time; it is the behavior of one racing against a deadline. Fifteen amendments filed in a single afternoon are the president’s party spending its access while the access still exists.

There is one scenario where the clock resets: Gabriel Attal. The former prime minister has made AI a central plank of his campaign, vowing to turn France into “la patrie de l’IA,” and a macroniste successor in the Élysée would keep the contact list warm.[11] But Attal is not the favorite, by far. A moat that depends on a trailing candidate is a moat with an expiry date. The base case is the one the long-form named: the access leaves with the administration that built it. That is why Mistral is not waiting. You do not whisper fifteen amendments into a friendly deputy’s hand if you expect the friendly deputies to still be there in three years.

The commission turned back all fifteen, and the bill survived the room. But the surviving committee is not a passage. The text now sits last in the running order of a reserved day claimed by a small opposition group, a slot it may never reach; if it advances with amendments attached, it returns to the Senate to die of scheduling.[12] Mistral does not need to defeat this bill. It needs the bill to never finish, and it has a governing party willing to file amendments to buy time.

Still, time is the one thing Mistral cannot lobby for. The president, whose contact list is the moat, is term-limited and polling in the low twenties; in 2027, he leaves, and the phone Mensch has been calling stops being his to answer.[13] The amendments filed in a single afternoon are not the work of a winning company. They are the work of one racing to pour its access into law before the access walks out of the Élysée. Strip the sovereignty language, and the structure is plain crony capitalism: a national champion whose valuation, customer base, and inner circle of former ministers are all underwritten by one man’s term in office.[3]

The lobbying was the visible part. The confession was the story. The clock is the verdict. Macron’s days are numbered, and everyone on his contact list is counting down with him.

Notes

[1]: Amendments to Proposition de loi n° 2634, Commission des affaires culturelles et de l’éducation, Assemblée nationale. Of sixteen amendments examined June 2, 2026, twelve were filed by M. Éric Bothorel and three by Mme Prisca Thévenot (both groupe Ensemble pour la République); one, by Mme Véronique Ludmann (Horizons), was withdrawn. All were deposited May 29, 2026 and rejected or withdrawn June 2. Amendment list and authors, Assemblée nationale.

[2]: Thomas Graindorge, “« Je n’ai jamais vu un lobbying de cette puissance » : à l’Assemblée, la bataille de Mistral contre le droit d’auteur,” Le Point, June 1, 2026. Éric Bothorel quoted acknowledging certain measures were “soufflés par Mistral.” Erwan Balanant (Les Démocrates) is quoted in the same piece: “Je n’ai jamais vu un lobbying de cette puissance-là sur les domaines culturels.”

[3]: “The President’s Customer List,” The AI Realist, May 2026. The Cédric O biographical detail — his role as Mistral shareholder and adviser following his tenure as secretary of state for digital affairs — is sourced there.

[4]: Proposition de loi relative à l’instauration d’une présomption d’utilisation des contenus culturels par les fournisseurs d’intelligence artificielle, n° 2634, adopted by the Sénat (unanimously) April 8, 2026. Commission text n° 2864-A0 deposited at the Assemblée June 2, 2026. Note: the Sénat title used “présomption d’exploitation”; the version examined at the Assemblée reads “présomption d’utilisation.”

[5]: Vie publique (Direction de l’information légale et administrative), notice of April 10, 2026: the bill “renverse la charge de la preuve de l’utilisation de contenus culturels par les fournisseurs d’IA.”

[6]: Amendement n° AC2, M. Éric Bothorel, Commission des affaires culturelles, rejected June 2, 2026: “À l’alinéa 4, après le mot « fournisseurs » insérer les mots « de modèles ».” Exposé sommaire: “Un champ d’application trop large et non justifié du texte […] mettrait un coup d’arrêt à l’essor du secteur et à notre souveraineté numérique.” The amendment also cites the Munich Regional Court ruling GEMA v. OpenAI (November 11, 2025) — the same enforcement action analyzed in “Register, Disclose, Pay.”

[7]: Amendement n° AC10, M. Éric Bothorel: “Supprimer l’alinéa 5,” removing retroactive application to pending litigation, on the grounds that transparency and traceability compliance requires “des adaptations techniques complexes” that cannot be applied retroactively.

[8]: Amendement n° AC3, M. Éric Bothorel, targeting the bill’s title (TITRE).

[9]: Maurel quote per Le Point (note 2). Maurel, the GDR rapporteur, has publicly championed the text alongside the collecting societies Adami, SACD, and ADAGP.

[10]: Braun-Pivet–Mensch meeting (May 7) per Le Point (note 2). The bill’s absence from the agenda set by the May 12 Conférence des présidents is corroborated by “IA : pas de proposition de loi sur le droit d’auteur à l’ordre du jour de l’Assemblée nationale,” Le Monde, May 12, 2026, and by Décideurs Juridiques, May 12, 2026.

[11]: Gabriel Attal, first major campaign rally, May 29, 2026, per Le Point (note 2), which reports his ambition to make France “la patrie de l’IA” and attributes to him an effort to slow the text. Direct-quote wording to be confirmed against the rally transcript before syndication.

[12]: Procedural posture per Le Point (note 2): the text is placed last in the GDR niche order of June 11; amendments lengthen hémicycle debate and, if adopted, force a return to the Sénat for a conforming vote.

[13]: Emmanuel Macron, in his second consecutive term, is barred by Article 6 of the French Constitution from seeking a third; his mandate ends in 2027. His approval stood in the low twenties as of May 2026 (Ipsos, Elabe, and Morning Consult tracking polls), as detailed in “The President’s Customer List” (note 3).

The Overbuild Put

Julien Simon — Mon, 01 Jun 2026 11:51:41 GMT

On May 27, asked at Meta’s annual shareholder meeting whether the company would ever take on Amazon, Microsoft, and Google in cloud computing, Mark Zuckerberg said the idea was “definitely on the table.” [1] Then he added the qualifier that matters more than the headline: Meta hasn’t rented out compute “because we think that we have a use for the compute,” and a cloud business becomes an option only “if we get to a point where we feel that we have overbuilt.” [2]

Read that again with the calendar open. Five weeks earlier, on April 24, Meta had signed a deal making it one of the largest customers in the world for Amazon’s Graviton processors — renting compute capacity from a direct competitor’s cloud, explicitly to get access to the silicon it needs now without waiting on its own data centers. [3] So in the same quarter, the only one of the four U.S. hyperscalers that does not sell cloud services [4] was simultaneously a major buyer of someone else’s compute and a prospective seller of its own. Short and long, on the same balance sheet, at the same time.

That is the contradiction worth chasing. Everyone in AI is supposedly starved for compute — GPUs backordered for months, Amazon’s own training chips shipping slower than it can build them, North American data-center vacancy at 1.4% at the end of 2025. [5] And here is the company building more of it than anyone, raising the possibility that it might have too much. Where does “excess capacity” come from in a world that can’t get enough?

The claim: a put, not a pivot

The answer is that “excess” and “scarcity” are not opposites here. They are the same condition seen from two ends of a balance sheet — and which end you look from determines how you should price Meta.

Meta’s cloud remark is not a product strategy. It is a put option on its own buildout. And the interesting question is whether to read that option as fragility or as optionality. Both readings are live. Both are defensible from the same numbers. The piece that follows is about which one the evidence favors, and why the remark itself is the tell.

The numbers frame the tension. Meta raised its 2026 capital-expenditure guidance to $125-$145 billion, up from a prior range of $115–$ 135 billion, citing higher component prices and “additional data center costs to support future year capacity.” [6] As much as double what it spent in 2025, and even at the floor, more than 2024 and 2025 combined. [7] Yet first-quarter capex came in at just $19.84 billion — below the $27.57 billion analysts expected. [8] The company spent modestly and guided enormously in the same breath, and the market punished the guidance, not the spend: the stock fell roughly 7%. [9]

The exposure is not in what Meta has spent but in what it has promised to spend. The first-quarter filing carries $237.67 billion in non-cancelable contractual commitments — mostly third-party cloud capacity, servers, network infrastructure, and data centers — against $81.18 billion of cash and marketable securities. [10] Separately, it disclosed $182.88 billion of leases not yet commenced, consisting of data centers, colocations, and network infrastructure that begin between now and 2036. [11] The commitment line jumped by $107 billion in the quarter alone, which chief financial officer Susan Li attributed to multiyear cloud deals and infrastructure purchase agreements. [12] The overbuild, if there is one, does not live in trailing capex. It lives in the contracts — and contracts do not flex when demand disappoints.

The first reading — call it the Overbuild Put — holds that Meta is stacking every available lever to make a buildout look affordable whose paying customer it has not yet secured, and that the cloud remark is the final lever: a backstop buyer of last resort for capacity its own products may not fill. The second reading — call it Scarcity Builds the Glut — holds that the overbuild is rational, that an advertising machine of staggering profitability is funding it from cash, that scarcity is real and the surplus will be absorbed, and that the cloud option is genuine upside rather than a distress signal. The rest of this piece develops both, then resolves them.

The mechanism: four levers and a backstop

Start with the paradox, because resolving it is the whole argument.

The compute Meta is renting from Amazon, and the compute it is building is not the same as the compute it is renting. The Graviton deal brings tens of millions of Arm-based CPU cores into Meta’s portfolio — general-purpose silicon suited to agentic inference, the workload that runs after a model is trained, and available immediately. [3] Hyperion, Meta’s flagship campus in Richland Parish, Louisiana, is a five-gigawatt site built for the next generation of frontier training, and it does not come online until 2029. [13] Different silicon, different workload, different clock. Meta is short on the inference capacity it needs this year and long on the training capacity it has committed to for the end of the decade.

That gap — between capacity contracted years ahead and demand demonstrated today — is where “excess” is manufactured. And it is manufactured by the very scarcity panic that justifies the spending. Racing a feared shortage, you commit to gigawatts that arrive in giant, indivisible blocks long before the workloads exist to fill them. The scarcity is what produces the glut. They are the same phenomenon.

Meta’s own answer is that the gap is illusory — that the committed capacity is precisely the inference base it needs to put personal and business agents in front of billions of users, as Li told the call. [14] That may prove right. It is also exactly the demand that has to materialize on schedule for the contracts to pay, and a backstop is what a management team names when it wants insurance against its own forecast.

Now the affordability levers — each of them legal, disclosed, and used across the industry. The question is never whether any one of them is permissible; it is what they add up to. The buildout only proceeds if it can be made to look cheaper than it is.

The first lever is the off-balance-sheet vehicle. Meta financed Hyperion through a joint venture with Blue Owl Capital — Blue Owl owns 80%, Meta 20% — funding construction through a special-purpose vehicle (SPV), Beignet Investor, that raised roughly $27 billion of debt against about $2.5 billion of equity: close to $30 billion in total, the largest private-credit data-center deal on record. [15] The structure keeps the debt off Meta’s books, and the price of that engineering shows in the terms. The bonds, rated A+ by a single agency on the strength of Meta’s backing, priced at a 6.58% yield — roughly 225 basis points over Treasuries, wider than Meta’s own senior notes pay despite the identical credit standing behind them, the premium a charge for the off-balance-sheet structure, and the 24-year tenor — and mature in 2049. [16] This is not a one-off but a template. A second vehicle follows the same logic: a roughly $13 billion structure for a gigawatt campus in El Paso, leaning on the same thin-equity, debt-heavy capitalization that is wildly insufficient if the workloads stall. [17][18] One detail in that deal is its own signal — it has no anchor lender, leaving the banks to syndicate the debt into capital markets rather than place it with a single committed buyer. [17] And these vehicles sit on top of, not instead of, the $58.75 billion of senior notes already on Meta’s own balance sheet. [19]

The second lever is the lease itself. To keep the rating agencies from treating the arrangement as debt, Meta structured the Hyperion lease on a four-year renewable term — short enough that the obligation need not be consolidated onto the balance sheet as a single long-term liability. [20] The debt is real; it simply does not appear where a casual reader of the 10-K would expect to find it.

The third lever is depreciation. Effective January 1, 2025, Meta extended the estimated useful life of a subset of its servers and network equipment to 5.5 years, a change it disclosed would reduce full-year 2025 depreciation expense by approximately $2.9 billion. [21] Lower depreciation flows straight to reported operating income without changing a dollar of cash. The timing is the point: Meta stretched the assumed life of its hardware precisely as it ramped the buildout, flattering the income statement at the moment the spending most needed flattering. The contrast with Amazon is exact. In the same window, Amazon shortened the useful life of a subset of its servers to five years, explicitly citing the rapid pace of AI innovation. [22] Two companies, one hardware reality, opposite accounting choices — and Meta picked the one that defers the reckoning. Michael Burry’s public broadside in late 2025, estimating roughly $176 billion of industry-wide understated depreciation between 2026 and 2028, is the bear case for that choice arriving on schedule. [23]

The defense is real: a GPU’s economic life does cascade — from frontier training down to cheaper inference and eventual resale — so a longer book life can be honest rather than cosmetic. But the cascade has to land somewhere. It presumes a profitable second use for silicon Meta has finished training on, and that second use is either internal inference demand or an outside renter. The depreciation assumption and the cloud option are the same bet wearing different clothes.

The fourth lever is the one Zuckerberg named out loud. Each of the first three makes the buildout look affordable; none makes it pay. The vehicles, the lease slicing, the stretched depreciation all assume the same thing — that the compute, once built, generates revenue. The financing analyst on the Hyperion deal said it plainly: Meta has to build the thing, “put workloads in it,” and operate on the presumption that it will monetize those loads later. [24] The cloud option is the answer to the question every other lever begs. If Meta’s own products do not fill the capacity, Meta rents it to someone whose products will — and the debt gets serviced either way. That is what a put is: the right to sell the underlying when you no longer want to hold it.

This is why the Commitment-versus-Spend gap matters so much. A company that has spent $19.84 billion against $237.67 billion in commitments is not yet overbuilt. [8][10] It is contracted to overbuild, with the spending back-loaded and the demand unproven. The cloud remark is what a management team says when it can see the gap between the contracts it has signed and the demand it can document, and wants the market — and the credit market in particular — to know there is an exit.

The strongest objection is that this is simply how the cloud business was born. AWS grew out of Amazon’s own internal slack in 2006: build for yourself, find you have spare capacity, rent it out. Selling the excess is not a red flag — it is the canonical path to the most lucrative franchise in enterprise computing. The distinction is sequence and leverage. Amazon converted capacity it already owned into a product before anyone had committed a quarter-trillion dollars of debt-financed, off-balance-sheet capacity to the bet. Meta is committing the capacity first, financing it through vehicles built to keep the debt invisible, and presuming the product will follow. AWS monetized a surplus it stumbled into; Meta is pre-committing to a surplus and naming, in advance, its buyer of last resort. One is discovery. The other is a hedge.

What actually exists

Here, the second reading is at its strongest, and honesty requires giving it full weight.

Meta can build models. For a year, that was an open question. Llama 4 launched in April 2025 to a poor reception; Yann LeCun later told the Financial Times that the benchmark results had been “fudged a little bit,” that the team used different models for different benchmarks, and that Zuckerberg lost confidence in the group and sidelined it. [25] Eleven of the fourteen researchers behind the original Llama left the company; LeCun himself departed in November 2025. [26] The flagship “Behemoth” model was delayed due to performance issues and never shipped as promised. [27] If the thesis were “Meta cannot compete at the frontier,” that history would carry it.

But it isn’t, and the history was reversed. On April 8, 2026, Meta Superintelligence Labs — the division built around the $14.3 billion Scale AI investment and chief AI officer Alexandr Wang — released Muse Spark, which scored 52 on the independent Artificial Analysis Intelligence Index — fourth in the world at launch, behind only Gemini 3.1 Pro, GPT-5.4, and Claude Opus 4.6, and far ahead of Llama 4 Maverick’s 18. [28] Meta is, demonstrably, back in the race. But the profile is spiky in a telling way: Muse Spark’s weakest results fall on exactly the agentic, real-world-work benchmarks that enterprise compute is sold against — it trails GPT-5.4 and Anthropic’s Claude models on Artificial Analysis’s GDPval economic-task evaluation and on Terminal-Bench, gaps Meta itself flagged as priorities for further work — while its standout scores cluster in consumer health and multimodal fluency. [29]

What it did with that model is the crux. Muse Spark is closed. Its weights are not published, and at launch, Meta offered no public API, only a private preview to select users — the model was available free through the Meta AI app and website, and rolling out as the default assistant across Facebook, Instagram, WhatsApp, and Ray-Ban glasses, but not sold to developers as a service. [30] Meta deliberately declined to monetize the intelligence layer externally. The model exists to make Meta’s own products better and to be consumed by Meta’s own three-billion-user base, monetized the way Meta monetizes everything — through advertising, supplemented by new $7.99 and $19.99 Meta AI subscriptions. [31]

And the advertising machine is extraordinary. First-quarter revenue rose 33% to $56.31 billion, the fastest growth since 2021; ad impressions were up 19% and price per ad up 12%; operating income reached $22.87 billion; and the company generated $12.4 billion of free cash flow in the quarter even after capex. [32] This is the heart of the optimistic reading. Meta is funding a generational infrastructure bet out of one of the most profitable businesses in the world, not borrowing against hope. It underperformed expectations in the quarter and retains the discipline to throttle. If the buildout is rational, this is why.

That cushion is thinning fast, though. The $43.6 billion of free cash flow Meta generated across 2025 is set to fall steeply in 2026 as capex roughly doubles — far enough that several analysts now model it turning negative within a year or two. [33] The ads engine funds the buildout today; whether it still does in 2027 is the seam the bear case pulls at.

It also sharpens the problem. A closed model that, at launch, sold nothing to outside developers does not generate external compute revenue. Muse Spark fills Meta’s consumer demand, not the commercial demand that would absorb a five-gigawatt training campus and service a thirty-billion-dollar SPV. The model’s success and the buildout’s empty revenue case trace to one decision: Meta chose to keep its best work inside the walls. The capacity outside the walls still needs a tenant.

Whose money builds it

If the advertising machine genuinely pays for all of this, one hire is hard to explain. In January 2026, Meta named Dina Powell McCormick, with sixteen years at Goldman Sachs, where she ran the global sovereign investment banking business, later a deputy national security adviser, with the Gulf relationships to match, president and vice chairman. [34] Zuckerberg’s brief for her was specific: partner “with governments and sovereigns to build, deploy, invest in, and finance Meta’s AI and infrastructure,” and build “new strategic capital partnerships” that “expand our long-term investment capacity.” [35] A company that can comfortably fund its buildout from operating cash flow does not recruit a sovereign-wealth dealmaker to expand its investment capacity.

The move follows a path Microsoft, OpenAI, and Amazon have already worn — courting Gulf sovereign-wealth funds to help underwrite AI infrastructure. [36] It is the logic of the SPVs taken one tier further. Private credit moved the debt off Meta’s balance sheet; sovereign capital would move part of the funding burden off the private-credit market, which — as the El Paso deal’s missing anchor lender hints — is showing early signs of indigestion. Each tier widens the circle of people other than Meta who carry the bet.

And it changes what the capacity costs in something other than dollars. Sovereign money is not neutral money. A loan financed by a foreign government carries strings; private credit does not: preferences about where the capacity sits, who gets access, and what the financier expects in return. Meta has not closed such a deal — it has hired the person whose job is to find one. But the direction is the tell. When the cheapest available capital for a buildout is a sovereign-wealth fund, the buildout has outgrown every conventional source, and the question of who holds leverage over Meta’s compute stops being rhetorical.

The mirror: Amazon built the same trap in reverse

The cleanest way to see what Meta is doing is to set it beside the company it is renting chips from.

Amazon is the canonical case of infrastructure reversion: a company that stumbled repeatedly at the intelligence layer — Titan, Nova, the slow developer uptake of its own silicon — and resolved each stumble by retreating to infrastructure it could sell. The difference is that Amazon has the infrastructure business. When its models underperformed, it had AWS to monetize the compute regardless, and Bedrock to resell everyone else’s models through its own billing relationship. The reversion worked because the floor was already a product.

Meta has arrived at the same place from the opposite direction. It has the intelligence-layer stumbles. It has the infrastructure. What it lacks is a cloud business that would let the infrastructure pay for itself if the models don’t. The two companies even diverge on the accounting of identical hardware, each in the direction its strategy implies: Amazon, which sells compute and lives with hardware honesty, shortened its server life; Meta, which needs the buildout to look affordable, lengthened it. [21][22]

So the Infrastructure Reversion Test produces a Meta-specific verdict. Reversion is a fallback to a business you already run. Meta is contemplating reversion to a business it has never run, against incumbents who own two-thirds of the market between them, [4] as the backstop for a model strategy that, at launch, sold nothing to outsiders. It closed its models and is now weighing whether to sell the floor beneath them. When the intelligence layer is walled off from outside revenue, the only thing left to sell to outsiders is the raw compute — and that is the layer with the lowest margins and the most entrenched competition.

The bear’s favorite analogy belongs here, and it cuts more sharply than the bulls admit. The dark-fiber buildout of the late 1990s did, eventually, become the backbone of the modern internet — vindication, the optimists say, for building ahead of demand. But the surplus enriched whoever bought it cheaply out of bankruptcy, not the companies that financed and laid it. [37] If Hyperion and its siblings are dark fiber 2.0, the relevant question is not whether the capacity is used. It is who is holding the paper when it does. And the paper here sits with private-credit funds, insurers, and — through target-date and core bond funds — ordinary retirement accounts, layered over a thin equity cushion. [18][38]

This also answers the objection that the SPV debt is the lenders’ problem, not Meta’s. It is both, and the split is the point. Meta’s own direct exposure is comparatively contained — the lease payments it owes the vehicles, plus its minority equity. The structure’s fragility — the thin cushion, the long-dated near-junk debt — sits with Blue Owl, the bondholders, and the insurers behind them. The risk sits there by design. Meta engineered it onto someone else’s balance sheet, which is precisely why it can afford to be sanguine about overbuilding, and why “cloud is on the table” costs it so little to say.

What would have to break

The cloud remark serves as the hinge between the two readings, making this a thesis with a falsification date built in.

If Meta never exercises the option — if Muse-series models scaling across three billion users, plus recommendation and ads inference, plus whatever agentic workloads arrive, actually fill Hyperion and El Paso, and the SPV debt is serviced out of the advertising machine’s cash flow — then the optimists were right. The buildout was a rational forward purchase, the cloud line was idle optionality, the levers were prudent capital management. The end-state even has a name: Meta becomes a second Google — billions of users, a wall of apps, a competitive frontier model, and the custom silicon and data centers to run it all in-house. Google built precisely that stack, and it pays.

But the comparison is where the bull case turns on itself. Google’s vertical integration includes the one layer Meta has conspicuously skipped: it monetizes the same silicon and models externally, renting its TPUs and selling Gemini through Google Cloud — the chips that train Gemini and serve a billion users also collect rent from outside customers. [39] Google fills its own fleet and sells the overflow. Meta closed its model, withheld the API, and runs no cloud; and its silicon program, MTIA, is the least-proven leg of the stack, which is why it still leans on Nvidia, AMD, and rented AWS capacity to do the work TPUs do for Google. [3] Follow the optimistic case all the way to its end, and Meta lands as Google minus the cloud — the exact configuration that makes “cloud is on the table” necessary in the first place. The bull and bear cases converge on the same missing layer.

The market has already begun pricing that gap. On near-identical first-quarter beats, Alphabet’s stock rose about 7% the same day Meta’s fell — the difference resting on the cloud layer that turns AI capex into outside revenue. [40] And independent modeling cited by the Financial Times puts most of the hyperscalers, Meta included, at negative implied returns on AI investment through 2030, even on the generous assumption that the systems cost nothing to run; Amazon, the most mature cloud monetizer, is the lone positive, at roughly 7%. [41]

If Meta does exercise it — if it stands up external compute sales because internal demand fell short of the contracted capacity — then the put was the plan all along, and the levers were what they looked like: a structure for sustaining a buildout whose customer Meta had not secured.

The clock that decides it is already running, and three hands move at once. The physical capacity arrives on a schedule — Prometheus at a gigawatt in 2026, El Paso in 2028, Hyperion’s five gigawatts in 2029. [13][17][42] The SPV debt amortizes on another, with refinancing risk concentrating as the late-2020s maturities meet the capacity coming online. And the depreciation assumption reconciles on a third: if the stretched five-and-a-half-year life proves optimistic, the write-downs land in precisely the 2026–2028 window Burry flagged. [23] Demand, power delivery, and refinancing have to line up on the same timeline for the optimistic case to hold. [38] Each runs on its own logic, and none waits for the others.

There is a reason the credit market is already watching rather than the equity market. Meta’s five-year credit-default swaps had no liquid market until November 2025 — there was little to insure, because until then Meta funded itself largely from its own cash rather than from debt. [43] The CDS exists today because Meta became a borrower, and it has widened alongside Oracle’s, the cohort’s weakest credit, whose five-year spread has sat near 200 basis points since the spring — its highest since the 2008–09 financial crisis, and roughly quadruple its mid-2025 level. [44] JPMorgan now sells a hyperscaler CDS basket — Alphabet, Amazon, Meta, Microsoft, Oracle — so institutions can hedge a category of risk that barely existed eighteen months ago, against five names carrying $969 billion in commitments with $662 billion of data-center leases not yet commenced. [45] The instrument to bet against Meta’s buildout was built before Meta finished building it.

Weigh it, and the tension does not fully resolve — but it tips. The advertising business is strong enough that the optimistic case cannot be dismissed, and the first-quarter underspend shows real discipline. Yet a management team that genuinely expected internal demand to fill its capacity would not need to remind shareholders, with the CDS trading and the commitments at a quarter-trillion dollars, that it could always rent out the capacity. You name the exit when you can see the scenario that requires it. The most revealing thing Zuckerberg said in May was not that the cloud is on the table. It was the condition attached: if we feel that we have overbuilt. He is pricing the probability himself.

Meta is the one hyperscaler that built the cathedral before it had a congregation. “Cloud is on the table” is the sound of a company that has noticed, and is letting its lenders know there is a door.

Notes

[1] Mark Zuckerberg, remarks at Meta’s annual shareholder meeting, May 27, 2026, as reported in Jonathan Vanian, “Mark Zuckerberg says a Meta cloud computing business ‘definitely on the table,’” CNBC, May 27, 2026.

[2] Ibid. Zuckerberg: “We haven’t done that yet because we think that we have a use for the compute,” and the option arises “if we get to a point where we feel that we have overbuilt.” The conditional framing is load-bearing for this piece: Meta did not announce a cloud business; it named an option contingent on overbuild.

[3] Meta and AWS press releases, April 24, 2026; see “Meta Becomes One of World’s Largest Customers of Amazon AI Chips,” PYMNTS, April 24, 2026 (Graviton Arm cores, “tens of millions” of cores, positioned for agentic inference). Graviton is an Arm-based CPU, not a GPU; reporting that the deal also covers Trainium/Inferentia is less firmly sourced and is not relied on here.

[4] “Of the four U.S. hyperscalers, Meta is the only one that doesn’t sell cloud infrastructure and services”; AWS holds roughly a third of the market, with Microsoft and Google together holding another third. CNBC, May 27, 2026 (n.1); TechRadar, “Meta cloud computing business ‘definitely on the table,’” May 2026.

[5] North American data-center vacancy fell to 1.4% at year-end 2025, per CBRE’s North America Data Center Trends, H2 2025; JLL’s year-end 2025 read put the primary-market vacancy rate near 1%. On Trainium supply, AWS has stated demand exceeds production: TechCrunch, “An exclusive tour of Amazon’s Trainium lab,” March 22, 2026.

[6] Meta Platforms, “Meta Reports First Quarter 2026 Results,” April 29, 2026 (guidance raised to $125–145B from $115–135B; rationale: “higher component pricing... and, to a lesser extent, additional data center costs to support future year capacity”). SEC / Meta IR.

[7] 2025 full-year capex was $72.2 billion; 2026 guidance is nearly double that figure. Fortune, “Meta just bumped its 2026 capex forecast up to as much as $145 billion,” April 29, 2026.

[8] Q1 2026 capital expenditures (including principal payments on finance leases) were $19.84 billion, below the $27.57 billion StreetAccount consensus. Meta 10-Q / “Meta Q1 earnings report,” CNBC, April 29, 2026.

[9] Shares fell roughly 7% (intraday as much as ~10%) following the capex guidance raise. CNBC (n.8); Yahoo Finance, April 30, 2026.

[10] Non-cancelable contractual commitments of $237.67 billion as of March 31, 2026, described in the 10-Q (Note 8, Commitments and Contingencies) as “mostly related to third-party cloud capacity arrangements and continued investments in servers and network infrastructure, data centers, and consumer hardware products in Reality Labs,” with ~$42.25B due in 2026 and ~$47.65B in 2027; cash, cash equivalents and marketable securities of $81.18 billion. Meta Q1 2026 10-Q, SEC.

[11] Operating and finance leases not yet commenced of approximately $182.88 billion as of March 31, 2026, “consisting of data centers, colocations, and certain network infrastructure,” commencing between the remainder of 2026 and 2036, with terms from greater than one year to 30 years. Meta Q1 2026 10-Q, Note 8, SEC.

[12] Susan Li, Meta Q1 2026 earnings call, April 29, 2026: “These multiyear cloud deals and our infrastructure purchase agreements drove a $107 billion step up in our contractual commitments this quarter.” Meta Q1 2026 earnings call transcript.

[13] Hyperion: Richland Parish, Louisiana; ~5 gigawatts; completion expected 2029. Blue Owl / Meta joint-venture announcement and coverage. PE Insights, “Blue Owl and Meta close record $30bn financing,” 2025.

[14] Susan Li, Meta Q1 2026 earnings call, April 29, 2026: the infrastructure investments “will support our training needs for future models and, most importantly, provide us the inference capacity necessary to deliver personal and business agents to billions of people.” Transcript (n.12).

[15] Meta Platforms, “Meta Announces Joint Venture with Funds Managed by Blue Owl Capital to Develop Hyperion Data Center,” October 2025 (Blue Owl 80% / Meta 20%; ~$27B debt to PIMCO and other investors plus ~$2.5B equity; largest private-credit data-center deal on record).

[16] Bonds issued by the Beignet vehicle were rated A+ by S&P (single agency, reflecting Meta’s backing), priced at a 6.58% yield (~225 bps over Treasuries), fully amortizing, maturing 2049. Yahoo Finance / WSJ, “Meta’s $27 billion bet,” October 31, 2025; PE Insights (n.13).

[17] “Sopaipilla”: ~$13 billion SPV for a gigawatt-scale data center in El Paso, Texas, expected online 2028; Morgan Stanley and JPMorgan leading and, unlike the PIMCO-anchored Hyperion deal, may offer the debt to capital-markets investors rather than place it with an anchor. Bloomberg, via “Meta Taps Morgan Stanley, JPMorgan for New Data Center Deal,” Advisor Perspectives, May 5, 2026.

[18] On the inadequacy of the thin equity cushion in these data-center SPVs — typically on the order of 10% equity against a debt-heavy structure — see Paul Kedrosky, “SPVs, Credit, and AI Datacenters,” June 2025. Reported Meta vehicles, including a triple-net leaseback arrangement involving Apollo, have been described at roughly 90% debt / 10% equity (Covenant Lite, “Meta’s $29 Billion Bet with Apollo,” July 2025); whether that arrangement is distinct from the Blue Owl–led Hyperion financing or an earlier account of the same raise is not independently confirmed, and the body does not treat it as a separate vehicle.

[19] Carrying amount of long-term debt (fixed-rate senior unsecured notes) of $58.75 billion as of March 31, 2026. Meta Q1 2026 10-Q, Note 7, SEC.

[20] Meta structured the Hyperion leases in four-year increments so rating agencies would not treat them as debt. The Information, “The Creative Dealmaking Behind Meta’s $30 Billion Data Center Financing,” reported via Michael Parekh, November 2025.

[21] Meta Platforms Form 8-K, FY2024 results: “In January 2025, we completed an assessment of the useful lives of certain servers and network assets, which resulted in an increase in their estimated useful life to 5.5 years, effective beginning fiscal year 2025... we expect this change in accounting estimate will reduce our full year 2025 depreciation expense by approximately $2.9 billion.” SEC.

[22] Amazon shortened the useful life of a subset of its servers and networking equipment to five years in early 2025, citing the rapid pace of AI and machine-learning innovation — the opposite direction to Meta. DeepQuarry, “Depreciation of GPUs: between useful lives and useful myths,” December 2025.

[23] Michael Burry’s late-2025 argument that hyperscalers understate depreciation by using five-to-six-year lives for hardware with a real economic life closer to two-to-three years, estimated at ~$176 billion of understated depreciation industry-wide across 2026–2028; Nvidia publicly rebutted. WSJ, “The Accounting Uproar Over How Fast an AI Chip Depreciates,” December 8, 2025; CNBC, November 25, 2025. (Burry comparison to Cisco circa 2000, not Enron.)

[24] Paraphrased from the financing analyst quoted on the Hyperion structure: Meta must build the facility, place workloads in it, and presume future monetization of those workloads. WSJ via Yahoo Finance, October 31, 2025 (n.16).

[25] Yann LeCun, interview with Melissa Heikkilä, Financial Times, published January 2, 2026: Llama 4 benchmark “results were fudged a little bit,” the team “used different models for different benchmarks to give better results,” and Zuckerberg “lost confidence in everyone who was involved” and “sidelined the entire GenAI organisation.” FT (subscription); reproduction: Fast Company, “Yann LeCun: Meta ‘fudged’ on Llama 4 testing,” January 2026.

[26] Eleven of the fourteen researchers who created the original Llama left Meta; LeCun departed in November 2025. Maginative, “Meta Goes All-In on ‘Superintelligence,’” June 2025; The Next Web, “Meta hires five Thinking Machines Lab founders,” April 2026.

[27] “Behemoth” (the planned ~2-trillion-parameter flagship) was repeatedly delayed on performance and not released in promised form; the GenAI organization was sidelined ahead of the Superintelligence Labs reorganization. Maginative (n.26); Wikipedia, “Meta Superintelligence Labs” (secondary, for chronology only).

[28] Muse Spark scored 52 on the Artificial Analysis Intelligence Index v4.0, fourth globally behind Gemini 3.1 Pro (57), GPT-5.4 (57), and Claude Opus 4.6 (53); Llama 4 Maverick scored 18. Artificial Analysis was given early access to benchmark independently. Artificial Analysis, “Muse Spark: everything you need to know,” April 8, 2026. Note: Meta’s own claim of 50.2% on Humanity’s Last Exam used a multi-agent “Contemplating” mode with tools; the independent single-agent figure was 39.9%. Treat vendor mode-specific claims separately. The #4 ranking reflects the index at launch (April 8, 2026); the leaderboard has since shifted as newer models posted higher scores.

[29] Artificial Analysis (given early access by Meta) scored Muse Spark 52 on its Intelligence Index v4.0, 4th at launch. On GDPval-AA — Artificial Analysis’s evaluation of economically valuable, real-world office tasks — Muse Spark scored roughly 1,427 Elo (Meta’s own reported figure was 1,444), behind GPT-5.4 (~1,672) and Anthropic’s Claude Opus 4.6 (~1,606) and Sonnet 4.6 (~1,648), though ahead of Gemini 3.1 Pro Preview (1,320); it likewise trailed the leaders on Terminal-Bench Hard. Meta flagged long-horizon agentic systems and coding workflows as areas of continued investment. Most non-composite Muse Spark figures are Meta-reported: because the model is closed (no open weights; Meta AI app and a private API preview only), independent evaluators such as Vals.ai and BenchLM had not posted independent scores as of late May 2026. Artificial Analysis, “Muse Spark: everything you need to know,” April 8, 2026; VentureBeat, April 8, 2026.

[30] Muse Spark launched closed-weight, distributed free through the Meta AI app/website and rolling out as the default assistant across Meta’s platforms and Ray-Ban glasses, with no first-party public API at launch (Artificial Analysis benchmarked it via early access; Bloomberg reported the design and code would not be made public). Artificial Analysis (n.28); aitoolbriefing, “Meta’s Muse Spark Drops — And It’s Closed Source,” April 9, 2026. API availability may change; the claim is specific to launch.

[31] Meta is testing Meta AI subscriptions at $7.99 and $19.99 per month. Intellectia, “Zuckerberg: Meta May Enter Cloud Computing Market,” May 2026.

[32] Meta Q1 2026: revenue $56.31B (+33% YoY, fastest since 2021); ad impressions +19%, price per ad +12%; income from operations $22.87B; free cash flow $12.4B. Net income $26.77B included an $8.03B tax benefit (underlying EPS $7.31). Meta Q1 2026 release / 10-Q (n.6, n.8, n.10); CoinDCX earnings recap, April 2026.

[33] Meta’s full-year free cash flow was $43.59 billion in 2025 (Meta Q4/FY2025 release, SEC 8-K). Sell-side projections for 2026 fall sharply as capex roughly doubles — one widely cited Street estimate has full-year free cash flow dropping toward the high single-digit billions (IND Money, citing Street estimates) — and several analysts now model free cash flow turning negative across the AI-infrastructure cohort in 2026–2028; Barclays specifically projected a roughly 90% decline in Meta’s 2026 free cash flow after the raised guidance. CNBC, “Tech AI spending approaches $700 billion in 2026, cash taking big hit,” February 6, 2026.

[34] Meta named Dina Powell McCormick president and vice chairman, announced January 12, 2026; she spent 16 years at Goldman Sachs, where she led its Global Sovereign Investment Banking business, served as deputy national security adviser in the first Trump administration, and most recently was president at BDT & MSD Partners. She had been a Meta board member from April to December 2025. Axios, “Meta taps Dina Powell McCormick as president and vice chairman,” January 12, 2026; Advisor Perspectives, January 12, 2026.

[35] Zuckerberg said Powell McCormick would focus “on partnering with governments and sovereigns to build, deploy, invest in, and finance Meta’s AI and infrastructure”; Meta added that she would “drive an effort to build new strategic capital partnerships and find innovative ways to expand our long-term investment capacity.” Axios (n.34); AGBI, “Meta hires former Trump adviser to focus on Middle East deals,” January 16, 2026.

[36] Microsoft, OpenAI, and Amazon have made AI-infrastructure investment deals with Gulf-based sovereign-wealth funds, many focused on building data centers in the US and the Gulf. AGBI (n.35).

[37] In the late-1990s telecom buildout, the large majority of fiber laid sat dark for years and bandwidth prices collapsed; the surplus later became the backbone of Web 2.0, benefiting those who acquired it cheaply rather than those who financed it. “The AI Infrastructure Bubble,” Development Corporate, November 2025.

[38] AI-infrastructure debt is reaching retail retirement accounts through target-date and core bond funds; the bull case “requires demand, power delivery, and refinancing to line up on the same timeline.” Seeking Alpha, “Your 401(k) Is Funding AI’s Data Center Buildout,” May 14, 2026.

[39] Google Cloud sells external access to its Tensor Processing Units (TPUs) — the custom silicon that also trains Gemini and serves Google’s own products to over a billion users — through Compute Engine, Google Kubernetes Engine, and the Vertex AI / Gemini Enterprise Agent Platform, and offers Gemini models commercially on the same platform. “Tensor Processing Units (TPUs),” Google Cloud product page, accessed May 2026.

[40] On April 29–30, 2026, Alphabet and Meta both beat first-quarter estimates and both raised capital-expenditure guidance, yet Alphabet’s stock rose roughly 7% while Meta’s fell roughly 7% — a divergence widely attributed to Alphabet (like Amazon and Microsoft) operating a cloud business that converts AI investment into external revenue, which Meta lacks. CNBC, “Investors still trust Google more than Meta when it comes to spending their money on AI,” April 30, 2026.

[41] Modeling by Panmure Liberum, cited by the Financial Times, finds that most major US hyperscalers — Microsoft, Alphabet, Meta, and Oracle — show negative implied returns on AI investment over 2025–2030, even under the generous assumption that building and running the AI systems costs effectively nothing; only Amazon is positive, at roughly 7.2%, reflecting its more mature external cloud monetization. One published account put Meta’s implied figure near −29%. This is forward-looking modeling, not realized return. IBTimes UK, “Big Tech’s AI Gamble Shows Negative Returns Despite Surge in Spending,” May 30, 2026; figure for Meta via Sherwood/Yahoo Finance coverage of the same FT analysis.

[42] Prometheus, a ~1-gigawatt data center, is scheduled to come online in 2026. Trending Topics, “Meta’s Comeback: Muse Spark,” April 12, 2026.

[43] Meta’s (and Alphabet’s) five-year CDS did not begin trading until November 2025; before that these companies funded AI expansion from their balance sheets rather than debt markets, so there was little single-name CDS interest. Mellon Investments, “Record-Breaking AI-Related Debt Issuance in 2025,” December 15, 2025 (Bloomberg data).

[44] Oracle’s five-year CDS has sat near 200 basis points since spring 2026 — its highest since the 2008–09 financial crisis and roughly quadrupled from its mid-2025 level (≈198 bps reported late March–April 2026). BondbloX, “Oracle’s 5Y CDS Spread Hits All-Time Highs,” March 31, 2026; The Motley Fool / Yahoo Finance, April 10–11, 2026. A specific basis-point level for Meta’s own CDS is not independently confirmed here and is deliberately not stated.

[45] JPMorgan launched a hyperscaler CDS basket (Alphabet, Amazon, Meta, Microsoft, Oracle) in March 2026, in $25M blocks with $5M per name; the five issued $121B in bonds in 2025 (vs. a $28B annual average 2020–2024), with total commitments of $969B and $662B in data-center leases yet to commence. Winbuzzer, “JPMorgan Launches CDS Basket to Hedge AI Debt Risk,” March 24, 2026 (citing Fortune).

Two Chips, One Decade, One Winner

Julien Simon — Wed, 27 May 2026 09:14:34 GMT

On May 19, 2026, Sundar Pichai stood on the Google I/O stage and made a claim that would have been science fiction when Google first ran its own AI chip in its data centers a decade ago. Gemini, he said, was now trained across more than a million of Google’s own Tensor Processing Units, distributed across data centers on multiple continents, stitched into a single logical cluster, with no Nvidia hardware anywhere in the loop. The chip that began life as an internal cost-saving project, a way to keep Google’s own search and translation workloads off other people’s silicon, was now training and serving one of the world’s frontier models end to end.

The same week, Amazon was telling a different story about its own decade-old silicon bet, though it was dressed in the same language. Trainium, Amazon’s custom AI chip, had “momentum.” Two of the largest AI labs in the world had committed to it. Andy Jassy told CNBC that “the two largest AI labs are both significantly betting on Trainium.” On paper, the two companies were making the same boast: our AI runs on our chips, not Nvidia’s.

Only one of those statements is load-bearing. The question that separates them is not who built a chip. Amazon and Google both did, starting at almost the same moment a decade ago. It is whose chip pulls its own demand, and whose chip has to have its demand bought for it.

The same bet

The two programs are almost exactly the same age. Google built its first TPU in 2015 to run its own neural networks more cheaply than it could on bought GPUs. That same year, Amazon bought Annapurna Labs, the Israeli design house behind its custom silicon: the Nitro networking chips first, then the Inferentia and Trainium AI chips that followed by the end of the decade. Both companies were chasing the same prize, and it is worth being precise about what that prize is.

Nvidia’s gross margin on AI hardware runs around 75 percent.[1] Every GPU-hour a hyperscaler sells carries that margin, paid to Nvidia. At the scale of AWS or Google Cloud, the arithmetic is brutal: the more AI compute you sell, the more of your customers’ money flows straight through your data centers and out to Santa Clara. Building your own chip keeps that margin instead of passing it along. The logic is identical for both companies.

What differs is what each company had to put on the other side of the equation, and that difference is the whole story. A custom chip is worthless without a workload to run. Silicon matures through use: each generation exposes the bottlenecks that the next generation fixes. The question for any custom-silicon program is: where does that workload come from? Google had an answer that Amazon did not.

Google made it

Google’s answer was Gemini. Because Google builds its own frontier model, it has a workload deep enough, demanding enough, and large enough to pull its silicon up the maturity curve generation after generation. The TPU did not have to win customers in a bake-off. It had to serve Google, and Google made sure each chip generation was shaped by what training and serving Gemini actually required.

The result, several generations in, is a chip line that has split to match the work. Google’s newest TPUs come in two variants: a training-optimized part and an inference-optimized part.[2] The training part is built for compute-bound pretraining, where Google claims roughly three times the per-pod performance of the prior generation, scaling near-linearly toward a million-chip logical cluster. The inference part handles the opposite problem: the memory-bound work of generating tokens one at a time, with 288 gigabytes of high-bandwidth memory and a large on-chip cache, tuned for the latency-sensitive serving that agentic workloads demand. This is the disaggregation of the inference problem into purpose-built hardware, and Google does it inside one chip family, on its own silicon.

The clearest evidence that the bet worked is in the pricing. When Google released Gemini 3.5 Flash in May 2026, it priced the model at $1.50 per million input tokens and $9.00 per million output tokens.[3] That was a threefold increase over the previous Flash generation. The model still undercut comparable frontier models on cost while claiming output speeds several times theirs. A company can only price like that if it owns its cost base. Google is not paying Nvidia’s margin on the tokens Flash generates; it is paying its own fabrication and power costs and amortizing its own chips. The price is proof that the silicon escape succeeded: Google has a cost floor that its GPU-dependent competitors cannot match, and it is beginning to use it as a weapon.

None of this required Google to buy a single customer; the demand was already inside the building. “Made it” means made it for Google’s own purposes — escaping Nvidia’s margin on a cost structure Google controls. Whether the TPU ever becomes a chip that other companies rent in volume is a separate and real question. But escaping the margin was the best, and Google has the receipts.

None of which makes Google’s books innocent. Alphabet booked an even larger markup last quarter than Amazon did: some $36.9 billion in gains on its own private-company stakes, including Anthropic and SpaceX, flattering net income the same way. The accounting game is industry-wide. But it is a separate game from the silicon. The difference is dependence: Google’s markup is gravy on a chip that already works and a cloud business growing fast, whereas Amazon, as the next section shows, leans on its markup to carry a quarter that the operating business did not.

Amazon had to anchor it

Amazon’s problem is that it has no Gemini. It acquired the silicon. Annapurna gave it the design talent, and Trainium is a real chip whose third generation is a genuine step up. What it could not acquire was a workload to pull that chip forward. So Amazon had to buy one. And the way it bought it gives the game away.

Consider the two labs Jassy points to. The first is Anthropic. Amazon has put roughly $8 billion into Anthropic since 2023, and in April 2026 committed up to $25 billion more. In return, Anthropic trains Claude on Trainium and has agreed to consume up to five gigawatts of the chips, housed partly in an $11 billion data center campus Amazon built for it in Indiana.[4] The second is OpenAI. In February 2026, Amazon committed up to $50 billion to OpenAI: $15 billion upfront in preferred stock, $35 billion more contingent on OpenAI completing an IPO or hitting undefined milestones. As part of the same deal, OpenAI agreed to consume 2 gigawatts of Trainium capacity, and AWS became the exclusive third-party cloud distributor for OpenAI’s enterprise platform, Frontier.[5]

Look at what each lab actually received in exchange for betting on the chip. Anthropic’s commitment sits on top of Amazon’s equity. OpenAI’s commitment came bundled with up to $50 billion and exclusive distribution rights to enterprise customers it could not otherwise reach through AWS. Neither commitment is a price-performance verdict on Trainium. Trainium has smaller customers who took no equity. The claim here is narrower and harder to wave away: its flagship, frontier-scale demand, the demand Jassy cites as validation, was bought. The two anchor commitments are the consideration in much larger strategic deals, and in OpenAI’s case, the connection is not interpretive. Amazon’s own regulatory filing states that the equity investment and the cloud partnership are contractually linked: if the collaboration agreement terminates, the $35 billion equity commitment dies with it.[6] The money and the chip commitment are bound together in the contract.

The strongest evidence that this is procurement rather than merit is what these same labs do when money is not attached. OpenAI runs an aggressively multi-cloud strategy: it has a custom-ASIC deal with Broadcom, buys Nvidia GPUs through multiple clouds, and has committed to AMD. In the same round that included Amazon’s $50 billion investment, OpenAI committed to 5 gigawatts of Nvidia’s next-generation systems, more than twice its Trainium commitment.[7] When OpenAI allocates compute on the merits, it goes substantially to Nvidia. The Trainium slice is the one with Amazon’s equity stapled to it.

And salvage it

Anchoring the chip to bought demand is the first move. The second is admitting it cannot finish the job alone. In March 2026, AWS announced a partnership with Cerebras to deliver fast inference through its Bedrock platform. The architecture is revealing. Trainium handles “prefill”: reading and digesting the prompt, the fast-parallel part. Cerebras’s wafer-scale chips handle “decode”: writing the answer back one token at a time, the slow sequential part that determines how fast a response feels. There, Cerebras claims an order-of-magnitude speed advantage over conventional hardware.[8] One industry analyst put the implication plainly: by splitting inference across two companies’ chips, “AWS is betting that no single chip architecture can win alone.” That is a precise description of an admission. Amazon went outside its own silicon for the half of inference that matters most for the agentic, token-hungry workloads everyone is racing toward. Google does not hand the decode stage to a third party’s silicon; it builds its own inference chip.

The third move is in the financials, and it is the one that turns the argument into evidence. If Trainium were winning on merit, Amazon’s chip strategy would show up as cash: customers paying for compute, margin retained instead of forwarded to Nvidia. Instead, the most important number Amazon’s chip strategy produced last quarter was an accounting entry.

In the first quarter of 2026, Amazon reported net income of $30.3 billion, up 77 percent year over year, a headline blowout. But $16.8 billion of the pre-tax income behind it was a non-cash, non-operating gain: the markup on Amazon’s Anthropic stake, triggered when Anthropic’s latest funding round reset the valuation, and Amazon revalued its holding.[9] After tax, that single mark-to-market entry was larger than Amazon’s entire year-over-year increase in net income: the company’s headline profit growth was, in effect, the markup. Strip it out and roughly $23 billion of pre-tax income remains, up from the prior year but unspectacular. The gain itself cost nothing and produced nothing. Under the accounting rule that governs it, the gain reverses only if a future Anthropic transaction reprices the stake downward, or the holding is impaired — not a number Amazon can spend, and one that can run backward as easily as forward.[13]

Now set that against the cash. Over the trailing twelve months, Amazon’s free cash flow fell to $1.2 billion, down 95 percent from $25.9 billion a year earlier. Net capital expenditure over the same period climbed to roughly $147 billion, the overwhelming majority of it AI infrastructure.[10] The company posting record AI-era profit is generating almost no free cash, and the profit growth that made the headline is a revaluation of a startup Amazon itself funds and supplies.

The cash collapse, to Amazon’s credit, is largely a choice, not distress. Free cash flow fell because Amazon elected to spend roughly $147 billion on AI infrastructure, and the operating business underneath is healthy: AWS grew 28 percent year over year, its fastest in several quarters, and segment operating income rose. A company can spend its cash flow into the ground on purpose and be sound. So the depressed cash is not, by itself, the indictment. The indictment is narrower: the profit growth the market celebrated came from none of that operating strength. It came from marking up a private stake. Strip the Anthropic gain and the quarter was solid and unspectacular. The blowout was an accounting event.

This is the circuit that holds the salvage together. Amazon invests equity in Anthropic; Anthropic commits to spend on Trainium and AWS. That spending returns as AWS revenue and as evidence of Trainium “traction”. Anthropic raises its next round at a higher valuation. Amazon marks up its stake and books the gain as profit. Each loop raises the mark. The chip’s flagship demand and the quarter’s profit growth trace to the same root: the roughly $8 billion Amazon had invested in Anthropic by the time the stake was marked. The capital goes out as investment, returns as Trainium and AWS revenue, and then appreciates. The appreciation on that stake, not any cash Anthropic paid out, is what carried net income to a record. The money does double duty: once as evidence of silicon momentum, once as reported profit. It is an unusual position — being able to influence the marked value of your own largest asset by doing business with it.[11]

Why one made it and the other didn’t

The difference between the two companies is not intelligence or execution. It comes down to a single asset that cannot be faked: a captive frontier workload. Google’s TPU is pulled up the maturity curve by a model that uses it on the merits, paid for by Google’s own economics, answerable to no outside buyer — captive-demand pull. Amazon’s Trainium is pushed forward by tenants it bought with equity and distribution — procured-demand push. The two can look identical in a press release (”the largest labs run on our chip”), but one is a workload choosing the best tool it has, and the other is a tool that had to purchase its workload.

This is why the same test that condemns Trainium clears the TPU. Both chips are attractive partly because Nvidia is scarce and expensive, but that is not the distinction. The distinction is the counterfactual. Strip away the scarcity premium, and Trainium loses its rationale. Its demand was assembled to fit the shortage: anchor tenants routed to it by equity, overflow capacity so tight that Jassy says Amazon is considering selling racks directly.[12] Strip the same premium from the TPU, and Google still has a frontier model running on it every day, for reasons that have nothing to do with GPU availability. Captive demand survives the counterfactual. Procured demand does not.

The mirror

This is the inverse of a pattern this newsletter has traced before. In “Compute Equals Commitments,” the dynamic was a chipmaker funding its own customer’s purchases — round-trip revenue dressed as demand. Here the same financial structure appears one layer up: a cloud provider funding the labs that validate its chip, and booking the resulting equity markup as profit. The round trip is the same; only the layer has moved. The Annapurna bet was the right instinct — Amazon was early to see that owning the silicon mattered. It was just never able to feed the chip the way Google feeds the TPU.

What would have to break

The honest case against this verdict rests mostly on OpenAI, and it deserves a fair hearing. OpenAI is not a captive Amazon subsidiary; it is a genuinely multi-cloud lab that could have said no. Its willingness to put two gigawatts on Trainium is a real data point, and if Trainium were worthless, a company with OpenAI’s options would not have agreed to run on it at all. That is true, and the piece concedes it: the chip is not bad. But “not bad” is not “won.” OpenAI’s commitment came stapled to up to $50 billion in Amazon investment — $15 billion of it funded so far, the rest contingent on an IPO that has not happened — and to exclusive enterprise distribution. Its merit allocation, the five gigawatts of Nvidia capacity in the same round, went elsewhere, more than twice the Trainium commitment. Procured is not coerced, but neither is it chosen on the basis of price-performance.

So the verdict is falsifiable and worth stating in terms that could fail. If Trainium wins a large frontier customer that is neither funded by Amazon nor bundled with distribution it cannot get elsewhere, the salvage thesis weakens. If the Cerebras dependency ends because a future Trainium wins the decode stage outright, one of the three tells falls. If Amazon’s free cash flow recovers while the Anthropic markup stays flat — proving the operating business stands on its own — the financial tell dissolves. And if Google’s TPU never escapes its own data centers to win external cloud customers, then “made it” is too strong, and Google has merely built an excellent internal tool rather than a competitive product. Each of those is a real possibility, and each would move the verdict.

But on the evidence available now, the two-decade-old silicon bets have not converged. Google built a chip that its own frontier model pulls forward, prices its products off a cost base it owns, and needs far less outside capital to keep improving. Amazon built a chip that it must supply with purchased tenants, finish with a rented decode engine, and validate with an accounting gain while its cash disappears into the build-out. Both companies can say their AI runs on their own silicon.

Only one of them is telling you the whole sentence.

Notes

[1] NVIDIA Corp, Form 10-Q for the quarter ended April 26, 2026 (Q1 FY2027): GAAP gross margin 74.9%; full fiscal 2026 GAAP gross margin 71.1%. NVIDIA does not separately disclose a Data Center segment gross margin; with Data Center at ~92% of revenue, the consolidated figure is the best available proxy, and the “~75%” in the body refers to that consolidated GAAP margin.

[2] Google Cloud, “Ironwood is here: our eighth-generation TPU for the agentic era” (April 2026). Specifications are vendor-published and not independently benchmarked: ~3× per-pod performance and near-linear scaling toward a million-chip logical cluster (training part); 288 GB high-bandwidth memory and on-chip cache (inference part); up to 2× performance-per-watt. Treat as vendor specifications.

[3] Google, official Gemini API pricing (ai.google.dev, updated May 19, 2026): Gemini 3.5 Flash at $1.50 / million input tokens, $9.00 / million output tokens (incl. thinking tokens), $0.15 / million cached input tokens — roughly 3× the prior Flash generation’s list pricing. The “undercuts comparable frontier models / several times the speed” framing is Google’s own, presented at I/O, and the speed comparison is vendor-claimed.

[4] Amazon’s Anthropic investment: ~$8 billion in tranches from September 2023 (initial $1.25B; $4B completed March 2024; further $4B announced November 2024), initially convertible notes, partially converted to equity. On April 20, 2026, Amazon committed up to $25 billion more — $5 billion immediately (at Anthropic’s $350B valuation), up to $20 billion tied to commercial milestones. The Q1 2026 markup discussed below was on the ~$8B already invested; the additional commitment closed after quarter-end. The up-to-five-gigawatt Trainium commitment and the ~$100B AWS spend are Anthropic commitments to consume AWS/Trainium capacity. Project Rainier (Indiana; ~500,000 Trainium2 chips scaling toward 1 million, $11B site) is the dedicated buildout.

[5] OpenAI, “OpenAI and Amazon announce strategic partnership” (Feb 27, 2026): $50 billion total Amazon investment, $15 billion initial (OpenAI Series C Preferred Stock), $35 billion contingent; OpenAI to consume 2 GW of Trainium; AWS as exclusive third-party cloud distributor for OpenAI Frontier. See also Amazon Form 8-K, EX-99.1 (Feb 2026).

[6] Amazon Form 8-K (Feb 2026) and accompanying agreement disclosure: the equity investment and cloud partnership are contractually linked; the $35 billion contingent equity commitment (stated as $34,999,999,447.98) terminates if the Joint Collaboration Agreement terminates, and is contingent on conditions including an OpenAI IPO or direct listing, expiring if not invested by Dec. 31, 2028.

[7] OpenAI’s $110 billion round (Feb 27, 2026; $730 billion pre-money) included $30 billion each from NVIDIA and SoftBank alongside Amazon’s $50 billion; OpenAI’s NVIDIA commitment in connection with the round was 5 GW of Vera Rubin-generation capacity, versus 2 GW of Trainium. OpenAI’s separate Broadcom custom-ASIC and AMD commitments are company-announced.

[8] AWS and Cerebras, “AWS and Cerebras Collaboration Aims to Set a New Standard for AI Inference” (March 13, 2026): AWS Trainium optimized for prefill, Cerebras CS-3 optimized for decode, connected via Elastic Fabric Adapter, delivered exclusively through Amazon Bedrock. Cerebras’s decode speed advantage is vendor-claimed. The “no single chip architecture can win alone” reading is an analyst characterization, not AWS’s.

[9] Amazon Form 8-K, EX-99.1, Q1 2026 (quarter ended March 31, 2026): “First quarter 2026 net income includes pre-tax gains of $16.8 billion included in non-operating income from our investments in Anthropic.” Income before income taxes was $39.834 billion; $16.8B / $39.834B = 42.2%. (Some contemporaneous reporting characterized the gain as “more than half” of pre-tax income; the filing figure is ~42%.)

[10] Amazon Form 8-K, EX-99.1, Q1 2026. Headline “Free cash flow” line (TTM operating cash flow less purchases of property and equipment): $1.232 billion for the TTM ended March 31, 2026, versus $25.925 billion a year earlier. Amazon publishes alternative FCF measures that differ; the headline line is used here. TTM net property-and-equipment purchases ~$147 billion. AWS Q1 2026 revenue grew 28% YoY with segment operating income up, per the same release.

[11] On the structural point — that a company can influence the marked value of an asset through its own business dealings with the issuer — see the disclosure mechanics in Amazon’s Q1 2026 Form 8-K [9] and the ASC 321 measurement-alternative note [13]. The characterization is the author’s, drawn from the filing’s own description of the Anthropic gain.

[12] Andy Jassy, Amazon Q1 2026 earnings commentary and CNBC interview (Feb 27, 2026): Trainium demand sufficiently high that AWS was considering selling Trainium racks directly; “the two largest AI labs are both significantly betting on Trainium.” AWS reported 28% YoY revenue growth in Q1 2026, per Amazon’s Q1 2026 results.

[13] Equity stakes in private companies such as Anthropic are generally accounted for under ASC 321’s measurement alternative: carried at cost and remeasured to fair value only on an observable transaction (e.g., a new funding round) for similar securities of the same issuer, or on impairment. An up-round markup is unrealized and non-cash; it can reverse on a later down round or impairment, and becomes cash only when realized through a sale or liquidity event.

Huawei Can’t Buy EUV. It Says It Doesn’t Need To.

Julien Simon — Mon, 25 May 2026 11:16:25 GMT

On Monday in Shanghai, He Tingbo — president of Huawei’s semiconductor business and chair of its Scientist Committee — stood in front of a room at the IEEE International Symposium on Circuits and Systems and told the industry it had spent six decades optimizing for the wrong thing.[1] Her keynote, “New Semiconductor Path in Practice,” proposed retiring the principle that has organized the entire business since 1965: shrink the transistor, double the count, repeat. In its place, she offered the “Tau (τ) Scaling Law” — already nicknamed “Her’s Law” by her peers — which optimizes not for how small a transistor is but for how fast a signal moves through the chip.[2] “I used to think it may take us 10 years,” she told the room, “but six years, we are here.”[1]

The French wire that crossed my desk called it “un nouveau mode de fabrication de puces” — a new way of manufacturing chips.[3] It is not. That distinction is the entire story, and getting it wrong is how a reader ends up either over- or under-pricing what just happened.

Huawei did not announce a manufacturing breakthrough. It announced a design breakthrough, executed on the manufacturing it already has. The company is process-constrained: its chips lean on SMIC’s roughly 7-nanometre-class nodes, several generations behind the 3nm-class processes feeding Apple, Qualcomm, and AMD — and behind the 2nm that TSMC is now ramping.[4] What it claims to have built is a way to extract frontier-class transistor density from trailing-edge fabrication — by changing the layout, not the lithography.

The mechanism is called LogicFolding. In a conventional layout, logic blocks sprawl across a mostly flat plane. The limiting factor is increasingly not how fast the transistors switch, but how long it takes a signal to cross the long, resistive wires between them — a delay that caps the clock and wastes energy driving the interconnect. LogicFolding “folds” the logic — expanding the layout from one layer to two — pulling critical paths closer together, shortening the wiring, cutting propagation delay, and packing more transistors into the same footprint.[5] Huawei says the fall 2026 Kirin gains 53.5% in transistor density, to 238 million transistors per square millimeter, alongside a 40% jump in performance-core power efficiency and a 3.1GHz top clock.[6] That density figure sits, on paper, near Intel’s 18A and TSMC’s 3nm.[7] The phone is only the showcase: Huawei frames the same time-scaling logic running up through its UnifiedBus interconnect to the AI clusters, where it is trying to displace Nvidia.[8]

This is neither vaporware nor a triumph — it is a genuine engineering idea aimed at a real bottleneck. The honest steelman comes from Omdia’s semiconductor research director, He Hui, who calls it a shift from node-driven scaling to “system-level efficiency scaling”—in his view, a credible way to wring more performance out of constrained lithography.[9] Interconnect delay is genuinely the dominant frontier problem, and stacking silicon to address it is not new: HBM has stacked memory since 2015, and TSMC and Intel have stacked finished dies with SoIC and Foveros. What Huawei claims is harder and less proven — folding a single logic block’s own gates across two bonded tiers so signals take a short vertical hop instead of a long planar route. That is logic-on-logic at the cell level, the territory the whole industry has been circling for a decade, because the thermal and yield problems are brutal. The difference from the rivals’ version is that theirs still rides on leading-edge fabs Huawei cannot buy.

But density on a slide is not density at competitive yield, power, and thermals. The most important sentence published all day came from Paul Triolo of DGA Group: a stacked or folded design can produce genuine density gains, he said, but it “does not mean Huawei has solved” the yield, power, thermal, and device-performance problems of true 1.4nm-class manufacturing.[10] Counterpoint’s Neil Shah was blunter on the strategic point: this “parallel semiconductor path is still unproven at scale.”[11] And the headline number — a transistor density “equivalent to” a 1.4nm process — is not a 2026 result. It is a 2031 projection, unaccompanied by any independent performance data.[12] Stacking buys you density. It does not automatically buy you the efficiency that makes density useful at the frontier.

So strip the projection away and look at what actually shipped: a strategic reframe. The export-control regime was architected on the premise of manufacturing. Deny China extreme ultraviolet lithography — the ASML machines no Chinese firm can legally buy — cap it at 7nm, and the density frontier stays out of reach.[13] The wall is real, and it works against the thing it was built to stop. What it cannot do is stop Huawei from deciding that the frontier is no longer defined by the dimension the wall measures. If the goal is signal-propagation time rather than transistor pitch, then a control regime denominated in nanometres is policing a metric the target has stopped competing on. Even Triolo, who doubts the manufacturing claim, reads the move this way: Huawei is “turning an engineering strategy into a quasi-’law’” — shorten wires, stack logic, co-design the whole system.[10]

The reframe does not entirely escape the wall. A folded design still has to be finalized for production on EDA software, where America’s Synopsys and Cadence dominate, and fabricated on SMIC’s constrained base node. Huawei now claims home-grown design tools, but domestic EDA at the leading edge is unproven — and Washington showed in 2025 that it can switch the EDA tap off at will, before it relented.[13] The dependency is real. It simply no longer sits where the lithography rules are pointed.

The market saw the same thing even where the engineering is unproven: SMIC shares rose 7.6% on the news.[14] And the competitive backdrop sharpens it — last week, Nvidia’s Jensen Huang told CNBC his company had “largely conceded” China’s AI chip market to Huawei.[15] The Tau Law is a flag planted in the ground that Nvidia is vacating.

None of this means Huawei has closed the gap. It almost certainly has not, and the skeptics may be entirely right that folding logic across bonded tiers hits a thermal-and-yield ceiling well short of the 2031 target. But the bet is now legible, and it is falsifiable on a clock. The first checkpoint is this autumn, when the new Kirin ships and an independent teardown can confirm or puncture the density claim. The test is not whether the design works on a slide but whether Huawei can build it in volume without the chips failing — the gap, in stacked designs, where ambition usually dies.[16] The second checkpoint is 2031. If either one lands at competitive efficiency without EUV, Washington is left writing its rules in a unit that no longer measures the race.

The wall was built to keep China from making the transistors smaller. Huawei’s answer is to stop trying.

Notes

[1]: He Tingbo delivered the keynote “New Semiconductor Path in Practice” at the 2026 IEEE International Symposium on Circuits and Systems (ISCAS), Shanghai, May 25, 2026. Huawei newsroom, “HUAWEI Presents the Tau (τ) Scaling Law, Enabling Breakthroughs in Transistor Density and System Performance,” May 25, 2026, huawei.com. Vendor-primary source. The “I used to think it may take us 10 years, but six years we are here” remark, and a teaser that Huawei would “bring the surprise” before winter 2026, are reported from the keynote by BusinessToday, businesstoday.in.

[2]: The principle “proposes replacing geometric scaling with time (τ) scaling as a new guiding principle for the evolution of both semiconductors and electronic systems.” Huawei newsroom, ibid. The “Her’s Law” nickname (a play on He Tingbo’s surname and the convention of naming foundational laws after their originators, as with Moore’s Law) is reported by the South China Morning Post, “Huawei unveils new scaling law and tech that narrows gap with TSMC, Samsung,” May 25, 2026, scmp.com. τ (tau) is the time constant engineers use to describe how quickly signals propagate through a circuit.

[3]: “Huawei a développé un nouveau mode de fabrication de puces,” Boursorama (reproducing an AFP wire), May 25, 2026, boursorama.com. The “fabrication” framing is the error this piece corrects.

[4]: On the process gap: “analysts say China remains behind global leaders in the most advanced process technology,” with Huawei’s chips produced on SMIC’s 7nm-class node versus TSMC’s 2nm. Reuters, “China’s Huawei reveals chip design breakthrough amid US sanctions,” May 25, 2026, reuters via rappler.com. The Kirin 9030 (Mate 80 Pro Max) was built by SMIC on an “N+3” process, a scaled evolution of its 7nm node and still behind TSMC and Samsung, per a TechInsights teardown reported by the South China Morning Post, scmp via tech.yahoo.com.

[5]: LogicFolding “would shorten wiring inside chips and considerably improve performance”; Reuters, op. cit. (rappler.com). Huawei’s own description: the architecture “can be used to continuously compress signal propagation delay and steadily improve transistor density.” Huawei newsroom, op. cit. CNBC reported that “Huawei’s new chip architecture expands the layout from one layer to two,” per He Tingbo. CNBC, “Huawei plans new smartphone chips this fall,” May 25, 2026, cnbc.com.

[6]: Per-metric figures (vs. a conventional SoC): +53.5% transistor density to 238 MTr/mm², +40% P-core power efficiency, and +12.7% max clock frequency to 3.1GHz. These figures appear in He Tingbo’s ISCAS presentation slides as relayed by trade press; they are not stated in Huawei’s official press release, which carries only the τ Law framework, the “381 chips” and “Fall 2026 Kirin” claims, and the 2031 target (Huawei newsroom, op. cit.). Slide figures via FoneArena, “HUAWEI presents Tau (τ) Scaling Law,” May 25, 2026, fonearena.com, and Huawei Central, huaweicentral.com. Vendor-claimed presentation data; no independent verification as of publication.

[7]: The 238 MTr/mm² figure has been described as roughly comparable to Intel’s 18A and TSMC’s 3nm-class density. The comparison is density-only and does not establish equivalent power, yield, or performance; nor is it specified whether the figure is logic-only or SRAM-inclusive, which materially affects any cross-foundry comparison. Treat as vendor-claimed slide data pending an independent teardown of the shipping Kirin.

[8]: Huawei describes the τ Scaling Law operating “at the system level” by “redefining interconnect protocols for computing systems with UnifiedBus to achieve unified memory addressing and native memory semantics for SuperPoDs,” reducing system communication latency. Huawei newsroom, op. cit. (huawei.com). This situates the phone-level LogicFolding claim within Huawei’s broader AI-cluster ambition.

[9]: He Hui, director of semiconductor research at Omdia, quoted in Reuters via Rappler, “China’s Huawei reveals chip design breakthrough amid US sanctions,” May 25, 2026, rappler.com.

[10]: Paul Triolo, head of technology, Asia and Americas, DGA Group, quoted in CNBC, “Huawei plans new smartphone chips this fall as rivalry with Nvidia and Apple heats up,” May 25, 2026, cnbc.com. Full quotes: “A stacked/folded design can produce effective density gains, but it does not mean Huawei has solved the full process, yield, power, thermal, and device-performance problems associated with true 1.4 nm-class manufacturing”; and separately, “Huawei is turning an engineering strategy into a quasi-’law,’” which Triolo characterized as “more a systems-level optimization doctrine: shorten wires, stack logic, improve memory semantics, and co-design chips, packages, software, and clusters.”

[11]: Neil Shah, vice president of research, Counterpoint Research, quoted in CNBC, ibid.

[12]: “By 2031, the high-end chips HUAWEI designs based on the τ Scaling Law are expected to feature a transistor density that is equivalent to 14 Å (1.4 nm) processes.” Huawei newsroom, op. cit. “Although Huawei did not provide independent performance data, the target is significant because 1.4 nm is expected to be close to the global frontier for advanced chipmaking around the end of the decade.” Reuters via Investing.com, investing.com.

[13]: China “is widely seen as unlikely to reach that level through conventional manufacturing alone because Washington has restricted its access to advanced lithography tools and other key semiconductor technologies.” Reuters, op. cit. ASML has never shipped an EUV machine to China, and there is no credible domestic alternative — the binding reason SMIC sits several generations behind TSMC and Samsung; see TheNextWeb, “Huawei unveils ‘Tau Scaling Law’ as China’s workaround,” May 25, 2026, thenextweb.com. On the EDA dependency as a demonstrated lever: US BIS ordered Synopsys, Cadence, and Siemens EDA to halt China sales in late May 2025, then rescinded the restriction on July 2, 2025; the three firms hold roughly 80% of China’s EDA market. US Commerce/BIS via Network World, July 3, 2025, networkworld.com; EE Times, eetimes.com. The episode established that the tool dependency is a switch that can be activated, even though it is not active as of publication. At the keynote He Tingbo claimed Huawei had spent six years building domestic capabilities “including electronic design automation (EDA) tools and chip design methodologies”; BusinessToday, May 25, 2026, businesstoday.in. Domestic EDA at leading-edge nodes remains commercially unproven.

[14]: SMIC shares rose 7.6% on Monday following the LogicFolding announcement. South China Morning Post, op. cit.; Reuters via Rappler, op. cit.

[15]: Jensen Huang told CNBC the company had “largely conceded” China’s AI chip market to Huawei. CNBC, op. cit.; corroborated in Modern Diplomacy, moderndiplomacy.eu.

[16]: LogicFolding is described as cell-level “folding” — distributing a single logic block’s gates across two vertically bonded wafer tiers connected by hybrid bonding — rather than the die-to-die stacking used by HBM or by TSMC’s SoIC and Intel’s Foveros. One technical reconstruction puts the Kirin 2026 hybrid-bonding pitch at ~1.5µm (versus TSMC SoIC at <15µm and Intel Foveros at ~25µm TSV pitch), with density scaling roughly as the square of interconnect pitch; the same analysis back-calculates the density gain as 155→238 MTr/mm². These are independent analyst figures, not Huawei-published data: GlobalSemiResearch, “Huawei’s Tau Scaling Law: A Technical Deep Dive,” May 25, 2026, globalsemiresearch.substack.com; pitch comparisons via SemiAnalysis, semianalysis.com. The thermal and yield penalties of logic-on-logic stacking are long-documented; see Semiconductor Engineering, “Stacking Logic On Logic.”

Lobby, Levy, Legislate

Julien Simon — Fri, 22 May 2026 06:51:24 GMT

On May 12, in a near-empty hearing room of the French National Assembly, Arthur Mensch did something that tells you everything about how Mistral actually competes.

He didn’t talk about his models. He warned the deputies about someone else’s — and the deputies, mostly, hadn’t bothered to come. He delivered his warning about the fate of European civilization to a scattering of empty benches for ninety minutes, with the cameras running. [1]

The warning itself: Anthropic — the American lab that sits ahead of Mistral at the frontier, and whose restricted-access Claude Mythos Preview model can autonomously hunt down and exploit software vulnerabilities [2] — had been circling the French defense establishment, offering to scan the army’s code bases. Mensch’s counsel to the Republic was to keep them out. Letting a foreign model that deep into French defense, he argued, would create a dependency that is “hard to unwind.” [3]

There’s a real security argument buried in there, and Mensch made the fair version of it himself: you might reasonably not want any vulnerability-hunting model — foreign or domestic — crawling through your defense code, and he conceded in the same breath that Mistral’s own models or Chinese ones could find the same flaws. [3] But notice how neatly the sovereignty case lands on the one outcome that also protects Mistral’s existing contract with the French armed forces — and that it asks France to turn away the strongest defensive tool on the market during the worst run of data breaches in its history, a point we’ll come back to.

That convenience — a principled-sounding argument that happens, every time, to favor Mistral — is the thread running through everything Mensch has said and done this spring.

Once you pull it, the whole confusing season snaps into focus.

The eight days that looked like hypocrisy

Here is the sequence in which French Twitter called Mensch a hypocrite.

May 7. Brussels agrees to the “Digital Omnibus,” delaying the AI Act’s high-risk obligations by sixteen months — from August 2026 to December 2027. Mistral is among the industry voices that lobbied for the slowdown. [4]

May 12. Five days after winning that delay, Mensch sits before the National Assembly and warns that Europe over-regulates, that it has “heavy regulation and a fragmented market,” that the stack of GDPR, copyright rules, and the AI Act is an “empilement“ — a pile-up. [5] Europe, he says, has two years to build its own AI infrastructure or become America’s “vassal state.” [6]

May 15. It’s the front page of the Journal du Dimanche.

A man lobbies to weaken a regulation, wins, and then days later complains that Europe is over-regulated — and lands the front page doing it. The hypocrisy reading writes itself. It’s also wrong, or at least lazy. Mensch isn’t confused. He’s running a press strategy with a clock on it, and the clock matters more than the contradiction.

And the empty room is the proof. If your goal is to persuade legislators, you care whether legislators are in the seats. If your goal is the front page, the seats are set dressing, and the camera is the audience. Mensch wasn’t talking to the handful of deputies who showed up. He was talking, through them, to the JDD, to Bercy — the Finance Ministry — to the Élysée, and to every procurement officer in France who would read about it three days later.

The benches were empty because, on some level, everyone involved understood the room wasn’t the point.

Then there’s the third move people keep filing separately. In March, in the Financial Times, Mensch proposed a 1–1.5% levy on the European revenues of all AI providers — including the American and Chinese ones — to fund a European cultural pot. A tax on AI, from the CEO of an AI company. [7] And back in September, asked about France’s proposed Zucman wealth tax, he offered warm words — “at the risk of disappointing the polemicists, I’m rather convinced we need more fiscal justice in France” — while making clear in the same sentence that he could not and would not pay it himself. [8]

Pro-tax, anti-tax, pro-rules, anti-rules. It looks incoherent. It isn’t. Every one of these positions serves the same end.

Mistral isn’t winning on capability

Start with the thing the sovereignty conversation is engineered to make you forget: Mistral does not make the best models, and its own strategy quietly concedes the point.

This isn’t a knock on the engineering. Mistral’s latest flagship, Medium 3.5, is a genuinely strong model: a dense 128-billion-parameter system that posts 77.6% on SWE-Bench Verified, a respected coding benchmark, and undercuts the closed frontier models on price by roughly half. [9] Mistral does benchmark it against the frontier leaders — and that comparison is exactly the tell. On SWE-Bench, it lands about two points behind Claude Sonnet 4.6 (77.6% versus 79.6%); the pitch is not “we win,” it’s “we come close and cost less, and you can run the weights yourself.” [10] That is a deliberate, coherent position — near-frontier at a fraction of the price — and it is, by design, a second-place pitch. The pace-setting systems on reasoning and agents remain American.

Which is exactly the point. If your strategy is to be the affordable, open, sovereign alternative rather than the best model in the world — and the capex math says it has to be; Mistral’s roughly $400M in annual recurring revenue, with €1bn targeted for the year [11] sits against OpenAI’s $20bn-plus annualized run-rate as of late 2025 [12] — then “best model” can never be your moat. You need a different one.

So what is the moat?

The fashionable answer is “sovereignty.” And there’s a real version of that argument. Europe genuinely should worry about routing every critical digital service through American infrastructure governed by American law. The CLOUD Act is real. Dependency is real. Mensch is not wrong that a continent with no domestic frontier lab has no leverage.

But watch what happens when you ask which kind of sovereignty actually protects Mistral, and you find the real answer is none of the ones he names.

It isn’t technical sovereignty — data-stays-in-Europe. Microsoft, AWS, and OpenAI are all racing to offer EU data residency. That checkbox is being commoditized by the quarter.

It isn’t legal sovereignty — open weights you can self-host. Llama and Qwen are open too. A French integrator could run them on a French cloud under French law and undercut Mistral on price tomorrow.

It isn’t corporate sovereignty, either — the cleanest version of Mensch’s own case. He told the Assembly that US investors hold less than 30% of Mistral and that the founders keep strategic control, aiming for a European listing. [13] That’s true, and it does distinguish Mistral from a Microsoft-funded OpenAI. But a European cap table is a governance fact, not a competitive one. It tells you who controls the company; it tells you nothing about why a customer would choose the product over a cheaper, equally European-hosted open model. Ownership is a moat against acquisition, not against competition.

The moat that’s left, when you subtract the ones that don’t hold, is the customer list. And the customer list gives the game away.

Mistral was born inside the Rolodex

Before we read that list, rewind to where it came from. It’s tempting to picture three research “kids” who somehow assembled a blue-chip roster of backers from scratch. That gets them backward. Guillaume Lample and Timothée Lacroix were core authors of Meta’s LLaMA; Arthur Mensch came from DeepMind with his name on Chinchilla and RETRO. In the frenzied weeks after ChatGPT, they were arguably the most bankable large-model team in Europe — and all three had met years earlier at École Polytechnique, the grande école that functions as the spine of the French establishment. [14]

So they didn’t pitch their way in; the network reached out and pulled them through it. The bridge was the founders of Alan, the French insurtech unicorn: Jean-Charles Samuelian-Werve and Charles Gorintin, who introduced the team around, talked Lightspeed into leading, and worked the phones to fill the round. Gorintin and Cédric O — Macron’s former minister for digital — signed on as founding advisors. [15] The result was €105M one month after incorporation, before a single product: the largest seed in European history. [16]

And look who was already in that first cheque: Bpifrance — the French state’s own investment bank — Xavier Niel, and the shipping billionaire Rodolphe Saadé, whose CMA-CGM would two years later become Mistral’s marquee customer. The customer-patron was present at the founding. So was the political bridge, represented by Cédric O, Macron’s campaign treasurer, whose story I’ve told before.

The point isn’t that any of this was secret. It’s that none of it was. Mistral didn’t earn its way to the French establishment; it was incorporated into it. Which is why the customer list reads the way it does.

There’s a darker reading here for anyone following the question of why national AI ecosystems succeed or fail. The usual diagnosis is exclusion — the most significant builders are outsiders, the system pushed away, or had to be imported. France is the inverse. Its champion was built by the consummate insiders the system produces by design — Polytechnique, DeepMind, the right dinners — and the system’s reward for producing them was to wire them straight into state procurement. The failure mode here isn’t a talent the country couldn’t keep. It’s the opposite: a talent the country captured so completely that the product never had to compete.

For now, the Rolodex is the product

When Mensch defends Mistral’s traction, he names the same flagship customers: France Travail, CMA-CGM, Stellantis, and TotalEnergies. This spring, he added the Caisse des Dépôts, the French state investment bank. [19] Read that list not as wins but as buying decisions:

France Travail is a French government agency. TotalEnergies is a French strategic asset whose CEO doesn’t sneeze without an Élysée check-in. [20] Stellantis carries the French state’s industrial legacy through its old stake in PSA, the Peugeot-Citroën group that merged into Stellantis. [21] Caisse des Dépôts is a French state-owned institution. And CMA-CGM is owned by Rodolphe Saadé — the shipping billionaire whom Macron meets on his trips to Marseille, and who assembled BFM-TV, RMC, La Provence, La Tribune, and Brut into one of the largest newsrooms in France. [22] When La Provence ran a 2024 front page the Élysée disliked, Saadé suspended its editorial director; the journalists’ union called it political pressure. [23]

The CMA-CGM deal is the one to study. In April 2025, Saadé’s group invested €100M into Mistral and signed a five-year, $110M service contract — investor and customer, same party, the same circular structure now standard at hyperscaler scale. [24] And here’s the part that should end the “Mistral is winning enterprise on merit” story for good: the same CMA-CGM had already signed a separate $150M AI deal with Google. [25] Saadé bought Mistral the press release and Google the workload.

Now hold the strongest version of the counterargument. At the same hearing, Mensch noted that 70% of Mistral’s revenue is non-French — proof, he argued, of a genuine export champion rather than a subsidized domestic pet. [26] Take the number at face value. It doesn’t touch the argument because the moat was never part of the revenue base. Wherever that 70% lives — cross-border API calls, seat licenses, partner channels — it isn’t the source of the political moat. That lives in the flagship names, the lighthouse customers Mensch puts on the slide to validate the company to the next investor and the next government. And that list, the one that does the political and fundraising work, is almost entirely the French political-industrial complex. No Volkswagen. No Siemens. No Maersk. No ING. No Telefónica. No European reference customer of consequence outside the French orbit. Mistral may sell tokens to Europe. It anchors its credibility to the French permanent state.

That’s the moat. Not sovereignty — the President’s contact list. Mistral built a product that the most important buyers were always going to choose, and those buyers are a circle who have lunch together.

The game: legislate the Rolodex before it expires

A contact list is a fragile asset. Macron is term-limited; 2027 is coming; a contact list does not survive a change of government. So the genius — and it is genius, in a cold way — of Mensch’s spring is that every policy move converts a perishable relationship into durable law. And the conversion is self-reinforcing: each rule that raises a rival’s cost buys time, and that time is spent deepening the very relationships the rule protects, which in turn supply the political capital to write the next rule. The relationship becomes the law that governs it.

Read the spring’s moves as one design, and they line up. The 1–1.5% levy on all AI providers’ European revenue raises rivals’ operating costs in Europe and falls hardest on high-revenue American companies. A public-procurement “European preference” would codify that European equity beats European hosting — turning the Rolodex itself into a rule. The foundation-model carve-out from the AI Act trims Mistral’s own compliance bill while leaving the application-layer obligations that mostly bite US deployers. The Digital Omnibus delay buys sixteen months before any of it starts charging rent. The “vassal state” rhetoric inoculates against the obvious objection — that the French state is overpaying for a domestically preferred model. And the warning to keep the army off Anthropic’s Claude Mythos defends the single highest-value contract on the list from a stronger rival. Six moves, one direction.

Sam Altman wants to be the CEO of an AI company. Arthur Mensch wants to be the CEO of an AI market — and the difference is that markets are made of rules, and rules can be written. While Altman lobbies against regulation, Mensch lobbies for the right regulation: the kind his competitors can satisfy only at a cost they can’t bear, and he doesn’t pay.

It’s the most sophisticated regulatory game in AI right now. Calling it hypocrisy misses how good it is.

And you’re the one paying for it

A strategy this elegant still sends someone an invoice. Several someones.

French taxpayers buy a domestic-preferred model so a French logo can sit on the contract. European consumers may soon pay a 1.5% levy that, as input taxes typically do, flows at least partly downstream into prices. The French army, if Mensch gets his way, runs on the home-team model rather than the strongest available tool, because the strongest is American. And every European who wants the AI Act’s high-risk protections has to wait until December 2027 for them, courtesy of a delay sold as a boost to competitiveness.

None of that is sovereignty. It’s a subsidy — routed through procurement and regulation instead of a line item, paid to one company, and narrated in the language of national dignity so that questioning it feels unpatriotic.

The bill comes due in breaches

And the steepest cost isn’t measured in euros. France is, right now, among the most cyberattacked countries in the world, and the diagnosed root cause is a “remediation gap” — institutions that keep finding vulnerabilities and keep failing to patch them in time. [27] The identity-document agency ANTS leaked up to 19 million passport and license records; [28] much of the spree was carried out by teenagers. [29] And the marquee name on Mistral’s own customer list, France Travail, is the largest data breach in French history — tens of millions of job-seekers exposed and a €5M regulator fine in January 2026. [30] Sovereignty delivered the French logo on the contract. It did not deliver security.

Which is what makes the Mythos warning go sour. A remediation gap is exactly what a frontier vulnerability-hunting model closes — and within days of the hearing, Bloomberg reported that Mistral is building its own such model for European banks shut out of Mythos. [31] So the warning isn’t “keep dangerous vulnerability-hunters out of France.” It’s “keep the American one out, while we build and sell ours.” There is a strong security case for not allowing any foreign model to crawl through the defense code. But weigh it honestly: a country hemorrhaging data because it can’t find its own holes fast enough is being counseled to refuse the best tool for finding them. Denying France the Mythos audits may be the larger sovereignty risk, and the sovereignty argument and the product roadmap turn out to be the same document.

The bet, and how we’ll know

Here’s the falsifiable part — the thing to actually watch, rather than the rhetoric to argue about.

If Mistral’s strategy is sound, the policy moat buys enough time for the product to close on the frontier and for the company to break out of the Macron orbit into real, arm’s-length European enterprise demand. So the test is simple: by the middle of 2027, does Mistral’s flagship customer list contain names that aren’t tied to the French state or Macron’s circle? A Volkswagen. A Siemens. A bank in Milan or Madrid that chose Mistral in a competitive bake-off and paid full freight.

If yes, Mensch will have pulled off one of the great industrial strategy plays of the decade — using the rulebook to buy time to build a real business.

If no — if in two years the list is still France Travail and friends — then the policy moat was never a bridge to a product. It was the product. And policy moats have a half-life measured in election cycles. A non-Macroniste Élysée could simply stop steering the contracts. Or Brussels could decide that France’s domestic procurement, routed so reliably to one favored national champion, is a selective advantage that runs into EU state-aid rules. That is a separate exposure from the “European preference” now being drafted at the EU level — which, awkwardly for the sovereignty story, is a French-led project Mistral wants. Either way, the moment the political weather changes, the whole structure reprices overnight.

Mensch says Europe has two years to avoid becoming America’s vassal. He may be right. But Europe should be careful not to mistake one clever founder’s moat for a continent’s sovereignty — and careful, too, about who exactly it’s being asked to be sovereign for.

Mensch, in German, means “man.” In Yiddish, it came to mean something better — a person of integrity, someone who does the right thing. Watching Monsieur Mensch this spring, the open question isn’t whether he’s brilliant. It’s the kind of mensch France thinks it’s buying.

Notes

[1] Assemblée nationale (official video) — Vulnérabilités systémiques dans le secteur du numérique: audition de M. Arthur Mensch (May 12, 2026; the near-empty room is visible on the official feed)

[2] Fortune — Anthropic says it’s testing “Mythos,” a powerful new AI model representing a “step change” in capabilities (the model’s full name per Anthropic’s April 7, 2026 system card is “Claude Mythos Preview”)

[3] The Decoder — Mistral CEO Arthur Mensch warns France against letting Anthropic’s Mythos scan military code bases (also the source for Mensch’s concession that Mistral’s or Chinese models could find the same vulnerabilities)

[4] Council of the EU — Artificial intelligence: Council and Parliament agree to simplify and streamline rules (official press release, May 7, 2026)

[5] Le JDD — Fiscalité, énergie, dépendance: le patron de Mistral AI alerte sur les faiblesses de l’Europe (May 15, 2026)

[6] Assemblée nationale (official video) — audition de M. Arthur Mensch, “vassal state” / two-year warning (same hearing, May 12, 2026)

[7] IT Pro — Mistral CEO calls for AI cultural levy (reporting Mensch’s Financial Times op-ed, March 20, 2026)

[8] Boursorama (AFP) — Taxe Zucman: le patron de Mistral demande “plus de justice fiscale” tout en préservant la compétitivité de la France

[9] Mistral AI / Hugging Face — Mistral-Medium-3.5 model card (official benchmarks: dense 128B parameters; 77.6% SWE-Bench Verified)

[10] TechSifted — Mistral Medium 3.5 review (independent comparison: 77.6% vs. Claude Sonnet 4.6’s 79.6% on SWE-Bench Verified; ~half the per-token price; open weights under modified MIT terms)

[11] Maddyness UK — Mistral AI on track to reach one billion euros in revenue by 2026 (Mensch at Davos; ~$400M ARR, €1bn a target for the year, not booked revenue)

[12] Reuters (via Yahoo Finance) — OpenAI CFO says annualized revenue crosses $20 billion in 2025 (ARR and “annualized run-rate” are adjacent but not identical metrics; the orders of magnitude make the comparison regardless)

[13] The Decoder — Mistral CEO Arthur Mensch warns France… (Mensch’s statement that US investors hold under 30% and founders retain strategic control)

[14] Sifted — Meta and DeepMind alumni raise €105m seed round to build OpenAI rival Mistral (founders’ LLaMA / DeepMind pedigree; École Polytechnique)

[15] TechCrunch — Alan’s founder role in Mistral’s origin story (Samuelian-Werve and Gorintin as connectors; Cédric O founding advisor)

[16] TechCrunch — France’s Mistral AI blows in with a $113M seed round at a $260M valuation (full investor roster incl. Bpifrance, Niel, Saadé, Schmidt)

[17] Caisse des Dépôts (official) — Souveraineté numérique: le groupe Caisse des Dépôts s’adjoint les services de Mistral AI (May 2026)

[18] TotalEnergies (official) — TotalEnergies to collaborate with Mistral AI to increase the application of AI in its multi-energy strategy

[19] Stellantis (official) — Stellantis and Mistral AI expand their collaboration to accelerate enterprise-wide AI adoption (Oct 2025)

[20] CMA CGM Group (official) — CMA CGM completes acquisition of Altice Media (BFM-TV, RMC; group also owns La Provence, La Tribune, Corse Matin)

[21] Puremédias / Ozap — “La Provence”: Rodolphe Saadé met à pied le directeur de la rédaction après une Une qui aurait déplu à l’actionnaire (March 2024; “political pressure” is the journalists’ union’s characterization)

[22] The Maritime Executive — CMA CGM Group: new custom-designed AI solutions from Mistral AI (€100M investment + five-year $110M contract, April 2025; figures as reported in mixed currencies)

[23] CMA CGM / PR Newswire (official) — CMA CGM embarks on a strategic partnership with Google to deploy AI across all shipping, logistics, and media activities (the separate $150M Google deal, 2024 — predating the Mistral deal)

[24] Angelo Lima (hearing analysis, cross-checked to the Assemblée nationale feed) — What Arthur Mensch told the French National Assembly (Mensch’s claim that 70% of Mistral revenue is non-French; secondary analysis of the official hearing)

[25] Cybernews — Experts warn France “operationally paralyzed” as cyberattacks mount in 2026 (single-source characterization; “among the most-attacked” softened from “second-most” pending an ANSSI/CNIL primary)

[26] Cybernews — ANTS hack: 19 million records exposed in French ID agency breach (April 2026)

[27] The Record — French police arrest suspected hacker behind dozens of data breaches (HexDex, 21, ~100 breaches incl. sports federations, Education Ministry, SIA)

[28] CNIL (official) — Data breach: France Travail fined €5 million (Jan 22, 2026)

[29] Bloomberg — European Banks Explore Mistral AI’s Alternative to Anthropic’s Mythos Model (May 13, 2026; Mistral developing its own vulnerability-detection model for European banks shut out of Mythos)

Zero for Three

Julien Simon — Tue, 19 May 2026 17:35:04 GMT

The Trump-Xi summit in Beijing on May 14-15 was supposed to trade chips for rare earths. Washington would ease restrictions on advanced AI accelerators heading to China. Beijing would open the licensing gate on the rare earths and functional materials that feed every semiconductor fab on the planet. Two interlocking grips, mutually released.

No deal.

On May 15, the US Trade Representative sat down with Bloomberg Television. Asked whether semiconductor export controls had come up during the summit that had just concluded, Jamieson Greer answered without hedging: “This was not a major topic of discussion at the bilateral meeting. We did not talk about chip export controls at the meeting.”[1]

One side of the trade is denied at the principal level.

Last week, this newsletter previewed the summit with three specific tests.[2]

An exemption from China’s case-by-case licensing for the rare earths used in advanced AI chips (sub-14-nanometer logic, 256-layer memory).
A Chinese commitment to replace case-by-case review with blanket export approvals for the functional materials flowing through every semiconductor fab: polishing slurries, sputtering targets, and non-military magnets.
A mutual rollback of the October 2025 rule under which Beijing can require an export license for any foreign-made product anywhere in the world that contains more than 0.1% Chinese-origin rare earth content.

Three tests. Three strikes. The dependency the trailer described — rare earths as the layer beneath chips, cloud, and models — survived the summit intact.

The two readouts

The White House Fact Sheet of May 17 announced that China would “address U.S. concerns regarding supply chain shortages related to rare earths and other critical minerals, including yttrium, scandium, neodymium, and indium,” and would “address U.S. concerns regarding prohibitions or restrictions on the sale of rare earth production and processing equipment and technologies.”[3] The verbs are “address” and “concerns.” That is the language of diplomatic intention. It is not the language of a regulatory commitment.

The Chinese readouts said nothing about rare earths.

Xi Jinping’s statement, issued by the Ministry of Foreign Affairs on May 14, covered “strategic stability,” agricultural trade, and Taiwan.[4] The MOFCOM follow-up on May 17 discussed tariff reductions and announced two new bilateral bodies: a US-China Board of Trade and a US-China Board of Investment.[5] CNBC noted the gap on May 18: “The Chinese statement also did not mention rare earths, while the U.S. said China would address rare earth shortages.”[6]

This is a pattern, not an accident. At Busan in October 2025, the White House announced that China had committed to “issue general licenses valid for exports of rare earths, gallium, germanium, antimony, and graphite for the benefit of U.S. end users and their suppliers around the world.” Beijing never confirmed that framing in writing. The gate has stayed narrow. The EU Chamber of Commerce in China reported that MOFCOM approved fewer than 15% of rare-earth license applications submitted by EU firms in 2025, leading to seven production stoppages in August and 46 expected in September.[7] As of December 2025, three Chinese exporters held streamlined general licenses: JL Mag Rare Earth, Ningbo Yunsheng, and Beijing Zhong Ke San Huan.[8]

When one side announces a concession that the other side does not acknowledge, the announcement is not a commitment. It is a press release on one side and regulatory silence on the other.

Beijing has now declined twice, at Busan and at Beijing, to confirm in writing the rare-earth language the White House has put out. The licensing gate has stayed closed throughout.

Chips off the table

The first two tests required leader-level negotiated outcomes on chip-specific carveouts.

Test 1: an exemption from MOFCOM’s case-by-case licensing for the rare earths used in advanced AI chips.
Test 2: a Chinese commitment to replace case-by-case review with blanket export approvals for functional semiconductor materials.

Greer’s answer closes the path by which either could have happened. If chip export controls were not discussed at the leader level, no carveout for sub-14-nanometer chips was negotiated. No blanket-approval commitment was extracted. The “address U.S. concerns” language in the fact sheet is an aspiration. The rules in force are still MOFCOM Notice 61 of October 9, 2025, with the case-by-case review for sub-14-nanometer logic and 256-layer memory currently sleeping under Notice 70.[9] Neither was rescinded. Neither was modified. Neither was discussed.

Jensen Huang said the quiet part out loud at a Citadel Securities event two weeks before the summit: “In China, we have now dropped to zero. Conceding an entire market the size of China probably does not make a lot of strategic sense, so I think that has already largely backfired.”[10] The remark concerned the H200, the AI accelerator Nvidia is licensed by the US to ship to roughly ten approved Chinese firms. Among the buyers are Alibaba, Tencent, ByteDance, and JD.com; among the distributors, Lenovo and Foxconn. Each shipment carries a 25% remittance to the US Treasury and physical transit through US territory for inspection.[11] Trump told reporters aboard Air Force One that the Chinese firms had “chose not to” buy “because they want to develop their own.”[12]

Three days after returning from Beijing, Huang reversed course. Asked at a Dell event in San Francisco on May 18 whether the Chinese market would reopen to Nvidia, he answered: “My sense is that over time, the market will open.”[13] Reuters framed the H200 file more pointedly: “Nvidia has received licenses from the U.S. government to sell its H200 chips but has not received approval from Chinese officials who are fostering China’s own chip suppliers.”

Huawei Ascend, Cambricon, and Biren are not waiting for Nvidia’s return. They are closing the gap while the H200 file sits in MOFCOM’s review queue.

Beijing has no need to bargain for chips it is increasingly producing itself. Rare earths are what buy Beijing the time.

The gate runs both ways. Beijing can stop delivery of US-approved chips through State Council guidance just as Washington can stop delivery of advanced GPUs through Commerce Department rules. The coercion stack the trailer described is not one-sided. Markets priced it within hours. Nvidia closed at $225.04 on May 15, down 4.20%, erasing roughly $170 billion of market value intraday.[14]

The cliff

The third test asked whether the summit would rescind China’s October 2025 extraterritorial 0.1% rule, the provision under MOFCOM Notice 61 that lets Beijing require its approval to export any foreign-made product anywhere in the world that contains more than 0.1% Chinese-origin rare earth content. The rule was suspended on November 7, 2025 under MOFCOM Notice 70. The suspension expires November 10, 2026.[15]

The summit produced no announcement closing this cliff. No language in the White House fact sheet addresses Notice 61 specifically. No Chinese regulatory action followed the summit. The cliff remains live and is scheduled to re-arm automatically in six months.

This is the deepest finding of the summit. Beijing’s most powerful tool was not given up. It was not contested. It was not even discussed at the leader level, by the USTR’s own account. It was left in place, suspended on a calendar timer. November 10 is when the paper rules catch up to the practice on the ground. MOFCOM has held approval rates below 15% throughout the suspension; the licensing gate has been closing in operation while sleeping on paper. The cliff is when the paper wakes up.

The next signal arrives in September, when Xi is scheduled to visit Washington during United Nations General Assembly week. If he arrives without a renewed suspension already in writing, the cliff becomes the central deal of the cycle. APEC in Shenzhen follows in November, two weeks after the suspension expires. The summit calendar has been arranged around the regulatory calendar, not against it.

What markets read

Lynas Rare Earths fell from A$19.90 on May 13 to A$17.95 on May 15, a 9.8% decline in two trading sessions.[16] MP Materials rallied to $61.27 on May 15, then dropped 7.5% to $56.67 on May 18 as the “tactical truce” reading took hold.[17] The supply-side equities priced the same conclusion the Greer interview made plain: nothing had moved underneath. The rally that built into the summit was unwound by what the summit failed to produce.

The ex-China spot prices tell the same story. Terbium oxide averaged $1,140 per kilogram FOB in late April; dysprosium oxide averaged $292 per kilogram.[18] Inside China, the same materials cleared at roughly $895 and $125 per kilogram, respectively — the Western buyer pays a quarter more for terbium and more than double for dysprosium. That spread is the cost of the licensing gate, what the marginal Western buyer pays when MOFCOM approves fewer than 15% of applications. It did not collapse during summit week. It widened.

The Western-buyer premium is the price the supply chain pays for the dependency the trailer described, and the summit confirmed that price is staying in place at least through 2028.

The supply-side response continues on its own timeline, indifferent to the summit. MP Materials begins commissioning heavy rare earth separation at Mountain Pass in mid-2026, targeting 200 metric tons per year of dysprosium and terbium combined.[19] Lynas continues its Malaysian expansion. Iluka’s Eneabba refinery is now targeted for 2027 commissioning, slipping from earlier 2026 guidance.[20] Combined Western heavy rare earth capacity at full ramp is on the order of 600 metric tons per year by 2028, a fraction of the heavy rare earth content embedded in the 58,000 tons of permanent magnets China exported in 2024 alone.[21]

The summit did not change any of these timelines. It did not need to. The diversification is happening regardless. The cliff is on the calendar regardless.

What the summit settled

The trailer argued that rare earths form the fourth layer of the AI infrastructure coercion stack, under chips, cloud, and models. The Beijing summit tested that argument against three concrete questions. Each question required a specific regulatory action that would have shown leader-level willingness to ease the rare earth grip. None of the three occurred.

The summit produced agricultural commitments, Boeing aircraft orders, beef market access restoration, and two new bilateral talking shops. These are real diplomatic outputs. A lower geopolitical temperature reduces tail risk; the next confrontation is postponed. But they are the deliverables of a managed-stability summit, not of a rebalancing of the underlying dependence. Greer’s sentence confirms the boundary: leader-level discussions did not reach the rules that hold the rare earth grip in place. Working-level talks may continue. Without principal-level direction, MOFCOM has no political cover to dismantle the rules it issued under leader-level authority five months ago.

The two readouts confirm the consequence: what one side announces is not what the other side will enforce.

Six months from now, on November 10, 2026, MOFCOM Notice 70 expires. Either Beijing extends the suspension before that date, in writing, or the extraterritorial 0.1% rule re-arms automatically. The two scheduled summit appearances — Xi in Washington in September, Trump and Xi at APEC Shenzhen in November — are the venues where that decision will be made.

Last week, this newsletter set three tests for the Beijing summit. The summit returned each test unchanged. The exposure the trailer described was not negotiated away. It was scheduled forward.

The chip war happens in press releases. The war underneath happens on the regulatory calendar.

Notes

[1] Jamieson Greer, US Trade Representative, Bloomberg Television interview, May 15, 2026, as reported by Reuters: “Chip export controls not major topic in China talks, US trade rep Greer tells Bloomberg News”.

[2] “Below the Silicon”, The AI Realist, May 13, 2026.

[3] “Fact Sheet: President Donald J. Trump Secures Historic Deals with China, Delivering for American Workers, Farmers, and Industry”, The White House, May 17, 2026.

[4] “President Xi Jinping Holds Talks with U.S. President Donald J. Trump”, Ministry of Foreign Affairs of the People’s Republic of China, May 14, 2026.

[5] “White House touts deals on soybeans and rare earths after Trump-Xi summit, while China talks up tariff cuts”, CNBC, May 18, 2026.

[6] Ibid.

[7] “False sense of security: European complacency on rare earths is the wrong answer to the US-China trade truce”, European Union Institute for Security Studies, citing EU Chamber of Commerce in China data, accessed May 2026.

[8] “China issues first batch of streamlined rare earth licences”, Mining.com, December 2, 2025.

[9] MOFCOM Notice 61 of October 9, 2025; MOFCOM Notice 70 of November 7, 2025, suspending the extraterritorial provisions until November 10, 2026. Analysis: Pillsbury Winthrop Shaw Pittman, “China Suspends Export Controls on Certain Critical Minerals and Related Items”; Clark Hill, “China Hits ‘Pause’ on Rare-Earth Export Controls and What it Means for Supply Chains”.

[10] Jensen Huang, remarks at Citadel Securities event, early May 2026, as reported by Tom’s Hardware: “Trump says China is blocking Nvidia H200 purchases despite US approval — says country ‘chose not to’ sanction purchases, pushing homegrown chips instead”.

[11] H200 framework details per Implicator, “Nvidia H200 China Deliveries Stalled After Trump-Xi Summit”, May 2026.

[12] Trump remarks aboard Air Force One, May 15, 2026, as reported by Tom’s Hardware (op. cit.).

[13] “Nvidia CEO says he believes China market will open over time”, Reuters, San Francisco, May 18, 2026 (Bloomberg Television interview at Dell event).

[14] “Delayed Chinese approval for H200 chips sends Nvidia stock down 4.20%”, Traders Union, May 15, 2026, citing Google Finance.

[15] Pillsbury Winthrop Shaw Pittman, op. cit.; Clark Hill, op. cit.; MOFCOM Announcement No. 70 of 2025.

[16] Lynas Rare Earths (ASX: LYC) close prices per ASX official data, accessed via StockAnalysis.com.

[17] MP Materials (NYSE: MP) close prices per Morningstar; “Lynas Tumbles as ‘Trump–Xi Truce’ Lifts False Calm Over Rare Earths”, Rare Earth Exchanges, May 18, 2026.

[18] Rare earth FOB spot price data per Rare Earth Exchanges market reports, May 2026; “Rare Earth Market Outlook May 2026: Prices Fall”, Rare-earth-mining.com.

[19] MP Materials Q3 2025 earnings release; “Pentagon-Backed MP Materials to Start Rare Earths Plant in 2026”, Bloomberg, November 6, 2025.

[20] Iluka Resources Q1 2026 financial reporting; “Eneabba Rare Earths Refinery Funding Update”, Iluka Resources ASX release, December 6, 2024.

[21] IEA, “Global Critical Minerals Outlook 2025”, October 2025, citing 2024 Chinese permanent magnet export volumes.

Where the HALEU bet actually pays

Dante — Sat, 16 May 2026 01:05:28 GMT

In Post 1, I argued that the tightest knot in the US nuclear fuel cycle is HALEU enrichment — high-assay low-enriched uranium, the 5–19.75% U-235 fuel that every advanced reactor in the US needs for its first core. There is no commercial Western HALEU supply at scale. Until 2024, it all came from Russia.

There are exactly two US-listed names with direct HALEU exposure. One of them is the obvious pick — funded by the Department of Energy, owned by ~80% of institutions, up 200% in the last twelve months. The other is a $700M micro-cap whose enrichment subsidiary you’ve probably never heard of.

Thanks for reading! Subscribe for free to receive new posts and support my work.

I think the smaller one is the better trade today. Not because the bigger name is bad, it isn’t, but because the market has already priced its bull case, and the smaller name is the only fundamentally cheap HALEU option in the US public market.

Here’s the work.

The two listed names

Centrus Energy (NYSE: LEU) is the name. It is the only US-owned commercial enricher and one of three companies funded by the DOE’s January 2026 $2.7B HALEU and LEU enrichment award.

Created with TradingView

Centrus’s American Centrifuge Operating subsidiary received $900M for HALEU production at Piketon, Ohio, and produced the first ~900 kg of US-origin HALEU. Centrus raised 2026 revenue guidance to $450–500M in its Q1-26 print (https://www.prnewswire.com/news-releases/centrus-reports-first-quarter-2026-results-302763250.html), and is sitting on a $3.9B contracted backlog. Market cap $4.37B.

ASP Isotopes (NASDAQ: ASPI) is the optionality name. Its core business is laser-based isotope enrichment for medical (Mo-99 path), semiconductor (Silicon-28), and pharmaceutical applications. The HALEU exposure is via a wholly-owned subsidiary, Quantum Leap Energy (QLE), which holds a long-term HALEU offtake agreement with TerraPower plus a $22M conditional loan, and which signed a non-binding MOU in March 2026 (https://www.stocktitan.net/news/ASPI/) with a major US nuclear power operator for HALEU, LEU+, uranium conversion, and deconversion services. Market cap $690M, of which $333M is cash as of December 2025.

Created with TradingView

The numbers side-by-side:

If you only look at the top three rows, Centrus is obviously the better company. It has revenue, it has guidance, it has DOE money, and the chart has only gone up. ASP Isotopes is small, loss-making, and the stock has gone nowhere for a year while the rest of the nuclear thematic has rallied.

But the bottom three rows are where the trade actually lives.

Where the value actually is

I built probability-weighted scenario DCFs on both names.

Centrus scenarios (my P-weights):

Probability-weighted fair value: ~$169/share. Current price $222. That gap doesn’t mean Centrus is overvalued in any absolute sense — it means the market is pricing the bull-case outcome at roughly 60–70% probability, versus my 30%. Either I’m wrong about the probabilities, or the market is paying ahead of execution. Both can be true at once.

ASP Isotopes scenarios:

Weighted operating NPV ~$0.74B, plus $333M cash on hand → fair equity ~$1.07B. Current market cap $690M. That’s a +55% asymmetric setup, with the cash backstop providing a soft floor.

Three observations from running these numbers.

First, the asymmetry runs in opposite directions. Centrus’s bear case is –80% from current; ASPI’s bear case is –78%. That looks similar. But ASPI’s bear case still leaves you with $150M of NPV against $333M of cash, so the actual equity floor is higher than the bear-NPV number suggests. Centrus’s bear case has no cash floor — it’s a working business, and the bear case is operational impairment. The shape of the downside is different even when the magnitude is similar.

Second, insider ownership is doing real work. ASPI’s 13.5% insider ownership versus Centrus’s 3.3% (and NuScale’s 0.4%) is the kind of management-alignment signal that tends to matter at exactly the inflection moment ASPI is approaching — the 2026 commercial-shipment year. Founders who own the company tend not to price-collapse it on the first dilutive raise.

Third, the TerraPower offtake is a third-party validation that the market hasn’t internalized. TerraPower is privately held, well-funded, and has every incentive to source HALEU from the most credible producer it can find — including, in theory, from Centrus directly. The fact that TerraPower committed offtake terms and a $22M conditional loan to QLE specifically tells you the market is too pessimistic on QLE’s technical credibility.

The catalysts most people aren’t tracking

Both names have a thick catalyst calendar through the end of 2027. The two that matter most for getting positioning right are very specific.

For Centrus, the binary is the Q2-26 print in August, where management will disclose Piketon HALEU production cadence in kg/month run-rate. The current implied schedule has Piketon ramping toward roughly 6 metric tons per year of HALEU output by 2028. That implies a run-rate around 80 kg/month at maturity. If the August print shows the production cadence tracking below ~40 kg/month — half the implied path — the bear case activates fast and the multiple compresses with it. If it tracks at or above 60 kg/month, the bull case stays alive, and the stock probably runs further before consolidating.

For ASPI, the binary is QLE’s first HALEU pilot output disclosure, expected in Q4 2026. This is the cleanest existence proof the market has been waiting for. If QLE produces enriched material on schedule, the bull-case probability re-weights upward — and at a $690M market cap, the re-rating math is significant. If QLE misses by more than two quarters, the bear-case probability dominates, and the cash floor becomes the only thing holding the stock up.

Two other dates worth flagging:

The Russia uranium waiver expiry in 2027 — under the Prohibiting Russian Uranium Imports Act is a structurally positive catalyst for both names, but more so for Centrus, which loses Russia LEU revenue but gains tighter pricing on its US-domestic enrichment.
Centrus’s first commercial HALEU shipments, targeted for Q2 2027, are the bull-case proof point. If those land on schedule with TerraPower, X-energy, or Kairos as the first counterparty, Centrus becomes harder to fade.

What this means for stock-picking

1. Centrus is the right structural answer to the wrong question. “Which name has the most HALEU exposure?” gets you to LEU. “Which HALEU name offers asymmetric upside at current prices?” gets you to ASPI. Both questions are valid, but only one is a trade.

2. The size discount is doing the lifting. ASPI is small enough that institutional ownership hasn’t yet crowded out the asymmetry — 51.6% versus Centrus’s 79.3%. The same business at $4B of market cap would already be priced in line with Centrus.

3. Optionality positions need explicit exits. I would hard exit if QLE produces no enriched HALEU material by year-end 2026, or if TerraPower offtake terms are publicly restructured downward. The cash backstop makes the position survivable; the falsification triggers make it disciplined.

4. Centrus is a buy on pullback, not a buy on chase. I would build a position below $165, where the implied bull-case probability falls into a range that matches my analytical view. Above $200, the math doesn’t work even on aggressive assumptions.

5. Both names will be revisited together every quarter. The catalyst structure is interlocking — Centrus’s Piketon cadence and ASPI’s QLE pilot are the two existence proofs that determine whether US-owned HALEU is real or theoretical. Watching only one of them gives you half the signal.

What’s coming

Post 3 — Conversion: the bottleneck nobody can play directly. Why ConverDyn / Solstice’s Metropolis Works is the single tightest commercial chokepoint in the chain, and how an HON Advanced Materials spin (rumored, not confirmed) would unlock the cleanest pure-play if it ever lists.
Post 4 — *Picks-and-shovels*. The mid-cap that passes the 4-variable filter in two of seven segments simultaneously, and why I think it’s a satellite rather than a core position, despite that.
Post 5 — *SMR demand*. Why I think the post-CFPP-cancellation reset on NuScale is more advanced than the market recognizes — and why that doesn’t yet mean I’m long.
Post 6 — *The book*. Five names, position-sized, with explicit falsification triggers for each.

Subscribe if you want this in your inbox over the next weeks.

Further reading

- DOE — *[Awards $2.7B to restore American uranium enrichment](https://www.energy.gov/articles/us-department-energy-awards-27-billion-restore-american-uranium-enrichment)* (Jan 6, 2026)

- Centrus Energy — *[Q1-26 results press release](https://www.prnewswire.com/news-releases/centrus-reports-first-quarter-2026-results-302763250.html)*

- ASP Isotopes — *[Q3-25 10-Q via SEC EDGAR / StockTitan summary](https://www.stocktitan.net/sec-filings/ASPI/10-q-asp-isotopes-inc-quarterly-earnings-report-b8eb89c3dea3.html)*

- World Nuclear News — *[US enrichment funding recipients flesh out plans](https://www.world-nuclear-news.org/articles/us-enrichment-funding-reactions)*

- ANS Nuclear Newswire — *[DOE awards $2.7B for HALEU and LEU enrichment](https://www.ans.org/news/article-7652/doe-awards-27b-for-haleu-and-leu-enrichment/)*

- Prohibiting Russian Uranium Imports Act ([P.L. 118-62](https://www.congress.gov/bill/118th-congress-house-bill/1042), May 2024)

---

*Capacity Factor is a six-part series on US nuclear fuel-cycle equities.

Thanks for reading! Subscribe for free to receive new posts and support my work.

Below the Silicon

Julien Simon — Tue, 12 May 2026 04:55:02 GMT

Inside a TSMC fab in Taiwan, at this moment, an Nvidia Blackwell die is being polished flat to within fractions of a nanometer. A few meters away, a lithography scanner is exposing the next wafer with extreme-ultraviolet light generated by vaporizing tin droplets with a 30-kilowatt laser, 50,000 times a second. This is the most precise industrial process in human history.

It runs on rare earth elements. China mines 70% of the world’s supply and refines 91% of it. America refines less than 1%.[1]

On Wednesday, the President of the United States flies to Beijing to negotiate continued access.

The recipe

The first step is the polish. Before a chip can be patterned, the silicon wafer has to be made flat to within fractions of an atom across an area the size of a dinner plate. This is done with a slurry of fine abrasive particles. The abrasive is cerium oxide, a rare-earth compound made almost entirely in China.[2]

Next comes lithography: the printing of the chip’s pattern onto the wafer using extreme-ultraviolet light. Three rare earths appear in the light path. Erbium is doped into the optical fibers that amplify the laser’s pulse. Terbium forms a special crystal — terbium gallium garnet — that lets light through one direction and blocks it in the other, protecting the laser from its own reflections. Thulium will be used in the next generation of these lasers, currently under development at Lawrence Livermore National Laboratory. The lithography machines that will use them are built by ASML — the Dutch company that supplies every advanced fab in the world.[3]

Between exposures, a metrology laser checks that the pattern came out right. The crystal at its heart is often Nd:YAG — yttrium aluminum garnet, doped with neodymium.[4] After exposure, the patterned features are etched into the silicon by corrosive fluorine and chlorine plasmas. To survive the plasma, the etch chamber is lined with yttrium oxide.[5]

Then comes deposition: laying down the metal films that become the chip’s wiring. The way to deposit a metal film is to put a solid block of it (a “sputtering target”) into a vacuum chamber and knock atoms off it with ions. Most sputtering targets are pure metals — copper, tungsten, titanium — but the targets that lay down the high-performance dielectric layers and certain barrier materials contain rare earths, most often yttrium and lanthanum.[6] Finally, the chip is packaged. In a modern AI accelerator, packaging means stacking multiple silicon dies into high-bandwidth memory — the “HBM” the industry talks about — and bonding them with millions of microscopic copper joints. Between each bonding step, the surfaces are polished flat again, with the same cerium oxide slurry that started the process.

Then the chip leaves the fab. It arrives at a hyperscaler datacenter on a server board cooled by spinning fans. The motors driving those fans are made from neodymium magnets — alloys of iron, boron, and neodymium, almost always with a small percentage of dysprosium or terbium added to keep them magnetic at high temperatures.[7] The same magnets power the hard drives, the liquid-cooling pumps that keep modern GPU racks from melting, and every motorized actuator in the rack.

Behind the chip, the tools that fabricated it run on the same chemistry. Every lithography scanner, every ion implanter, every etch tool — the precision motors are all neodymium magnets, with the highest-performance versions in fab equipment carrying up to ten percent dysprosium by weight.[8] The magnetic bearings on many cleanroom and vacuum pumps are neodymium too. So are the robotic arms that move wafers between tools.

A modern AI accelerator is, in material terms, a tightly packed assembly of silicon, copper, and rare-earth elements. The silicon and copper have multiple commercial sources. The rare earths do not. Substitutes exist for some uses but perform worse — there is no commercial alternative to cerium oxide at advanced lithography nodes, and no replacement for the heavy rare earths in high-temperature magnets.

The dependency

China controls roughly 70% of global rare earth mining, 91% of separation and refining, and 94% of the world’s strongest permanent magnets — the kind used in motors, generators, and precision equipment.[9] The geological deposits that yield commercial quantities of the heavy rare earths used in those magnets — dysprosium and terbium — are a specific type of clay-bound ore (geologists call them ion-adsorption clays), found in commercial concentrations only in southern China and northern Myanmar. Together, they account for more than 99% of the world’s heavy rare-earth feedstock, with Myanmar production largely flowing into Chinese refineries.[10]

Last year, every gram of terbium America imported came from China. So did every gram of holmium, and every gram of lutetium. Net U.S. import reliance on heavy rare earths is 100%; the small share nominally sourced from third-country processors in Estonia, Japan, and Malaysia is itself derived from Chinese feedstock.[11]

This is the layer beneath the chip war. “Access, Disable, Destroy” mapped a three-switch model of AI infrastructure coercion: chips at the silicon layer, cloud at the infrastructure layer, models at the application layer.[12] The materials layer sits beneath all three. China has commercial and diplomatic reasons not to embargo rare earths outright — its producers want the revenue, and a formal cutoff would accelerate Western diversification.

The leverage operates instead through individual export approvals: China’s Ministry of Commerce (MOFCOM) requires a case-by-case license for any shipment of rare earths destined for advanced semiconductors. The trigger categories are logic chips at process nodes below 14 nanometers (every AI accelerator made today) and memory stacked with more than 256 layers (the high-bandwidth memory inside those accelerators). This licensing regime remains active throughout the November 2025 suspension.[13] A single review can stall a shipment indefinitely, even without a formal export ban. Diversification at the binding constraint takes time that the AI capex cycle does not have: industry estimates place full onshoring of heavy rare-earth refining at 5 to 7 years.[14]

The response

On February 2, 2026, Donald Trump announced Project Vault — a $12 billion strategic reserve of rare earth elements, modeled on the Strategic Petroleum Reserve that has insulated the United States against oil shocks since the 1970s. The signal: the administration now treats rare earth dependency as a national security exposure on par with energy security. The structure is a $10 billion, 15-year loan from the Export-Import Bank, plus roughly $1.7 billion of private capital, with procurement handled by three commodities trading houses.[15] They buy imported oxides and metals on behalf of civilian-sector manufacturers, who can draw down their allocations in a disruption and replenish them when supply normalizes.[16] At blended heavy rare earth prices — terbium oxide at $1,010 per kilogram, dysprosium at $239 — $12 billion is a serious buffer against price spikes and short interruptions.

It does not address the binding constraint. The United States has no commercial-scale heavy rare earth separation capability operating today.[17] MP Materials’ Mountain Pass heavy rare earth circuit, backed by a $150 million Department of War loan, targets 200 metric tons per year of dysprosium and terbium production from mid-2026.[18] Lynas, the only commercial-scale producer of separated heavy rare earths outside China, is expanding its Malaysia facility to a full suite of heavy rare earths within two years.[19] Combined Western capacity at full ramp is on the order of 600 metric tons per year of dysprosium and terbium by 2028 — a fraction of the heavy rare-earth content embedded in the 58,000 tons of permanent magnets China exported in 2024 alone.[20]

What Project Vault stockpiles is what comes out of the country it was designed to protect against. The reserve relocates the dependency one step upstream — from end-use to inventory — without changing the upstream geography. Meanwhile, the chokepoint is moving. In March 2026, Shenzhen launched a state-coordinated R&D program for domestic rare-earth-based polishing slurries — the same cerium oxide chemistry the wafer polish opens with, currently dominated by U.S. and Japanese suppliers.[21] The pattern is consistent: control raw materials upstream, control separation in the middle, and as Western capacity catches up at the upstream layers, move downstream into the higher-margin functional materials. Each Western response addresses a layer that the chokepoint has already moved past.

What to watch on Friday

The summit will produce announcements. Boeing purchases. Agricultural commitments. A bilateral Board of Trade. Possibly an extension of the November 2025 suspension beyond the November 10, 2026 expiry, framed as continued de-escalation.[22] None of these alters the materials layer.

Three things would. First, an exemption from MOFCOM’s case-by-case licensing for the rare earths used in advanced AI chips — the sub-14-nanometer logic and 256-layer memory categories now requiring individual Chinese approval. This would dissolve the most direct chokepoint. Second, a commitment to blanket licenses rather than per-shipment review for the functional materials flowing through semiconductor manufacturing: polishing slurries, sputtering targets, and non-military magnets. That would turn managed dependency into something predictable. Third, a mutual rollback of China’s October 2025 extraterritorial rule, which lets Beijing license any foreign-made product anywhere in the world that contains more than 0.1% Chinese-origin rare earths. That rule is currently suspended; rescinding it would close the November cliff rather than postpone it.

None of these is on the agenda that the U.S. Trade Representative previewed in April.[23] The summit is one whose success is measured by the absence of breakdown, not by the resolution of substance.

Every Blackwell, every MI300, every TPU, every Trainium, every HBM stack from Samsung and SK Hynix carries this recipe inside it. The rare earths are extracted from Chinese land. The chips are built by TSMC on Chinese land — or so they say.

Beijing claims both halves as Chinese. It controls only one. By Friday, the President will have negotiated with the half it controls. Taiwan, where the chips are made, will be the silence in the room.

Notes

[1] Mining figures: U.S. Geological Survey, Mineral Commodity Summaries 2025: Rare Earths, January 2025. China mined 270,000 metric tons of REO equivalent in 2024, accounting for 69.2% of the world total (390,000 tons); the United States mined 45,000 tons. Refining figures: International Energy Agency, “With new export controls on critical minerals, supply concentration risks become reality,” October 9, 2025. China = 91% of global rare earth separation and refining; 94% of sintered permanent magnet production. U.S. domestic production of refined rare earth compounds and metals in 2024 was approximately 1,300 tons (USGS) — roughly 0.3% of global production. Most U.S.-mined concentrate is exported for refining elsewhere, principally to China.

[2] Cerium oxide is the dominant abrasive in chemical-mechanical planarization slurries used for advanced-node silicon wafer polishing; its abrasive properties at sub-nanometer scales are not matched by available substitutes. Chinese mining accounts for the majority of global cerium supply, and Chinese separation accounts for the overwhelming majority of refined cerium oxide production.

[3] Lawrence Livermore National Laboratory, “LLNL selected to lead next-gen extreme ultraviolet lithography research,” December 23, 2024. Erbium-doped fiber amplifiers are standard in the seed-laser stages of EUV light-source pre-pulse generation. Terbium gallium garnet (TGG) is the standard material for Faraday optical isolators in DUV and short-wavelength laser systems, including those used in lithography, metrology, and inspection. Thulium-doped yttrium lithium fluoride is a candidate gain material for next-generation high-numerical-aperture EUV sources.

[4] Neodymium-doped yttrium aluminum garnet (Nd:YAG) is a long-established laser crystal used in fab metrology, alignment, inspection, and certain marking applications. See Vimaterial industry overview, “Rare earth materials for a brighter future,” February 26, 2026.

[5] Yttrium oxide ceramic coatings are standard for plasma etch chamber liners due to their resistance to fluorine and chlorine plasma chemistries; they reduce particle contamination and extend chamber service intervals. See industry technical literature on plasma etch chamber materials.

[6] Sputtering targets composed of rare-earth metals and oxides are used in physical vapor deposition of barrier layers, electrodes, and functional thin films in semiconductor manufacturing. Yttrium, gadolinium, and other rare earths appear across multiple deposition recipes.

[7] Standard NdFeB permanent magnet formulations contain 1–3% dysprosium or terbium for elevated-temperature applications. Industry-standard composition; see also USGS MCS 2026, Rare Earths (Heavy) chapter.

[8] Higher-performance NdFeB grades used in precision-motion applications (semiconductor manufacturing equipment, certain medical devices, defense applications) can contain heavy rare-earth content of up to approximately 10% by mass, depending on temperature and demagnetization-resistance requirements.

[9] Mining share: USGS MCS 2025, op. cit. (China 270,000 / world 390,000 = 69.2% in 2024). Refining and magnet shares: IEA, op. cit. (91% separation, 94% sintered permanent magnets).

[10] Payne Institute for Public Policy (Colorado School of Mines), “Explainer on the MP Materials–Department of War Partnership,” August 2025. The principal global sources of separated heavy rare earths, such as dysprosium and terbium, are ion-adsorption clay (IAC) mining operations; the only notable IAC operations in the world are in China and Myanmar (>99%), with the Myanmar production typically flowing into Chinese separation facilities.

[11] U.S. Geological Survey, Mineral Commodity Summaries 2026: Rare Earths (Heavy), February 2026. US heavy rare-earth imports in 2025: 100 metric tons of compounds and metals. Net import reliance 100% across 2021–2025. Terbium imports 100% from China; holmium 100% from China; lutetium 100% from China (including Hong Kong); ytterbium 86% from China.

[12] “Access, Disable, Destroy,” The AI Realist.

[13] White & Case LLP, “China imposes extraterritorial jurisdiction and a 50% Rule for export controls on rare earth elements and other items,” October 2025. Article 4 of MOFCOM Notification 61/2025 imposes a case-by-case review for memory chips at 256-layer and above and logic at 14 nanometer and below, plus production and testing equipment. Carra Globe, “China Rare Earth Export Controls 2026,” May 2026: case-by-case review remains active during the November 2025 suspension. MOFCOM original text: Center for Security and Emerging Technology translation of Notice No. 61.

[14] Discovery Alert/industry analyst commentary, November 2025, citing industry consensus on heavy rare earth separation onshoring timelines. (B-tier; consistent with multiple industry sources but no single A-tier confirmation.)

[15] PBS NewsHour / AP wire, “WATCH: Trump announces plan for rare earth elements strategic reserve,” February 2, 2026; Fortune, “New ‘Project Vault’ critical minerals stockpile is ‘first step of many’,” February 3, 2026. Procurement firms named: Hartree Partners, Mercuria, Traxys.

[16] Quest Metals industry analysis, “Project Vault: $12 Billion Critical Mineral Stockpile,” February 5, 2026, describing draw-down and replenishment structure.

[17] Rare Earth Exchanges, “Project Vault: America Wants a Strategic Minerals Reserve — But Can It Stockpile What It Still Can’t Produce?,” May 2026.

[18] MP Materials Q3 2025 earnings release, November 6, 2025; USGS MCS 2026, Rare Earths (Heavy) chapter, citing $150 million Department of War loan in August 2025.

[19] Lynas Rare Earths Q3 FY2026 results; Argus Media, “Lynas rare earth output rises in 3Q,” November 3, 2025; Rare Earth Exchanges, “Lynas Doubles Down on Heavy Rare Earths,” February 25, 2026.

[20] IEA, op. cit. China exported 58,000 tons of rare earth magnets in 2024.

[21] Rare Earth Exchanges, “China Targets Chipmaking Bottleneck: Rare Earth Polishing Project Launches in Shenzhen,” March 19, 2026. (B-tier source; project is announced state R&D, not yet commercial-scale; treat as directional signal.)

[22] Brookings, “What will happen when Trump meets Xi?,” May 5, 2026; Pakistan Today, “Trump-Xi talks to focus on trade, Iran and Taiwan,” May 8, 2026.

[23] Washington Times, “Chinese fentanyl exports, lock on rare earths to top Trump’s agenda at summit with Xi,” April 20, 2026, citing USTR Jamieson Greer testimony to House Appropriations subcommittee.

Where the Uranium bottlenecks actually are

Dante — Sat, 09 May 2026 23:28:27 GMT

Energy is the critical bottleneck for AI infrastructure today. In The Half-Life of a Press Release, we examined recent Small Modular Reactor hyperscaler announcements and their critical dependence on nuclear fuel enrichment. In this piece, we will focus on American companies operating in this field.

In May 2026, McKinsey published this report [1] on the US domestic nuclear fuel cycle that put a number on the rebuild: $105–170 billion of capex through 2050, split across mining, conversion, enrichment, fabrication, and reprocessing.

That’s a useful frame, but it’s not the investable number. The investable number is which one or two segments will absorb more than half of the new awards in the next 36 months, because the rest of the chain cannot move without them.

This is the first in a six-part series on US-listed nuclear-fuel-cycle equities. I screened 22 names against four filters — small/mid-cap, off all-time-high, accelerating fundamentals, and early narrative — and by the end of the series, I’ll be down to a five-name long book.

But before any of that, you have to understand where the bottlenecks actually are. They are not where most of the public conversation says they are.

The five segments and what they cost

The fuel cycle decomposes into five sequential nodes plus two adjacencies (reactors and waste/storage). Here’s the McKinsey capex stack:

If you read those numbers naively, reprocessing is the biggest opportunity. It isn’t. Commercial reprocessing has been effectively blocked in the US since Jimmy Carter’s 1977 executive order [2] and remains uninvestable on any horizon shorter than a decade. The capex range is wide because it’s a greenfield-risk number for a thing that probably won’t get built before 2040.

Mining looks underweighted at $15–20B. It is but globally, there is no shortage of uranium-producing capacity. Kazatomprom alone supplies roughly 40% of global production at low cost [3]. Adding US mining is a national-security argument, not a global-capacity argument. The investable angle in mining is uranium-spot beta plus US-specific permitting and ramp execution — not ground-up mine economics.

The interesting numbers are conversion and enrichment.

Where the bottleneck actually is

I’d score the seven nodes like this for severity over the next decade. Severity scale: 5 = single point of failure for the chain; 1 = not a binding constraint.

Three observations from this table that surprised me when I started doing this work.

First, HALEU enrichment is the single tightest knot in the chain.

HALEU — high-assay low-enriched uranium, 5–19.75% U-235 — is what every advanced reactor needs for its first core:

Oklo Aurora,
TerraPower Natrium,
X-energy Xe-100,
Kairos Power KP-FHR.

Until 2024, virtually all commercial HALEU came from Russia. Today, Centrus Energy has produced the first ~900 kg of US-origin HALEU at Piketon, Ohio. That is the entire commercial Western supply.

Second, conversion is almost as tight — and there is no way to play it directly on the listed US tape.

The single operating US conversion facility is ConverDyn’s Metropolis Works in Illinois, running at roughly 7 ktU/yr against an original nameplate of 15 ktU/yr. Its parent is Honeywell. Honeywell is a $137B mega-cap where conversion is a low single-digit percent of revenue. There is no listed pure-play. This matters for the screen because it means even if you correctly identify conversion as the tightest commercial bottleneck, you cannot express it cleanly through a single name. Anyone who tells you they have a “conversion trade” via Honeywell is overstating their position.

Third, advanced fuel fabrication (TRISO and metallic alloys) is also acute, with similarly thin investable exposure. The NRC granted X-energy the first-ever Category II TRISO fuel fabrication license in February 2026. X-energy is private. The only public direct play in advanced fuel fab is BWX Technologies (NYSE: BWXT) — and BWXT is a $19B mid-cap trading near its all-time high, well-covered, and structurally above the size cap most thematic books carry.

Mining sits below those three in severity. It is a thematic-beta trade with a structural overlay, not a structural trade with a price overlay. That distinction matters: if uranium spot rolls over 25%, mining-name multiples compress fast. The conversion and HALEU bottlenecks don’t decompress that way.

The DOE award everyone should be paying attention to

On January 6, 2026, the US Department of Energy awarded $2.7 billion [4], split evenly three ways:

$900M to American Centrifuge Operating (a Centrus Energy subsidiary) for HALEU at Piketon, Ohio.
$900M to General Matter for HALEU at the former Paducah Gaseous Diffusion Plant in Kentucky. General Matter only emerged from stealth in April 2025 and signed its DOE land lease in August 2025.
$900M to Orano Federal Services for LEU at Project IKE in Oak Ridge, Tennessee — a piece of a roughly $5B greenfield enrichment project.

Plus a smaller $28M supplemental award to Global Laser Enrichment [5] (Silex / Cameco JV) for next-gen technology.

The structure of this award is, to me, the most consequential signal in the McKinsey article. The federal government had a choice: concentrate the bet behind one US-owned producer, or seed three separate efforts. It chose three. That decision compresses per-name optionality versus a winner-take-all outcome, but it converts the question from “will US-owned HALEU exist?” (speculative) to “which of three named producers will execute first?” (handicapping).

Two of the three are private. The only listed name that won a tranche is Centrus Energy (NYSE: LEU). That is why every conversation about US enrichment exposure starts and often ends with Centrus — the math of public-market exposure forces it.

What this means for stock-picking

If you’re a thematic investor with a US-listed mandate, the McKinsey frame collapses to a few hard observations.

1. HALEU enrichment is where bottleneck severity, federal funding, and listed exposure all converge. This is where the work has to be most rigorous, because the names are crowded and the cone of outcomes is wide.

2. Conversion is structurally critical but offers no clean public expression. A future Solstice / Honeywell Advanced Materials spinoff is the most-watched corporate-action catalyst in the cycle.

3. Mining is investable but it is a uranium-price trade with a structural overlay, not the other way around. The order of those words is the difference between a 30% drawdown and a five-bagger.

4. The picks-and-shovels lane — waste handling, dosimetry, decommissioning instrumentation — is its own structural thesis, and there is exactly one filter-compliant mid-cap in it. I’ll come back to that in Post 4.

5. The advanced reactor adjacency (NuScale, Oklo, Nano Nuclear, BWXT, GE Vernova) is the demand engine for the entire chain. But FOAK economics are still unproven and the narrative is loud. Post 5.

The single most important question I’m asking through the rest of this series isn’t “which of these names is great.” It’s “which of these names is great at a price I should actually pay.” Most of them aren’t, today.

What’s coming

Post 2 — HALEU enrichment. Centrus Energy as the McKinsey anchor name. ASP Isotopes’ Quantum Leap Energy subsidiary as the optionality slot. Why I think one of these is fundamentally cheap right now and the other one isn’t.
Post 3 — Conversion. The bottleneck nobody can play directly, and the Solstice spin that might fix that.
Post 4 — Picks-and-shovels. One mid-cap that passes the screen in two of seven segments simultaneously.
Post 5 — SMR demand. Why I think the post-CFPP-cancellation reset on NuScale is more advanced than the market recognizes — and why that doesn’t mean I’m long.
Post 6 — The book. Five-name long book, position-sized, with explicit falsification triggers for each.

Subscribe if you want this in your inbox over the next few weeks.

Further reading

[1] McKinsey & Co. — Understanding domestic nuclear fuel production options in the United States

[2] Jimmy Carter’s Executive Order

[3] Kazatomprom - Uranium market

[4] DOE — Awards $2.7 billion to restore American uranium enrichment

[5] ANS Nuclear Newswire — DOE awards $2.7B for HALEU and LEU enrichment

World Nuclear News — US enrichment funding recipients flesh out plans

Prohibiting Russian Uranium Imports Act P.L. 118-62

*Capacity Factor is a six-part series on US nuclear fuel-cycle equities. Next post: HALEU enrichment.*

The $500 Billion Umbrella

Julien Simon — Wed, 06 May 2026 15:39:44 GMT

In January 2025, Sam Altman stood in the White House beside Donald Trump, Masayoshi Son, and Larry Ellison to announce the largest AI infrastructure project in history. Stargate: $500 billion, four years, a network of gigawatt-scale data centers across the United States and eventually the world. Fifteen months later, the project is collapsing from the periphery inward — and the center isn’t holding either.

The scorecard. In March, OpenAI and Oracle scrapped plans to expand the flagship Stargate campus in Abilene, Texas, from 1.2 gigawatts to 2 gigawatts after financing negotiations broke down. [1] Crusoe, the site developer, had already been struggling with reliability problems — a winter storm took liquid-cooling infrastructure offline for days. [2]

Microsoft swept in to rent the abandoned 900 megawatt expansion site from Crusoe. [3] Stargate was supposed to free OpenAI from Microsoft’s cloud. Now, Microsoft is occupying the data center that OpenAI couldn’t fill. Oracle, Stargate’s infrastructure partner, is the landlord to Microsoft at the site OpenAI abandoned. The access moat built a building. Someone else moved in.

On April 9, OpenAI paused Stargate UK entirely, citing energy costs and the regulatory environment — and, per Bloomberg, reining in spending ahead of a planned IPO. [4] The Nscale partnership announced in September 2025 — 8,000 Nvidia processors at Cobalt Park, Tyneside, first quarter 2026 — passed its own deadline without breaking ground. [5] In Abu Dhabi, Iran’s Islamic Revolutionary Guard Corps has threatened to destroy the $30 billion Stargate UAE facility, releasing satellite imagery of the site. [6]

Three sites. Three different failure modes. Financing (Abilene). Energy costs and regulation (UK). Missile threats (UAE). The original Abilene campus is operational — multiple buildings running Nvidia GPUs for OpenAI. But that campus predated the Stargate announcement. The new infrastructure — the expansion, the international sites, the multi-gigawatt network — is what the $500 billion was supposed to buy. None of it has materialized.

Stargate is not an outlier. Thirty to fifty percent of all US data center builds planned for 2026 face delays or cancellation — roughly half the industry’s pipeline. [7] Of the 16 gigawatts of planned capacity, only 5 are under construction. By 2027, it gets worse: 6.3 gigawatts under construction against 21.5 announced. [8] The bottleneck is not money — it is transformers, switchgear, and batteries that nobody can source fast enough. Stargate is just the project with its name on the White House lawn.

The pivot. While the sites were stalling, OpenAI abandoned its plan to build and own data centers altogether. In mid-March, The Information reported that OpenAI is now renting server capacity from cloud providers instead of building its own facilities. [9] The company restructured its entire compute team in response to this shift. [10] Total projected spending dropped from $1.4 trillion through 2033 to $600 billion through 2030. [11] OpenAI signed a $100 billion expansion of its AWS agreement — making Amazon, not Oracle or SoftBank, the de facto third-party infrastructure backbone. [12]

On April 29, the Financial Times reported that OpenAI has “in practice abandoned the joint venture.” [13] One person involved with Stargate said the company had “sidelined first-party data centers.” An insider close to SoftBank put it more bluntly: “People can basically define what ‘Stargate’ is for themselves. To some extent, any compute project involving SoftBank or Oracle can be called ‘Stargate.’” [13] OpenAI itself now calls it “an umbrella for our compute strategy.” In Norway, another Stargate-branded site fell through; OpenAI couldn’t close an offtake deal with Nscale at the Narvik facility, and Microsoft stepped in to lease the capacity instead. [13] Partners are “feeling let down and misled.” One source told the FT they prefer Microsoft as a tenant because “they are more creditworthy.” [13]

On April 11, three of Stargate’s original infrastructure leads — including Peter Hoeschele, who ran the early datacenter effort — left OpenAI for Meta. [14] The people who built the project are leaving. The day the FT story ran, OpenAI published a blog post claiming it had “surpassed” its 10 gigawatt target, with “more than 3 GW added in the last 90 days alone.” [15] The language was careful: “The financing models and partnership structures may evolve, but what matters is capacity coming online at scale.” This is the tell. Three gigawatts of leased capacity from AWS and Oracle is not three gigawatts of Stargate infrastructure. When you rent a hotel room, you don’t get to claim you built a hotel. The pivot may produce better economics for OpenAI — controlling chip decisions while renting the buildings is a defensible strategy. The question is whether the $500 billion investment thesis survives the change.

What the financing reveals. SoftBank, Stargate’s financial partner, took out a $40 billion unsecured bridge loan on March 27 with a twelve-month maturity. [16] The loan’s primary purpose: funding a $30 billion follow-on investment in OpenAI, bringing SoftBank’s total equity exposure to approximately $64.6 billion in a single pre-IPO company. [17] The loan matures in March 2027, before most Stargate sites will produce a kilowatt. SoftBank is financing the equity bet, not the infrastructure. Oracle, the designated builder, carries over $100 billion in debt on $30 billion in equity, with CDS spreads at their highest since 2009 and its own bondholders suing over undisclosed financing needs. [18] Beyond the original Abilene campus, nobody is financing Stargate construction.

OpenAI has also signed chip deals totaling nearly 27 gigawatts — with Nvidia, AMD, Broadcom, and Cerebras. [19] Stargate’s total planned capacity, as of September 2025, is approximately 7 gigawatts. [20] The chip commitments exceed the infrastructure capacity by roughly 4-to-1. Either the chips go into other people’s data centers — which is what “renting from AWS” means — or the commitments are aspirational on both sides.

The sovereign compute casualty. The UK pause is not just about energy costs. OpenAI for Countries — the program extending Stargate to the UK, Australia, Greece, the UAE, Slovakia, Kazakhstan, and others — was a sovereignty product. [21] The pitch: run frontier models locally within your jurisdiction on dedicated infrastructure. That requires physical infrastructure that OpenAI controls. If OpenAI can’t build it in the UK — stable grid, rule of law, English-speaking talent, George Osborne on the payroll — it can’t build it in Kazakhstan or Greece either.

Stargate was the largest AI infrastructure announcement ever made. Fifteen months later, the company that announced it calls it “an umbrella.” No international site has broken ground. The builder is being sued by its bondholders. The financier is providing equity financing through a 12-month loan. The people who ran the project are leaving for Meta. Partners who signed up to build data centers are watching Microsoft take the leases. The $500 billion bought a valuation, not a data center.

What happens next? Two scenarios.

First, Oracle’s balance sheet forces a reckoning. Over $100 billion in debt, negative free cash flow, and a quarter-trillion dollars in off-balance-sheet lease commitments are grounds for a credit downgrade. [18] Oracle can no longer finance the buildout at investment-grade rates. The 4.5 gigawatt agreement with OpenAI shrinks or restructures. SoftBank’s bridge loan matures in March 2027 without the infrastructure to justify a rollover. The Stargate venture is formally wound down or absorbed into existing bilateral cloud contracts.

Second, the sovereign compute product dies. OpenAI for Countries promised governments dedicated infrastructure inside their borders. If OpenAI is renting, not building, the infrastructure is Amazon’s or Microsoft’s — subject to US jurisdiction, not sovereign control. Governments that signed memoranda of understanding on the promise of sovereign AI discover they bought ChatGPT Edu licenses and a press photo with Sam Altman. The dependency on US cloud infrastructure that the sovereign product was supposed to escape remains intact.

For any AI infrastructure deal that follows — Stargate or otherwise — the test is simple: a site under construction, a power purchase agreement in force, and a builder whose balance sheet can finish the job. Anything less is a press release.

Notes

[1] Brody Ford, Edward Ludlow, and Dina Bass, “Oracle and OpenAI End Plans to Expand Flagship Data Center,” Bloomberg, March 6, 2026.

[2] “OpenAI’s massive Stargate data center canceled as firm can’t reach terms with Oracle,” Tom’s Hardware, March 8, 2026. Crusoe liquid-cooling disruption during winter weather is cited in the piece.

[3] Dina Bass and Brody Ford, “Microsoft Rents Data Center Project Developed for Oracle, OpenAI,” Bloomberg, March 27, 2026. Crusoe confirmed approximately 900 MW capacity, with the first building expected in mid-2027. Earlier Bloomberg reporting (March 24) cited approximately 700 MW; the difference likely reflects site capacity vs. initial IT load.

[4] “OpenAI Pauses Stargate UK Data Center Citing Energy Costs,” Bloomberg, April 9, 2026. Bloomberg reports OpenAI is “reining in ambitious spending plans ahead of a highly anticipated public listing.” OpenAI statement: “We continue to explore Stargate UK and will move forward when the right conditions, such as regulation and the cost of energy, enable long-term infrastructure investment.” See also CNBC, April 9, 2026.

[5] “OpenAI’s flagship UK data project delayed in setback for Starmer,” The Telegraph, April 4, 2026. The original September 2025 announcement specified ~8,000 Nvidia processors at Cobalt Park, with a Q1 2026 target.

[6] “Iran threatens ‘complete and utter annihilation’ of OpenAI’s $30B Stargate AI data center in Abu Dhabi,” Tom’s Hardware, April 5, 2026. IRGC Brigadier General Ebrahim Zolfaghari’s statements; satellite imagery of the site included in the IRGC video.

[9] The Information, reporting on OpenAI’s shift from building to renting data center capacity, mid-March 2026. Cited by Data Center Dynamics, The Deep Dive, CNBC, and others.

[10] “OpenAI reorganizes leadership amid data center strategy readjustment,” Data Center Dynamics, March 18, 2026. Sachin Katti appointed to oversee Stargate groups; the compute team split into three divisions.

[11] “OpenAI’s data center pivot underscores Wall Street spending concerns ahead of IPO,” CNBC, March 22, 2026. Total projected compute spending reduced from $1.4 trillion (through 2033) to $600 billion (through 2030).

[12] OpenAI expanded its existing AWS agreement by $100 billion over eight years; AWS was designated the exclusive third-party cloud distribution provider for OpenAI’s enterprise platform. CNBC, February 27, 2026.

[16] SoftBank $40 billion unsecured bridge financing facility, March 27, 2026. Twelve-month maturity (March 25, 2027). Syndicated by JPMorgan Chase, Goldman Sachs, Mizuho, SMBC, and MUFG. Interest rate not publicly disclosed as of publication. Source.

[17] SoftBank’s cumulative OpenAI equity exposure: $19 billion initial Stargate equity + $30 billion follow-on = $49 billion confirmed. Additional Vision Fund 2 positions bring the estimated total to approximately $64.6 billion (~13% ownership). OpenAI’s funding round closed at $122 billion in March 2026 at an $852 billion post-money valuation (initial $110B close in February expanded to $122B by final close). Author compilation from S&P Global, CNBC, CNBC April 15, and SoftBank disclosures.

[19] Nvidia: 10 GW LOI, September 2025. AMD: 6 GW definitive agreement, October 2025. Broadcom: 10 GW custom silicon term sheet, October 2025. Cerebras: $10 billion / 750 MW inference deal, January 2026. Sources: respective company announcements and Tom’s Hardware compilation, February 24, 2026.

[20] OpenAI, “Building the compute infrastructure for the Intelligence Age,” April 29, 2026, confirms the original 10 GW commitment: “When we announced Stargate in January 2025, we committed to securing 10GW of AI infrastructure in the United States by 2029.” September 23, 2025, expansion announcement brought the total to “nearly 7 gigawatts” of Stargate-branded planned capacity.

[21] OpenAI for Countries program: UK, Australia, Greece, UAE, Slovakia, Kazakhstan, and others. OpenAI, September 2025. See also “OpenAI pauses its Stargate UK data center plan,” Engadget, April 9, 2026.

[18] Oracle Corporation Form 10-Q, period ended November 30, 2025 (SEC filing). Total debt: $108.1 billion ($8.1B current + $100.0B non-current). Total stockholders’ equity: $30.5 billion. Off-balance-sheet lease commitments of $248 billion are disclosed in notes to financial statements. Bondholder lawsuit: Ohio Carpenters’ Pension Plan v. Oracle, filed January 14, 2026, NYSC. Bloomberg, January 15, 2026.

[7] Sightline Climate, 2026 Data Center Outlook. Of ~16 GW of US data center capacity planned for 2026 across 140 projects, only ~5 GW is under active construction. 25% of projects have not disclosed their powering strategy. See also Bloomberg, April 1, 2026.

[8] 2027 pipeline: 6.3 GW under construction vs. 21.5 GW announced. Beyond 2028, 37 GW of planned capacity has not broken ground, and only 4.5 GW of that has begun work. Futurism, April 2026; ZeroHedge analysis citing Sightline Climate and Canaccord.

[14] Peter Hoeschele, Shamez Hemani, and Anuj Saharan left OpenAI and are joining Meta. Hoeschele led the early Stargate datacenter effort; Hemani worked on computing strategy; Saharan led within the computing organization. “Former OpenAI Stargate Leaders Plan to Join Meta Platforms,” Bloomberg, April 11, 2026.

[13] Financial Times, reported April 29, 2026. OpenAI has “in practice abandoned the joint venture.” One person involved with Stargate said the company had “sidelined first-party data centers.” OpenAI described Stargate as “an umbrella for our compute strategy.” A person close to SoftBank: “People can basically define what ‘Stargate’ is for themselves. To some extent, any compute project involving SoftBank or Oracle can be called ‘Stargate.’ Norway Stargate site abandoned; Microsoft leased the Narvik facility from Nscale. Partners “feeling let down and misled.” Source preference for Microsoft as tenant: “They are more creditworthy.” Cited via Tom’s Hardware, April 30, 2026. “Define for themselves” quote via BigGo Finance, May 1, 2026, citing FT sources. See also CNBC, April 15, 2026, for details on Norway.

[15] OpenAI, “Building the compute infrastructure for the Intelligence Age,” April 29, 2026. Claims to have “surpassed” 10 GW target with “more than 3GW added in the last 90 days alone.” Note: “capacity” in OpenAI’s usage includes leased capacity from third-party providers (AWS, Oracle, Microsoft), not only self-built infrastructure. The blog’s language — “the financing models and partnership structures may evolve” — is an implicit acknowledgment of the FT reporting published the same day.