# AI is not a *software business* anymore.

[Notes](https://www.orfloat.com/notes) · Briefing · 24 May 2026

Microsoft will spend roughly **$190 billion** on capital expenditure in calendar 2026 — and still expects to be capacity-constrained through year-end. The four biggest hyperscalers will spend close to **$700 billion** combined this year, roughly 3.5× what they spent two years ago. None of that looks like a classic software business. Six months from now, neither will your AI vendor contract.

The cloud was supposed to make infrastructure someone else's problem. Write code once, run it many times, scale on demand. The whole abstraction depended on supply staying ahead of demand by a comfortable margin. AI broke that abstraction — not gradually, and not theoretically. The Q3 FY26 results from Microsoft, published in late April, put the new shape of the industry on the public record.

## The numbers tell you what changed

Microsoft reported [$31.9 billion in capital expenditure](https://www.sec.gov/Archives/edgar/data/0000789019/000119312526191507/msft-20260331.htm) for the fiscal third quarter ending 31 March 2026, and guided to **more than $40 billion** for the following quarter. About two-thirds of the quarterly spend went to short-lived assets — primarily GPUs and CPUs that will be fully depreciated inside four years. For calendar 2026, the company expects total capex to land around [$190 billion](https://www.cnbc.com/2026/04/29/microsoft-msft-q3-earnings-report-2026.html) — and the CFO was explicit that this will not be enough. Azure demand still exceeds supply.

Microsoft is not alone. Alphabet, Amazon, and Meta are running the same play. Combined hyperscaler 2026 capex is now [projected to approach $700 billion](https://fortune.com/2026/04/30/big-tech-hyperscalers-will-spend-700-billion-on-ai-infrastructure-this-year-with-no-clear-end-in-sight-eye-on-ai/) — up from roughly $200 billion in 2024 and 6× the 2022 level. Meta raised its full-year guidance to $125–145 billion and explicitly blamed component and data-center costs. Amazon is now expected to post a [negative free cash flow year](https://www.cnbc.com/2026/02/06/google-microsoft-meta-amazon-ai-cash.html) for the first time in over a decade.

Mary Meeker, in her [340-page Bond Capital report](https://www.bondcap.com/reports/tai) published last May, used the word "unprecedented" on 51 separate pages to describe the rate of change. At NVIDIA's GTC 2026 keynote, Jensen Huang stopped calling Nvidia a chip company at all. He calls it an [AI factory company](https://www.datacenterfrontier.com/machine-learning/news/55364406/jensen-huang-maps-the-ai-factory-era-at-nvidia-gtc-2026) now. "Tokens are the new commodity," he said twice on the same slide. He was not being poetic.

## Tokens are not magic. They are manufactured.

The most important shift to understand is that every answer from a model is the output of a physical production system. GPUs, high-bandwidth memory, advanced packaging, substrates, optics, power, cooling, land, networking, and operations talent — that is the bill of materials behind the paragraph of text on your screen, the line of code Claude wrote you, or the agent that just finished summarising a contract. The user sees software. Behind the software is a factory turning electricity and silicon into intelligence.

> Six months ago an AI vendor agreement was structured like a software agreement. Today it is a supply contract in everything but name.

And that supply contract has terms most procurement teams have never written before: allocation, fallback, reserved capacity, multi-region failover. None of it was a line item a year ago. All of it should be in the next one you sign.

## What this changes for an operator in Muscat

At first glance, hyperscaler capex looks like a Silicon Valley story. It is not. The Gulf has been positioning itself as a sovereign AI-infrastructure region for a year and a half, and the placements are now public. Saudi Arabia's [HUMAIN](https://www.globaldatacenterhub.com/p/does-humains-12b-saudi-framework) secured a $1.2 billion AI-infrastructure framework in January. The UAE's G42, partnered with Oracle, NVIDIA, Cisco, and SoftBank under the [Stargate UAE](https://introl.com/blog/middle-east-ai-revolution-uae-saudi-arabia-100b-infrastructure-plans) banner, is building a 5 GW AI campus in Abu Dhabi with a 1 GW cluster going live in 2026. Oman's own *AI and Digital Future Programme* names data-centre capacity as a delivery mechanism for the digital economy.

For an operating business in Muscat, this means two things at once. The first is that *closer* inference is finally becoming a procurement option — your token round-trip will be in-region within the next eighteen months, with the residency and latency profile that regulated industries need. The second is that the same capacity rationing playing out in Redmond and Mountain View will play out in Riyadh and Abu Dhabi too, just on a slight lag. Whoever signs first sits at the front of the queue.

## The unit that actually matters is the token

Almost every AI plan we read inside client engagements is still priced in seats. Five Claude Pro seats. Ten Copilot seats. A team licence. Seats are the wrong unit for a factory output. The right unit is the token — input tokens, output tokens, cached tokens, and the routing decisions between them.

We forecast client workloads in tokens. We treat an agent that runs unattended for an hour as a different budget line than a chatbot that answers one question. We instrument every workflow to know — by name — which model was called, with what prompt, returning how many tokens, at what cost. None of that is exotic. It is just bookkeeping that nobody has bothered to install yet, because the abstraction of "seats" let everyone pretend the supply was infinite. It isn't.

## What we do about it

Inside a Discovery Phase, the supply-chain reality of AI shows up in three concrete deliverables.

- **A vendor audit.** We map the supply chain under your current AI contracts — which cloud, which region, which underlying chip family, what allocation terms exist (or don't), and what your fallback looks like when the next capacity squeeze hits.
- **A token-denominated forecast.** We size demand in tokens, not seats — separating chatbot work from agentic workloads and pricing each at the model class that actually fits.
- **A routing diagnostic.** Most operations burn premium-tier inference (Opus-class) on work that Haiku-class would do for a tenth of the cost. We find those, fix them, and the savings usually pay for the engagement before deployment.

Microsoft has already put $190 billion behind a view of the world. Most operating businesses in the Gulf have not put even the equivalent procurement framework into their plans. That is the gap. It will close — for the businesses that decide to close it. [Start a Discovery Phase.](https://www.orfloat.com/contact)

— Akram Ahmed, CTO · Orfloat · Muscat

## References

1. Microsoft Corp. *Form 10-Q, Fiscal Q3 2026 (quarter ended 31 March 2026).* [sec.gov/Archives/edgar/data/0000789019/…/msft-20260331](https://www.sec.gov/Archives/edgar/data/0000789019/000119312526191507/msft-20260331.htm)
2. CNBC. *Microsoft calls for $190 billion in 2026 capital spending on soaring memory prices.* 29 April 2026. [cnbc.com](https://www.cnbc.com/2026/04/29/microsoft-msft-q3-earnings-report-2026.html)
3. Fortune. *Big Tech is about to spend $700 billion on AI this year.* 30 April 2026. [fortune.com](https://fortune.com/2026/04/30/big-tech-hyperscalers-will-spend-700-billion-on-ai-infrastructure-this-year-with-no-clear-end-in-sight-eye-on-ai/)
4. Bond Capital / Mary Meeker. *Trends — Artificial Intelligence.* May 2025. [bondcap.com/reports/tai](https://www.bondcap.com/reports/tai)
5. Data Center Frontier. *Jensen Huang Maps the AI Factory Era at NVIDIA GTC 2026.* March 2026. [datacenterfrontier.com](https://www.datacenterfrontier.com/machine-learning/news/55364406/jensen-huang-maps-the-ai-factory-era-at-nvidia-gtc-2026)
6. Global Data Center Hub. *Does HUMAIN's $1.2B Saudi Framework Signal a New Model for AI Data Centers?* January 2026. [globaldatacenterhub.com](https://www.globaldatacenterhub.com/p/does-humains-12b-saudi-framework)
7. Introl. *Middle East AI Revolution: UAE and Saudi Arabia's $100B+ Infrastructure Plans.* [introl.com](https://introl.com/blog/middle-east-ai-revolution-uae-saudi-arabia-100b-infrastructure-plans)

[← Back to Notes](https://www.orfloat.com/notes)
