Neocloud: The Noun, the Workload, and the Balance Sheet
A single-topic deep dive into "neocloud" — the GPU-as-a-Service business. The word fuses three things worth separating: a genuine engineering cost edge, a depreciating asset financed with debt, and a marketing label. We argue the durable move is to ignore the noun, date the workload's half-life, watch the balance sheet, and watch for the next silicon swing — today GPU, tomorrow TPU and inference ASICs.
In this episode
- The noun vs. the workload. Allbirds becoming "NewBird AI" — a shoe company that six-x'd its stock on a GPU pivot funded by a $50M facility that had delivered just $3.25M — as the comic tell that the label has decoupled from the business.
- Why the cost edge is real. Bare-metal servers, stripped bills of materials, and 3.2 Tbps InfiniBand fabric explain the 2x–7x price gap over hyperscalers — structural, not magic. Even Microsoft rents from neoclouds.
- ...and why it compresses. Post-depreciation bare-metal margins of ~14–16%, advertised-not-transacted pricing, the 66% utilization breakeven, and the migration of durable value toward reliability, orchestration, and software.
- The financing layer. CoreWeave's ~$30B debt and Microsoft concentration, $20B+ in GPU-collateralized lending, the Nvidia–CoreWeave–Nebius circular-financing critique, collapsing H100 resale values, and the Michael Burry depreciation fight — steelmanned on both sides.
- The power floor. Interconnection queues as the binding constraint (2.3 TW backlog; Texas at 1.8% operational), why a PPA isn't physical delivery, and the Bitcoin-miner pivot (IREN–Microsoft, CoreWeave buying Core Scientific).
- The silicon swing. Inference overtaking training, TPUs/Trainium/Maia, Broadcom's design concentration, Anthropic's three-platform reality behind the "de-Nvidification" headline, and neocloud as one rung on cloud's 15-year re-bundling ladder.
- The architect's durable questions. A buyer's checklist for evaluating any "new cloud," including financial due diligence and asking when firm interconnection capacity physically exists.
Sources & References
Primary / originating sources (operator-provided — ground zero)
outputs/special_editions/_anchors/neocloud-briefing.md(operator-provided local document)- https://semianalysis.com/2024/10/03/ai-neocloud-playbook-and-anatomy/
- https://www.cnbc.com/2025/11/21/nvidia-gpus-google-tpus-aws-trainium-comparing-the-top-ai-chips.html
- https://www.edgeir.com/allbirds-dumps-footwear-and-bets-50m-on-gpu-cloud-pivot-as-newbird-ai-20260430
Research & critique (financing, economics, depreciation)
- CNBC — AI GPU depreciation, CoreWeave, Nvidia, and Michael Burry (Nov 14, 2025)
- Barron's — Nvidia, AI chip depreciation, and the accounting debate
- CoreWeave — SEC 10-Q filing (June 30, 2025)
- IO Fund — Nvidia, CoreWeave, Nebius circular financing
- Financial Times — GPU-collateralized debt and the neocloud financing risk
- Dave Friedman — Neoclouds hold more than $20 billion in GPU-collateralized debt
- Silicon Data — H100 GPU market value trends
- Epoch AI — Trends in machine learning (inference/training efficiency)
- McKinsey — The evolution of neoclouds and their next moves
Industry & implementation (pricing, market structure, deals)
- Forbes — Inside the neocloud economy: what's next for GPU-as-a-Service (Nov 6, 2025)
- Uptime Institute — Neoclouds: a cost-effective AI infrastructure alternative?
- Spheron Network — GPU cloud pricing comparison 2026
- Introl — AWS H200 GPU price increase, January 2026
- ABI Research — Neocloud market trends
- Reuters — Anthropic commits up to $200B in capacity on Google Cloud (May 5, 2026)
- Anthropic — Google and Broadcom TPU partnership (Apr 6, 2026)
- Deloitte — TMT Predictions 2026: compute power and AI
Power & grid (the binding constraint)
- World Economic Forum — Electricity and grid connectivity: the strategic bottleneck for AI (May 2026)
- Institute for Progress — Interconnection for AI
- Rocky Mountain Institute — Interconnection reform, AI data centers, and generator queues
- Harvard Belfer Center — AI data centers and the U.S. electric grid
- Hanwha Data Centers — Grid limitations: the power bottleneck (why a PPA isn't delivery)
- Blockspace Media — Bitcoin mining stocks and the AI pivot (IREN–Microsoft)
Geopolitics & sovereign demand
- CSIS — DeepSeek, Huawei, export controls, and the U.S.–China AI race
- SemiAnalysis — 2025 AI Diffusion export controls and "regulatory capture"
- UC Berkeley Goldman School — An evolving AI supply chain
Note on figures: several pricing and balance-sheet numbers cited above originate in secondary analysis and vendor blogs; advertised GPU rental rates are offered, not transacted, prices; and the "$200B" Anthropic–Google aggregate and the IREN/Core Scientific deal terms trace to secondary reporting pending primary confirmation. We've flagged these in the episode where they carry weight.
Have questions about this episode? Reach out.