OpenAI restructures under nonprofit control – commits $1.4T for 30+ GW compute

Executive Summary

OpenAI just rewired its corporate guts on a livestream: a new nonprofit Foundation now controls OpenAI Group PBC while holding roughly 26% of its equity, and an IPO is “most likely” down the road. The governance reset arrives alongside an audacious scale plan, with more than 30 GW of new compute and about $1.4T in obligations, meant to power an automation roadmap targeting an AI research “intern” by Sep 2026 and a credible “researcher” by Mar 2028. That’s the clearest statement yet that OpenAI expects deep learning to keep compounding without exotic detours.

The revised Microsoft deal tightens the bond: Redmond holds ~27%, Azure/API exclusivity remains until an independent panel verifies AGI, and OpenAI adds roughly $250B in extra Azure spend; Microsoft keeps product/model IP rights through 2032 under safety guardrails. OpenAI wants to industrialize data‑center builds to 1 GW per week at about $20B/GW, and it’s shifting distribution from ChatGPT to an “AI cloud” where third‑party builders create more value than the platform itself, with Atlas for Windows promised “in some number of months.” Management also claims the unit cost of intelligence is falling about 40× per year, pushing GPT‑3‑scale models onto phones and setting up a model leap within six months.

If those timelines hold, the real bottlenecks become electricity, concrete, and the AGI verification gate baked into the Microsoft pact.
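To get a feel for what the claimed 40× annual decline in the unit cost of intelligence would imply over shorter and longer horizons, here is a minimal sketch. It assumes the decline is a constant exponential rate, which the livestream claim does not specify; the function names are illustrative only.

```python
# Compounding sketch for the claimed "about 40x per year" cost decline.
# Assumption (not from the source): the decline is smooth and exponential,
# so the factor over t years is 40 ** t.

ANNUAL_DECLINE = 40.0  # claimed yearly reduction factor in unit cost


def decline_factor(years: float, annual: float = ANNUAL_DECLINE) -> float:
    """How many times cheaper a unit of intelligence is after `years`."""
    return annual ** years


def relative_cost(years: float, annual: float = ANNUAL_DECLINE) -> float:
    """Remaining cost as a fraction of today's cost after `years`."""
    return 1.0 / decline_factor(years, annual)


if __name__ == "__main__":
    for months in (1, 6, 12, 24):
        factor = decline_factor(months / 12)
        print(f"{months:>2} months: {factor:,.2f}x cheaper")
```

Under that assumption, the claim works out to roughly 1.36× cheaper per month, about 6.3× over six months, and 1,600× over two years.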

Feature Spotlight

Feature: OpenAI’s new structure, AGI research timeline, and compute plan

OpenAI reset its governance and strategy: nonprofit control, Microsoft at 27% with exclusivity until AGI verification, intern‑level AI research by Sep 2026, full researcher by Mar 2028, and ~$1.4T/30+GW compute in flight.

Cross‑account highlight today: OpenAI’s livestream detailed a new nonprofit‑controlled structure, a revised Microsoft deal, a roadmap to automated AI research, and a massive compute build‑out. Excludes all other OpenAI items from the rest of the report.

🌐 Feature: OpenAI’s new structure, AGI research timeline, and compute plan

Compute scale-up: 30+ GW new build and ~$1.4T obligations; 1 GW/week factory target at ~$20B/GW

OpenAI disclosed commitments for 30+ GW of new compute capacity and about $1.4 trillion in obligations over “the next many years,” along with an aspiration to industrialize data center builds at ~1 GW per week over a five‑year lifecycle at roughly $20B/GW, with robotics to accelerate construction (sources: infrastructure stack, infrastructure slides, GW and spend slide). This underscores electricity as a binding constraint and complements the prior public call for a 100 GW/year U.S. expansion (see the earlier power memo).
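The headline figures above invite a quick sanity check. The sketch below is back-of-envelope arithmetic only: it treats "30+ GW" as exactly 30 GW and assumes a steady 52-week build year, and the function names are illustrative rather than anything OpenAI published.

```python
# Back-of-envelope arithmetic on the figures quoted above.
# Inputs are the article's headline numbers; everything derived from them
# is illustrative, not an OpenAI disclosure.

COMMITTED_GW = 30.0          # "30+ GW": treated here as a lower bound
OBLIGATIONS_USD = 1.4e12     # "about $1.4 trillion"
TARGET_USD_PER_GW = 20e9     # aspirational "~$20B/GW"
TARGET_GW_PER_WEEK = 1.0     # aspirational "~1 GW per week"
WEEKS_PER_YEAR = 52          # assumption: a steady year-round build rate


def implied_usd_per_gw(obligations_usd: float, capacity_gw: float) -> float:
    """Obligations divided by capacity; an upper bound if capacity is a floor."""
    return obligations_usd / capacity_gw


def annual_build(gw_per_week: float, usd_per_gw: float) -> tuple[float, float]:
    """Return (GW built per year, dollars spent per year) at a constant weekly rate."""
    gw_per_year = gw_per_week * WEEKS_PER_YEAR
    return gw_per_year, gw_per_year * usd_per_gw


if __name__ == "__main__":
    per_gw = implied_usd_per_gw(OBLIGATIONS_USD, COMMITTED_GW)
    gw_year, usd_year = annual_build(TARGET_GW_PER_WEEK, TARGET_USD_PER_GW)
    print(f"Implied obligations per GW: ${per_gw / 1e9:.1f}B")
    print(f"Factory target: {gw_year:.0f} GW/year, ${usd_year / 1e12:.2f}T/year")
```

With the quoted inputs this comes to roughly $46.7B of obligations per committed GW, and a factory target of 52 GW and about $1.04T per year. The two per‑GW figures are not directly comparable, since the source does not break down what the $1.4T in obligations covers.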

Table of Contents

🌐 Feature: OpenAI’s new structure, AGI research timeline, and compute plan

Compute scale-up: 30+ GW new build and ~$1.4T obligations; 1 GW/week factory target at ~$20B/GW

Microsoft–OpenAI definitive deal: 27% stake, Azure/API exclusivity until AGI panel verifies AGI; IP rights through 2032

OpenAI targets automated AI research: intern by Sep 2026, full researcher by Mar 2028

OpenAI Foundation now controls OpenAI Group PBC; nonprofit holds ~26% equity as LLC converts to PBC

Safety stack detailed: value/goal alignment, reliability, robustness, systemic safety; CoT faithfulness under study

OpenAI tees up an “AI cloud” where builders create more value than the platform; Atlas for Windows planned

Unit cost of intelligence down ~40×/year; GPT‑3 scale runs on a phone, GPT‑4 costs falling fast

Personal AGI device teased for everyday use across work and life

🛠️ Agent platforms in the IDE and cloud

GitHub unveils Agent HQ with third‑party agents; Codex lands in VS Code Insiders

Cloudflare shows how to host Claude Agent SDK in Sandboxes with bash tool enabled

Factory 1.9 ships mixed‑model sessions, custom subagents, and a GitHub App for inline PR reviews

LangChain DeepAgents 0.2 adds pluggable backends for agent filesystems and long‑run memory

OpenRouter adds resettable API key limits and usage analytics for multi‑agent fleets

⚡ Serving tricks: faster switches and sturdier toolcalling

vLLM Sleep Mode enables zero‑reload model switches with 18–200× faster swaps and 61–88% quicker first token

vLLM and Kimi K2 fix tool‑calling drift; now >99.9% success and 76% schema accuracy, with ‘Enforcer’ incoming

vLLM flags Sleep Mode as a lever to cut GPU costs for model marketplaces; Aegaeon already runs on vLLM

vLLM adds Anthropic API compatibility to ease Claude‑based app migration

🧪 New multimodal models land across providers

Nemotron Nano 12B v2 VL lands on OpenRouter with free logged tier and multiple no‑logging providers

Replicate hosts Nemotron Nano 12B v2 VL for document/video intelligence in 10 languages

Baseten rolls out Nemotron Nano 2 VL with finance‑grade agent patterns and day‑zero support

Hyperbolic adds latest NVIDIA Nemotron models, expanding VL deployment options

🤖 Humanoids get real: 1X NEO preorders and G1 muscle

1X opens NEO preorders at $20k or $499/mo; 2026 U.S. deliveries and detailed spec sheet

Unitree G1 pulls a 1,400 kg car; physics and posture make the stunt plausible

🏭 AI compute ramp: DOE supercomputer, DPUs, and multi‑site training

DOE and NVIDIA to build Solstice supercomputer with 100k Blackwells for open science

NVIDIA forecasts 6M Grace Blackwells in first five quarters, ~$500B ramp through 2026

EpochAI says 10 GW multi‑site training across 23 U.S. locations is feasible with fat pipes

NVIDIA’s BlueField‑4 DPU pairs 64‑core Grace with 800 GbE to offload IO for AI data centers

Qualcomm enters data‑center inference with AI200/AI250 accelerators; shares jump ~11%

💼 Enterprise adoption and market moves

Wharton: 75% of firms already see AI ROI; leaders using AI daily hits 46%

Chegg to cut ~45% of staff, citing AI disruption and search traffic decline

Google Labs launches Pomelli, an AI marketing agent live in US/CA/AU/NZ

OpenAI posts AI Deployment Manager role in India, signaling local expansion

Baseten adds NVIDIA Nemotron Nano 2 VL to power finance agents and extraction

Delve markets agentic RAG to auto‑complete 200‑page security questionnaires

Fitbit rolls out Gemini‑powered personal health coach to eligible U.S. Android users

Gemini rolls into Google Home voice assistant in the U.S., boosting distribution

Groq to power HUMAIN One real‑time AI OS for enterprise assistants

Netflix to share how it scales AI agents to 3,000+ developers in Anthropic webinar

🛡️ Risk reporting and legal pressure

Judge lets authors’ copyright claims against OpenAI proceed; fair use unresolved

Anthropic publishes pilot sabotage risk report; METR reviewed an unredacted version

OpenAI reports 1M weekly suicide‑related chats; GPT‑5 lifts desirable responses to 91%

🎬 Creative AI: video, design, and assistants

Adobe MAX: Express AI Assistant, Firefly 5 (4MP), and Project Graph previews

Google launches Pomelli marketing agent on Labs in US/CA/AU/NZ

Grok Imagine preps ‘Extend video’ and a video/image generation selector on web

Hailuo 2.3 jumps to #5 on Video Arena’s Image-to-Video board

CapCut’s AI Design drives prompt‑to‑poster workflows for campaigns and social

Higgsfield Instadump turns 1 selfie into 15 pro shots with preset packs

Fully AI‑generated sitcom clip made with LTX‑2 circulates as a quality showcase

🧭 Agentic parsing and compliance RAG

Delve unveils agentic compliance RAG that finishes 200‑page security questionnaires in minutes

LlamaParse adds agentic chart parsing to convert complex graphs into accurate tables

📊 Evals and live competitions

Kimi K2 Vendor Verifier adds case‑by‑case tool‑call metrics; vLLM shows 99.9% success, 76% schema accuracy

ARC Prize 2025 nears close: 1.3K teams, 13.9K submissions with 6 days left

MiniMax M2 posts strong evals at 8% of Claude’s price and 2× speed, ranks #5 on Artificial Analysis

Six frontier LLMs face off in a three‑day Texas Hold’em tournament, no prompts allowed

BadScientist: AI‑written fake papers hit up to 82% acceptance by LLM reviewers

Hailuo 2.3 jumps to #5 on Video Arena’s Image‑to‑Video leaderboard

Study: Sycophantic AI flatters 50% more than humans, reducing conflict repair intentions

🗣️ Voice everywhere: Home, Windows, and wearables

Google rolls Gemini into the Home voice assistant for U.S. users

Fitbit rolls out Gemini‑powered personal health coach with multi‑agent architecture

Typeless voice control lands on Windows, bringing speech‑first workflows to PCs

Microsoft adds podcast feature to Copilot, advancing voice‑first content in assistants
