ChatGPT Agent Mode opens to 3 paid tiers; 4.5× faster on Sudoku
Executive Summary
OpenAI just flipped on Agent Mode (Atlas) inside ChatGPT for Plus, Pro, and Business, turning the product from chat window into click-and-act assistant. It matters because Atlas works directly in the browser, researching, planning, and executing steps without the glue code most agents demand. Early tests are mixed: one study finds it solves medium Sudoku about 4.5× faster than a human baseline, but it stumbles on reflex-timing games like Chrome's T-Rex Runner and Flappy Bird. Windows support is missing in this preview, and the rollout follows a brief pause on Atlas extensions for security.
Hands-on users say the basics (navigating, reading, simple clicks) feel solid, but Atlas often stalls when composing or formatting inside DOM-heavy web apps. The new "thinking" view doesn't help much either; auto-scroll keeps yanking you to the bottom, making the reasoning trace hard to audit mid-run. Power users comparing it to Perplexity's Comet argue there's "no reason to switch" yet unless Atlas proves better at real tasks, especially content creation and edit flows.
If you're eyeing desktop agents, note the parallel track: OpenAI's Codex CLI added an experimental Windows sandbox this week, hinting at tighter guardrails coming to agent operations even as Atlas's own Windows build sits out this preview.
Feature Spotlight
Feature: ChatGPT Agent Mode goes hands-on
ChatGPT Agent Mode (Atlas) enters preview for Plus/Pro/Business, enabling agents to research, plan, and act in-browser; early evals show strengths in logic tasks but gaps in real-time control. Broad user feedback begins.
Cross-account focus today: OpenAI's Agent Mode (Atlas) opens preview to Plus/Pro/Business. Threads include real usage, UX feedback, and an early web-games eval; strong Sudoku, weak reflex timing. This section owns all Atlas items.
Feature: ChatGPT Agent Mode goes hands-on
ChatGPT Agent Mode opens preview to Plus, Pro and Business users
OpenAI flipped on Agent Mode in ChatGPT (Atlas) for paid accounts, enabling agents to research, plan, and take actions while you browse OpenAI announcement. The rollout follows an extensions pause that temporarily disabled Atlas browser extensions for security.
Hands-on prompts and early testing are already circulating among power users hands-on try.
Paper: Atlas aces medium Sudoku ~4.5× faster than humans but struggles on reflex timing games
A new study probes ChatGPT Atlas as a web-game agent: it cleanly solves medium Sudoku roughly 4.5× faster than a human baseline, but falters on real-time tasks like Chrome's T-Rex Runner and Flappy Bird due to precise timing demands paper summary. The work maps strengths to rule-based logic and weaknesses to long-horizon control and physics.
- Strong: Sudoku and other logic puzzles (fast, consistent execution) paper summary
- Weak: Reflex timing, strict geometry, and open-world task chains (frequent early crashes or stalls) paper summary
Early takes pit Atlas against Perplexity Comet; Windows support called out as missing
Practitioners testing ChatGPT Atlas Agent Mode compare it to Perplexity's Comet, arguing there's "no reason to switch" unless Atlas proves better, and noting it isn't available for Windows yet in this preview comparative take. Trial prompts are circulating to kick the tires on real tasks hands-on try.
Power users say Atlas stalls on DOM-heavy creation tasks despite basic browsing working
Hands-on reports praise Atlas for simple clicks and navigation but flag that it "gets stuck" when adding, formatting, or creating content inside complex web apps (richer DOM composition) power-user feedback. Testers want stronger actions for editing and composing, not just reading and clicking.
Thinking trace auto-scroll frustrates Atlas users trying to read reasoning history
Early UX feedback says the new "thinking" view auto-scrolls to the bottom with each entry, making it hard to review the ongoing reasoning trace during a run UX note. Users are asking for better controls to pause or browse intermediate thoughts without fighting the scroll.
AI infrastructure: campuses, energy and financing
Infra news dominated by OpenAI's 1+ GW Stargate campus in Michigan plus Amazon's Anthropic site switch-on, Meta's 1 GW solar deals, and debt-financed capex. Excludes Atlas (covered in Feature).
Amazon switches on Indiana AI campus for Anthropic with >500k Trainium 2, targeting 2.2 GW buildout
Amazon's New Carlisle, Indiana site dedicated to Anthropic is now live, running on 500,000+ Trainium 2 chips and planned to span 30 buildings with 2.2 GW when complete News summary, following up on Rainier site which flagged the massive chip count and power envelope. The project flips former cornfields into a multi-billion-dollar AI compute hub in roughly a year, reinforcing AWS's push to vertically own AI training capacity for key partners.
OpenAI picks Michigan for >1 GW Stargate campus; "largest investment in state history"
OpenAI will build a gigawatt-scale Stargate data center in Saline Township, with construction targeted for early 2026, 2,500 union construction jobs, ~450 permanent roles, and closed-loop water usage (no Great Lakes draw) Local coverage. The company also outlined the multi-site Stargate program in its post, underscoring a US-based AI infrastructure buildout OpenAI blog.
Debt wave funds AI buildout: AI capex now ~25% of US IG bond supply; Meta $30B, Oracle $18B, Related Digital $27B
Bank of America data shows borrowing to fund AI data centers exploded in September–October, with AI now ~25% of US investment-grade bond supply; recent highlights include Meta $30B, Oracle $18B, and Related Digital $27B Debt chart. Meta is also prepping another $25B sale as it frontloads ASI-oriented capex Bond sale plan. The financing mix concentrates the cheapest capital with incumbents that can match long-lived contracts to chip lifecycles.
Samsung and NVIDIA to build AI "mega-factory" with 50k GPUs; cuLitho targets ~20× faster computational lithography
Samsung and NVIDIA will stand up a GPU-powered AI factory to run fab digital twins, speed chip design, and accelerate optical proximity correction with cuLitho (claimed ~20× faster), while integrating Blackwell/Jetson Thor in factory robotics WSJ summary. Running core chipmaking workloads on GPUs instead of CPU clusters signals a structural compute shift inside semiconductor manufacturing itself.
TSMC clears ~$49B A14 fab in Taichung for 1.4 nm; mass production targeted 2H28
TSMC received permits for its A14 fab and utility buildings in Taichung, aiming 1.4 nm with ~15% speed at iso-power or ~25–30% lower power at iso-perf versus 2 nm, risk runs in 2027, and volume in 2H28 Local news summary. The node claims performance-per-watt gains critical to AI accelerator cost curves, while avoiding High-NA EUV reduces tool risk.
UBS model projects NVIDIA unit mix through 4Q26 with GB200 ramp and Rubin CPX on the horizon
A UBS unit-mix chart outlines NVIDIA shipments by accelerator family through late 2026, with GB200 and later B300/GB300 gaining share as H100/H200 fade UBS chart. The mix implies continued supply-chain pressure shifting toward Blackwell-class parts and previews when next-gen Rubin CPX enters the curve.
Google Cloud ascends on AI; Alphabet guides $91–$93B 2025 capex and signals larger 2026 build
Alphabet's cloud arm has flipped from laggard to growth driver on AI demand, with management guiding $91–$93B 2025 capex and warning of an even bigger 2026 build Reuters analysis. Google's strategy leans on TPUs opened to external labs, signing nine of ten leading AI shops and anchoring future AI workload siting.
Meta stock falls 11% as 2025 AI capex lifted to $70–$72B; investors question near-term ROI
Despite beating Q3 estimates, Meta's shares dropped 11% after it raised 2025 capex to $70–$72B to pursue superintelligence, with even larger outlays signaled for 2026 CNBC summary. The reaction underscores market sensitivity to open-ended AI spending plans absent concrete service monetization timelines.
Michigan officials detail Stargate jobs and environmental protections for OpenAI campus
Governor Whitmer's office frames the Stargate project as the state's biggest single investment, citing 2,500 union construction jobs, ~450 on-site roles, a closed-loop cooling system, and no Great Lakes water draw Local coverage. The permitting-friendly footprint and community funds attached to the project illustrate how AI campuses are negotiating local acceptance.
RPO and depreciation math split AI capex into two cycles: near-term contracted vs speculative builds
Financial Times analysis highlights diverging contract quality and unit economics: Microsoft's ~$400B RPO with ~2-year duration converts faster to cash, while others carry longer, lumpier exposure; rising D&A (e.g., to ~16.8% of revenue) tightens margin control as short-lived AI gear fills data centers FT analysis. The result is a short-cycle, backlog-anchored boom alongside a longer-cycle speculative build that assumes future demand.
Builder tooling: coding agents and research assistants
Big day for agent/dev tools outside Atlas: Cline's native tool calling and approvals, Claude Code's installer + update, Opera's deep research, Kimi CLI with MCP, and Vercel Agent investigations. Excludes the Atlas Feature.
Codex CLI v0.53 adds experimental Windows filesystem/network sandbox
OpenAI's Codex CLI v0.53 introduces a highly experimental Windows sandbox for workspace-scoped writes and controlled networking, with an on-request approval mode and a known caveat for world-writable folders sandbox brief, and GitHub discussion. This ships days after the prior CLI update that focused on undo and stability.
Claude Code v2.0.31: Vertex web search, Shift+Tab on Windows, and MCP fixes
The 2.0.31 release updates the Windows mode-switch to Shift+Tab, adds Web Search on Vertex, honors the VS Code .gitignore by default, and fixes subagent/MCP tool-name conflicts, compaction errors, and plugin uninstall behavior changelog.
Small ergonomics like /compact reliability and duplicate-summary fixes target long-running agent threads changelog.
Kimi CLI tech preview: shell UI with command exec, Zsh integration, and MCP
Moonshot released Kimi CLI (technical preview), a terminal-native coding agent featuring a shell-like UI, direct command execution, seamless Zsh integration, MCP support, and an Agent Client Protocol for broader tooling feature brief.
This lowers friction for agent-assisted coding and automations directly from the console feature brief.
Vercel Agent adds automated "Investigations" for incidents; $100 credit for new users
Vercel Agent can now auto-detect anomalies and run AI-driven investigations that correlate telemetry and propose remediation steps, aiming to cut MTTR for production issues; new users get $100 in credits blog post, and Vercel blog. This pushes agentic ops beyond static alerts toward root-cause analysis as a built-in workflow.
FactoryAI Droid can import Claude agents directly from .claude/agents
Droid now supports "Import from Claude (.claude/agents)", making Claude agents portable into Droid's runtime without re-authoring feature screenshot.
This shrinks setup time for teams standardizing on Claude Skills while experimenting with alternative orchestrators.
LangChain earns AWS Generative AI Competency; LangSmith now on AWS Marketplace
LangChain joined AWS's Generative AI Competency program and listed LangSmith on AWS Marketplace, enabling agent-engineering workflows (tracing, evals, deployments) with ISV Accelerate alignment for co-sell partner update.
The move eases procurement and governance for teams standardizing on Bedrock, SageMaker, and AWS data services.
LlamaIndex ships native MCP search so coding agents can query its docs directly
LlamaIndex added a native MCP search endpoint for its documentation, letting MCP-enabled coding agents call search tools directly (no custom glue), which simplifies agent builds that need API-accurate context docs update. This pairs well with editor agents that plan, retrieve, and cite within the same run.
Ollama v0.12.8 boosts Qwen3-VL and engine stability; desktop adds reasoning-effort control
Ollama 0.12.8 improves Qwen3-VL performance (FlashAttention default, better transparency handling) and engine prompt processing; Windows now ignores unsupported iGPUs release notes, and GitHub release. The desktop app also exposes per-chat "reasoning effort" selection to trade speed vs depth desktop UI.
Opera rolls out Deep Research Agent in Neon for long-form web analysis
Opera launched ODRA (Opera Deep Research Agent) in the Opera Neon browser, packaging sourcing, summarization, and deeper multi-page analysis as a built-in research assistant feature brief. This puts an agentic researcher directly into a mainstream browser without extensions, useful for competitive/market scans and literature reviews.
Perplexity launches "Patents" agent for IP research, free in beta to subscribers
Perplexity rolled out a Patents agent that structures and searches IP corpora as a guided research workflow, available free in beta for subscribers feature recap. It's a targeted assistant for prior-art checks and technology landscaping inside a familiar research UX.
Models: "thinking" Qwen and multimodal Nemotron on vLLM
Selective model updates relevant to builders: Qwen3 Max Thinking hits arenas and Nemotron Nano 2 VL arrives on vLLM. Runtime-only updates (e.g., Ollama engine) live in Systems, not here.
Qwen3 Max Thinking appears in LM Arena, signaling release
The "thinking" variant of Qwen3 Max surfaced in LMSYS Arena, with community posts indicating rollout is underway and broader evals imminent Arena update, release note, release hint. In context of Ollama Qwen3-VL, which added the VL lineup locally, this brings Qwen's reasoning-first tier into public head-to-heads.
Expect rapid informal benchmarking across math, coding, and agent workflows as Arena datapoints accumulate; an earlier heads-up also flagged "within hours" timing for the drop release tease.
vLLM adds NVIDIA Nemotron Nano 2 VL (12B) for video and document intelligence
vLLM now serves NVIDIA's Nemotron Nano 2 VL, a 12B hybrid Transformer-Mamba VLM with 128k context and Efficient Video Sampling to cut redundant tokens on long videos, aimed at faster, accurate multimodal reasoning over multi-image docs and video integration post, vLLM blog. Builders get an enterprise-ready path to high-throughput VLM agents, with weights offered in BF16/FP8/FP4-QAD formats and strong results on MMMU, MathVista, AI2D, and OCR-heavy tasks as outlined in the release.
Interoperability: MCP workflows and agent imports
MCP-centric moves to wire tools and agents together. Focus is on cross-tool interoperability; implementation-specific IDE features sit in Tooling.
LlamaIndex adds native MCP search endpoint for agent tooling
LlamaIndex rolled out a native MCP search endpoint so agent runtimes can call LlamaIndex-backed search tools directly, with docs live for builders MCP search docs. The move lowers glue code and standardizes search access across MCP-compatible IDEs and orchestrators, following Replit templates that made MCP server deployment a one-minute task.
This should simplify wiring retrieval into code assistants and research agents without bespoke adapters, and helps converge on MCP as the default interop surface for tool calls.
Claude Code v2.0.31 ships MCP subagent stability fixes
Anthropic's Claude Code v2.0.31 fixes an MCP edge case ("Tool names must be unique") that broke some subagent setups, alongside plugin uninstall and compaction fixes Changelog details. A weekly roundup also highlights resumable subagents and a new Plan subagent that can pair with MCP tools Weekly roundup.
For interop-heavy projects, the MCP bugfix unblocks multi-tool agent stacks and reduces brittle behavior when wiring several MCP servers into a single plan.
FactoryAI Droid can now import Claude agents directly
FactoryAI added "Import from Claude (.claude/agents)" to Droid, letting teams load Claude-built agents directly into Droid sessions for reuse and extension Import menu screenshot. This reduces migration friction between ecosystems and encourages agent portability across stacks.
Practically, this makes Claude-defined workflows first-class citizens inside Droid without re-authoring skills or tools, speeding cross-tool experimentation.
Kimi CLI tech preview lands with MCP and Agent Client Protocol support
Moonshot released a Kimi CLI technical preview that combines a shell-like UI, command execution, and Zsh integration with MCP server support and the Agent Client Protocol, positioning the CLI as a hub for interoperable tool use Kimi CLI announcement.
For agent builders, native MCP in a terminal workflow means faster local prototyping of toolchains, easier testing of server capabilities, and portability across agent runtimes that speak MCP.
CopilotKit + LangGraph demo predictive state updates with human-in-the-loop sync
CopilotKit showcased "predictive state updates," wiring its real-time UI to LangGraph agents so edits flow as structured workflows (agent rewrites → human approval → live sync) rather than linear text diffs Workflow post. This pattern makes collaborative agent edits feel native while keeping humans in control of final changes.
For engineers stitching tools, it's a practical recipe for interop between an orchestrator (LangGraph), UI state, and agent tool calls, useful where MCP tools and non-MCP services coexist.
Enterprise adoption and partnerships
Signals of commercialization: Perplexity's Getty deal for licensed images, LangChain's AWS competency/Marketplace path, and Figma's Weavy acquisition for AI media pipelines.
Amazon lights up Indiana AI campus for Anthropic with >500k Trainium 2 chips and 2.2 GW plan
Amazon has activated its largest AI data center for Anthropic in New Carlisle, Indiana, running over 500,000 Trainium 2 chips and scaling to 30 buildings with a planned 2.2 GW draw news brief, following up on the initial build that outlined a 0.5–1.0M Trainium target this year.
The dedicated campus underscores deep, long-term buyer-supplier alignment between a hyperscaler and a frontier lab, with material implications for model training capacity and cost curves.
Perplexity signs multi-year Getty Images license to display credited photos in AI search
Perplexity struck a multi-year licensing deal with Getty Images so its AI answers can show licensed editorial and creative photos with credits and links, a notable move toward "properly attributed consent." Getty shares jumped roughly 45–50% on the news deal coverage.
The agreement formalizes image rights for AI search and follows Perplexity's publisher rev-share program; together they point to a paid-content supply chain for AI results.
Figma buys Weavy and unveils "Figma Weave" for AI media generation pipelines
Figma acquired Tel Aviv-based Weavy and introduced the "Figma Weave" brand, bringing a node-based canvas that chains multiple AI models to generate and edit images/video with granular layer-level controls; Weavy will run standalone initially before deeper Figma integration deal summary.
The move positions Figma to own more of the AI media workflow (prompting, lighting, angles, compositing) inside a designer-friendly canvas.
LangChain earns AWS Generative AI Competency; LangSmith now on AWS Marketplace
LangChain joined AWS's Generative AI Competency program and listed LangSmith on AWS Marketplace, with ISV Accelerate eligibility and "Deployed on AWS" status, giving enterprises a vetted, procurement-friendly path to agent engineering (tracing, evals, deployments) partner badge post.
Framework-agnostic positioning means teams can adopt LangSmith with or without LangChain/LangGraph, while plugging into Bedrock, SageMaker, S3, OpenSearch, and more.
Modal partners with Datalab to scale Marker OCR pipelines with ~10× throughput on GPUs
Modal and Datalab teamed up so developers can deploy Marker + Surya OCR on GPUs in minutes, with cached weights and autoscaling that deliver roughly 10× higher parsing throughput; a hosted API backed by Modal is also available for maximum throughput partnership post, and the setup is documented in Modal's guide Modal blog post.
This brings a deterministic, hallucination-free document intelligence stack into an elastic, production-ready runtime.
Systems: sandboxes and local runtimes
Serving/runtime engineering updates: Codex's Windows sandbox for safer agent runs and Ollama engine/desktop improvements for practical local workflows.
Codex CLI v0.53 adds experimental Windows sandbox for safer agent runs
OpenAI introduced an experimental filesystem and network sandbox on Windows that confines agent actions to a workspace with on-request approvals, bringing tighter guardrails to Codex runs. Following up on the v0.52 update that focused on stability, this release outlines a workspace-write mode and flags, plus a key caveat: writes remain possible in directories where the Windows Everyone SID already has write permission. See setup flags and limitations in the docs sandbox flags, and the live docs and call for feedback via the GitHub page and discussion thread GitHub docs, testing call.
Ollama v0.12.8 boosts local Qwen3-VL with FlashAttention and engine fixes
Ollama shipped v0.12.8 with Qwen3-VL performance upgrades (FlashAttention enabled by default), faster prompt processing, and engine fixes such as better handling of transparent images and ignoring unsupported integrated GPUs on Windows. Release notes also mention app fixes like properly stopping a model before removal and correcting DeepSeek thinking toggles in the new desktop app release notes, with full details in the changelog GitHub release.
Northflank microVMs help scale secure production sandboxes during heavy launch traffic
cto.new reports moving to Northflank's microVMs to scale secure agent sandboxes through a surge, citing per-second billing, API-driven provisioning, and thousands of daily container deployments without performance hits. The case study highlights a pragmatic path to isolate workloads and smooth spiky demand for agent workflows case study post, with deployment details in the provider write-up Northflank blog.
Ollama desktop adds per-chat "reasoning effort" and model picker controls
The new Ollama desktop UI exposes a per-chat "reasoning effort" selector (e.g., Medium) alongside model choice, letting users trade latency and accuracy on the fly without leaving the conversation. This is a practical knob for local runs when switching between lightweight and more deliberate modes, captured in the updated toolbar screenshot desktop UI screenshot.
Safety, abuse and rights
Policy and threat-intel notes: music rights groups align on AI registration rules; a separate post shows automated botnet detection in production. Sandbox tech lives in Systems.
ASCAP, BMI, SOCAN align on registering partly AI-made songs; pure-AI works remain ineligible
North America's three major PROs will now accept registrations of musical works with meaningful human authorship that incorporate AI-generated elements, while works created entirely by AI remain ineligible. The groups also reiterate that training on copyrighted music without authorization is infringement and point to ongoing lawsuits against AI firms Policy overview.
- Policies center human authorship as the basis for rights while creating a path to credit and payment when AI tools are used in production Policy overview.
Vercel BotID auto-blocks sophisticated botnet in ~5 minutes after 500% traffic spike
Vercel says its BotID Deep Analysis detected a sudden 500% traffic surge from a coordinated bot network, identified ~40–45 spoofed browser profiles rotating through proxy nodes, and automatically re-verified and blocked the sessions within about five minutes, with no customer action required Incident report, Vercel blog.
- The system flagged human-like fingerprints and behavior, then used correlation across browser profiles and proxies to classify the attack before enforcing blocks Vercel blog.
Training recipes: precision, adapters, and looping
Practitioner debates and papers on training and reasoning: FP16 vs BF16 for RL fine-tuning stability, zero-latency fused adapters, and ByteDance LoopLM tradeoffs.
Engineers push FP16 over BF16 in RL fine-tuning to cut train/infer divergence
Practitioners argue FP16's 10 mantissa bits (vs BF16's 7) reduce policy drift between training and inference in RL fine-tuning by improving numerical agreement of kernels and absorbing rounding noise practitioner thread. The same thread later corrects the plot source while keeping the core claim intact, underscoring rising interest in precision choices for stability plot correction, with others signaling imminent switches to FP16 in production training loops engineer comment. See the linked paper thread cited in the discussion for additional context on precision trade-offs ArXiv paper.
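The mantissa-bit gap is easy to see directly. A standalone sketch (not from the thread) that round-trips the same fp32 value through each format: fp16 keeps a small perturbation that bf16 rounds away entirely.

```python
import struct

def to_fp16(x: float) -> float:
    """Round-trip an fp32 value through IEEE half precision (5 exponent / 10 mantissa bits)."""
    return struct.unpack(">e", struct.pack(">e", x))[0]

def to_bf16(x: float) -> float:
    """Round an fp32 value to bfloat16 (8 exponent / 7 mantissa bits), round-to-nearest-even."""
    bits = struct.unpack(">I", struct.pack(">f", x))[0]
    bits = (bits + 0x7FFF + ((bits >> 16) & 1)) & 0xFFFF0000  # round, then truncate low 16 bits
    return struct.unpack(">f", struct.pack(">I", bits))[0]

x = 1.0 + 2**-9          # representing this exactly needs mantissa bit 9
print(to_fp16(x) == x)   # True: fp16's 10 mantissa bits hold it
print(to_bf16(x) == x)   # False: bf16's 7 mantissa bits round it away to 1.0
```

In a rollout, thousands of such per-token roundings accumulate into exactly the train/infer probability mismatch the thread describes.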
Samsung's zFLoRA fuses adapters for zero-latency fine-tuning
Samsung Research introduces zFLoRA, a fused low-rank adapter that merges adapter weights into base layers, effectively eliminating the extra matmuls and memory traffic that make classic LoRA slower (LoRA can add up to ~2.5× prefill and ~1.6× decode latency) paper abstract. Results across 18 tasks on 1B/3B/7B models show accuracy comparable to LoRA and near full FT, with latency measured on H100 GPUs and NPUs remaining close to base model runtime paper abstract.
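The underlying identity is the standard LoRA weight merge; a generic numpy sketch (not zFLoRA's specific fused-block design) shows why folding the low-rank update into the base weight removes the extra matmuls at inference.

```python
import numpy as np

rng = np.random.default_rng(0)
d, r = 64, 8                        # hidden size, adapter rank
W = rng.standard_normal((d, d))     # frozen base weight
A = rng.standard_normal((d, r))     # low-rank down-projection
B = rng.standard_normal((r, d))     # low-rank up-projection
x = rng.standard_normal((1, d))

# Classic LoRA forward: two extra matmuls on every call.
y_lora = x @ W + (x @ A) @ B

# Fused adapter: fold the low-rank update into the base weight once,
# so inference runs at base-model cost with a single matmul.
W_fused = W + A @ B
y_fused = x @ W_fused

print(np.allclose(y_lora, y_fused))  # True: identical outputs, no added latency
```

The merge is exact in full precision; the paper's contribution is making this fusion practical within the base layers while keeping LoRA-level accuracy.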
ByteDance's LoopLM Ouro trades recurrence for depth; small models gain, no extrapolation beyond T=4
Ouro 1.4B/2.6B repeatedly applies the same transformer stack for T recurrent steps (trained at T=4) over 7.7T tokens, learning multi-hop tasks with fewer examples and adding a learned early-exit gate for easier inputs analysis thread. The trade-offs: 4× FLOPs at T=4 inference, no accuracy gains when pushing recurrence beyond the trained depth, and standard untied-depth transformers win in compute-matched comparisons, though LoopLMs look strong per-parameter and under memory/KV constraints analysis thread.
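Schematically, the looped forward pass looks like the following toy numpy sketch (made-up gate weights and a tanh stand-in for the transformer stack, not Ouro's architecture): one shared block is applied up to T times, and a learned gate may exit early on easy inputs.

```python
import numpy as np

rng = np.random.default_rng(0)
d, T = 32, 4                                      # hidden size, trained recurrence depth
W = rng.standard_normal((d, d)) / np.sqrt(d)      # ONE shared block (stand-in for the stack)
gate_w = rng.standard_normal(d)                   # hypothetical early-exit gate weights

def looped_forward(h, max_steps=T, exit_threshold=0.9):
    """Apply the same weights each step; a gate can stop early for easy inputs."""
    for t in range(max_steps):
        h = np.tanh(h @ W)                        # parameter sharing: identical W every step
        p_exit = 1.0 / (1.0 + np.exp(-h @ gate_w))  # sigmoid gate probability
        if p_exit > exit_threshold:
            return h, t + 1                       # early exit: fewer FLOPs spent
    return h, max_steps

h, steps = looped_forward(rng.standard_normal(d))
print(steps)  # between 1 and T: compute spent scales with input difficulty
```

This makes the trade-off concrete: parameters stay fixed while FLOPs scale with T, and nothing in the sketch would generalize to depths beyond those seen in training.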
CISPO RL loss fixes clipping-induced CoT collapse, enabling longer reasoning chains
Authors recount how off-policy PPO clipping suppressed low-probability "thinking tokens" (e.g., "wait," "but," "let me"), stunting chain-of-thought growth; CISPO restores gradient flow when advantages are positive while retaining stability, leading to on-policy-like length gains without divergence origin thread. A unified formulation that covers REINFORCE and PPO is presented, with reports of near-R1 performance on Qwen2.5-32B in internal runs and detailed derivations of the masking and clipping behavior math details, Zhihu post.
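An illustrative numpy sketch of the clipping behavior (a schematic of per-token gradient weights, not the paper's exact loss): under PPO-clip with positive advantage, tokens whose importance ratio exceeds the clip bound contribute zero gradient, while a CISPO-style loss keeps a bounded, nonzero weight on every token.

```python
import numpy as np

eps = 0.2
ratios = np.array([0.3, 0.9, 1.0, 1.5, 3.0])  # pi_new/pi_old per token; large = rare token boosted
adv = 1.0                                     # positive advantage (a "good" trajectory)

# PPO-clip with adv > 0: once a ratio passes 1+eps, the clipped branch is active
# and the token's gradient weight is zero -- silencing the spiking "thinking" tokens.
ppo_w = np.where(ratios <= 1 + eps, ratios * adv, 0.0)

# CISPO-style weight: clip the importance ratio but treat it as a stop-gradient
# constant multiplying grad log-prob, so every token keeps bounded gradient flow.
cispo_w = np.clip(ratios, 1 - eps, 1 + eps) * adv

print(ppo_w)    # high-ratio tokens drop out entirely
print(cispo_w)  # all tokens retain a bounded, nonzero weight
```

The difference is small in code but large in training dynamics: the zeroed entries are exactly the rare connective tokens the authors say chain-of-thought growth depends on.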
Agent data: RAG retrievers and high-throughput parsing
New retrieval assets and parsing infra: NVIDIA's Nemotron RAG family, Datalab Marker on Modal GPUs, and a patents-focused agent from Perplexity, free in beta for subscribers.
Marker on Modal GPUs delivers ~10× document parsing throughput
Modal and Datalab launched a turnkey deployment for the Marker + Surya OCR stack: cache weights, spin up on GPUs in under five minutes, and autoscale to handle spikes, yielding roughly 10× higher throughput for structured document extraction versus CPU baselines Collab note, and Blog post. Teams that don't want to self-host can also use Datalab's hosted Marker API, which runs on Modal's GPU backend for maximum throughput Hosted API note.
NVIDIA posts Nemotron RAG collection with text, multimodal, layout and "Omni" retrievers
NVIDIA released a suite of retrieval models on Hugging Face covering text retrievers, multimodal retrievers, layout detectors, and new "Omni" retrievers that span image, text, and audio, licensed for commercial use, making them drop-in building blocks for RAG systems Model roundup, and Hugging Face collection. The "Omni" variants broaden modalities for retrieval pipelines, useful for enterprise document and media search Omni retrievers.
OpenRouter launches cross-provider embeddings directory
OpenRouter introduced a browsable catalog of embedding models across providers, useful for search, reranking, and vector-DB pipelines, exposing pricing, limits, and quick filtering in one place Release note, and Model directory. The listing makes it easier to trial alternatives without provider lock-in Browse page.
Perplexity debuts "Patents" agent for IP research
Perplexity added a patents-focused agent that streamlines intellectual property research workflows, with advanced capabilities available free during the beta for subscribers Feature note. The move expands RAG-style retrieval into structured patent corpora for due-diligence and competitive analysis.
Evals and capability tracking
Measurement items outside the Atlas Feature: corrected GPT-5 scoring deltas and a quarterly landscape showing GPT-5 (high) retaking the top spot. No other model launch repeats here.
EpochAI fixes GPT-5 scoring bug; "high" now edges "medium", tie on ECI
EpochAI corrected an Inspect evaluations bug that was silently forcing GPT-5 calls set to "high" reasoning down to "medium." Updated runs show GPT-5 (high) slightly ahead of GPT-5 (medium) on several benchmarks, while the two are now tied on the Epoch Capabilities Index. See benchmark bars and error bars in the update corrected scores. The root cause was an outdated Inspect version that ignored the "reasoning effort" parameter for OpenAI models unless the name began with "o" (e.g., o3); upgrading Inspect fixed it bug cause.
- Notable deltas: OTIS Mock AIME 2024–2025 (~92% vs ~87%), GPQA Diamond (~85% vs ~83%), FrontierMath T4 (~13% vs ~9%) corrected scores.
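This bug class is worth internalizing for anyone maintaining eval harnesses. A hypothetical reconstruction (illustrative Python, not Inspect's actual code) of a prefix-gated parameter that silently drops for newer model names:

```python
def build_request(model: str, reasoning_effort: str) -> dict:
    """Hypothetical sketch of the failure mode: a name-prefix gate written for
    o-series models (o1, o3, ...) silently drops the setting for gpt-5, so the
    API falls back to its default effort with no error raised."""
    req = {"model": model}
    if model.startswith("o"):          # outdated gate: never matches "gpt-5"
        req["reasoning_effort"] = reasoning_effort
    return req

print(build_request("o3", "high"))     # setting forwarded as intended
print(build_request("gpt-5", "high"))  # setting silently discarded -- the bug
```

Because nothing fails loudly, the only symptom is a benchmark score that quietly matches the default configuration, which is why the error went unnoticed until the deltas were compared.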
Quarterly State of AI: GPT-5 (high) leads; US and China dominate model releases
Artificial Analysis' latest quarterly landscape shows GPT-5 (high) retaking the top spot on their intelligence index, with big tech pushing across modalities while smaller challengers specialize. The report also highlights U.S. and China dominance in new model releases, with relatively few entrants from elsewhere report highlights, website report.
- Modality spread: incumbents build across text, vision, audio, and agents; challengers focus on niche strengths report highlights.
Research: computer use, decoding, memory and video reasoning
Fresh papers beyond training recipes: Surfer 2 cross-platform computer-use agents, AutoDeco end-to-end decoding control, geometric memory in sequence models, and video zero-shot reasoning limits.
Surfer 2 unifies web/desktop/mobile computer-use agents, beating prior systems
A new paper introduces Surfer 2, a single agent architecture that generalizes computer use across the web, desktop, and mobile while outperforming earlier systems on accuracy and task completion paper abstract.
Following the Copilot boost that sandboxed Windows 365 computer use, this result offers a research baseline for cross-platform action grounding and UI policy learning with stronger generalization than prior single-environment agents.
AutoDeco lets LLMs learn their own decoding policy, moving beyond hand-tuned strategies
"The End of Manual Decoding" proposes AutoDeco, an architecture where a model learns to control its own decoding strategy, selecting sampling modes and constraints end-to-end, rather than relying on fixed heuristics (e.g., temperature, nucleus thresholds) paper screenshot.
The approach aims to reduce train-inference mismatch and brittle prompt-level tuning by integrating decoding choices into the learned policy itself; details include a controller that adapts decoding parameters based on context and objective feedback loops.
Transformers and Mamba memorize as geometry, solving 50K-node path queries in one step
A study finds deep sequence models (Transformers, Mamba) tend to form geometric memories: nodes in a knowledge graph embed so that multi-hop paths become near-one-step distance checks, reaching up to 100% accuracy on unseen paths in graphs with ~50K nodes paper first page.
The work shows competition between associative (lookup) and geometric representations, with a Node2Vec baseline learning an even cleaner geometry tied to the graph Laplacian; implications include faster multi-hop reasoning and more faithful retrieval without explicit chain-of-thought.
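A toy sketch of the core idea, under simplifying assumptions of my own (a cycle graph and its closed-form Laplacian eigenvector embedding, standing in for the learned geometry): once nodes sit in the right geometry, a multi-hop "how far apart are these nodes?" query reduces to a single embedding-distance comparison, with no hop-by-hop traversal.

```python
import math

def spectral_embed(i, n):
    # First nontrivial Laplacian eigenvectors of a cycle graph C_n place
    # node i on a circle; a toy stand-in for the geometry the paper
    # reports models (and Node2Vec) learning.
    theta = 2 * math.pi * i / n
    return (math.cos(theta), math.sin(theta))

def embed_dist(i, j, n):
    # One-step geometric read-out: plain Euclidean distance.
    (x1, y1), (x2, y2) = spectral_embed(i, n), spectral_embed(j, n)
    return math.hypot(x1 - x2, y1 - y2)

def hop_dist(i, j, n):
    # Ground-truth shortest-path length on the cycle (the associative,
    # lookup-style answer a model would otherwise have to chain through).
    d = abs(i - j) % n
    return min(d, n - d)
```

On the cycle, embedding distance is strictly monotone in hop distance, so ranking candidate nodes by nearness in embedding space exactly reproduces the shortest-path ranking; that monotone read-out is the sense in which multi-hop queries collapse to one-step distance checks.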
Video generators aren't zero-shot reasoners: MME-CoF scores land under 2/4, with failures on long chains
The MME-CoF benchmark tests text-to-video models (e.g., Veo-3 class) on 12 reasoning areas and finds they average below 2/4, handling short, locally constrained steps but failing on long-horizon logic, strict geometry, and causal constraints benchmark paper.
Evaluators report smooth clips that nonetheless break rules (miscounts, timing errors, clutter misses), underscoring a gap between visual fidelity and robust procedural reasoning in zeroโshot settings.
Creative AI: Halloween effects, music, and recipes
Large volume of creative items: Sora character clips, Minimax/Kling horror filters, ElevenLabs Music tools, and Gemini's Veo-based Halloween how-tos. This section corrals the non-dev media news.
Higgsfield drops 1080p Halloween horror pack with Minimax + Kling, free gens and credits promo
Higgsfield launched a seasonal set of 13 Minimax transformations and 4 Kling "nightmares" (werewolf, devil, raven transition and more) with 1080p output and limited-time free generations and credits giveaways inside the app feature rundown, free gens note. A dedicated landing page showcases one-click "Halloween presets" and global availability promo thread, with details and examples on the site Halloween presets.
ElevenLabs Music adds stem separation and in-painting, launches 24-hour Halloween radio and 50% promo
ElevenLabs rolled out Music stem separation and in-painting tools for granular remix control, alongside a one-day "Radio Eleven" Halloween station and a two-week 50% discount on Music plans feature rundown. The in-app radio is live for 24 hours with spooky remixes and spectral vocals radio announcement.
Sora's "Monster Manor" and character tools power Halloween shorts from creators
OpenAI highlighted a Halloween "Monster Manor" set in Sora and encouraged seasonal creations, while creators showcased multi-minute shorts using the new Characters feature in the Sora app Monster Manor, creator short, characters note. This follows credit packs, where OpenAI teased Characters coming to the web and paid Cameos; now the app experience is fueling steady "Soraween" posts Soraween post.
Gemini shares Halloween creation playbook: Veo 3.1 monsters, costume ideas, "animate nightmares" and invites
The Gemini team published a compact how-to thread for seasonal content: generate scary creatures with Veo 3.1, ideate costume looks, build full costume mockups, animate nightmare scenes, and auto-design party invites, all within the Gemini app and Studio how-to thread, Veo creature, costume ideas, animate nightmares, costume builder, party invites. A product overview page details image generation and editing (aka "Nano Banana") tips and prompt guidance Gemini image guide.
ChatGPT image generation shows year-over-year gains on Halloween costume kit prompt
A repeat prompt ("those bags that hold cheap costumes, but make the costumes really weird") produced sharper, more humorous packaging concepts, like "Sesame Loaf," "Beige Carpet Stain," and "Possessed CAPTCHA," suggesting improved visual wit and layout fidelity over the past year image examples.
ComfyUI hosts Wan 2.2 Animate live session with control and quality tips
ComfyUI ran a Halloween-day livestream on Wan 2.2 Animate covering practical knobs for motion control and output quality, with hosts breaking down the pipeline and sharing recipes for consistent results event announcement. A companion post links to the session and notes timing and hosts for on-demand viewing event replay.

