OpenAI GPT‑5.1 brings adaptive reasoning: 8 preset styles and 2 models cut prompt boilerplate.
Executive Summary
OpenAI has rolled GPT‑5.1 into ChatGPT, pairing a fast Instant model with a Thinking model that adapts how much reasoning each task gets.
The update adds eight preset styles plus warmth/emoji tuning, cuts prompt boilerplate, and quietly shifts GPT‑5 into a Legacy section as GPT‑4.1 begins retirement.
API access lands this week via gpt‑5.1‑chat‑latest and gpt‑5.1, so you can start routing real work to it.
Early data shows Thinking spends fewer tokens on tasks in the 10th–30th percentile range and more on the 70th–90th, rather than running a fixed-length reasoning chain.
Users report tighter instruction following (e.g., respecting style bans), less sycophancy than GPT‑4o, and more expressive writing; community sleuthing ties OpenRouter's "polaris‑alpha" to 5.1, which tops the Creative Writing v3 board.
Codex is picking up 5.1 too, with a gpt‑5.1‑codex model definition already merged; useful for agentic coding stacks that want a single planning brain across both IDE and CLI.
If you have questions about migration and the personalization quirks, an OpenAI AMA is scheduled for 2:00 PM PT today; bring real prompts and watch token spend before you flip the switch.
Feature Spotlight
Feature: GPT‑5.1 ships adaptive reasoning + personas
OpenAI rolls out GPT‑5.1 (Instant/Thinking) with adaptive reasoning and 8 preset personas; API this week (gpt‑5.1‑chat‑latest, gpt‑5.1). Big bump in instruction‑following and conversation quality for builders.
✨ Feature: GPT‑5.1 ships adaptive reasoning + personas
Cross‑account launch: OpenAI rolls GPT‑5.1 to ChatGPT with Instant/Thinking variants and richer personalization. Mostly model/UX improvements, early dev takes on instruction‑following, vision, tone. API lands this week.
OpenAI rolls out GPT-5.1 Instant/Thinking to ChatGPT, API this week
OpenAI has started rolling out GPT‑5.1 across ChatGPT with two variants—Instant as the new high‑volume default, and Thinking for heavier reasoning—initially to Plus, Team, Pro and Business users, with free and logged‑out access to follow and API access promised later this week as gpt-5.1-chat-latest (Instant) and gpt-5.1 (Thinking) using adaptive reasoning. OpenAI release OpenAI blog post
GPT‑5.1 Instant is positioned as the warmer, more conversational chat model that still runs at GPT‑5 Instant speeds, while GPT‑5.1 Thinking is the advanced reasoner that decides when to "think longer" before answering; both are trained on the same reasoning stack as OpenAI’s heavy thinking models. Sam Altman comment welcome thread OpenAI says it is serving a base of 800M+ ChatGPT users, which is why they’re pairing the new model with richer tone controls instead of a single default persona. fidji summary
In the model picker, GPT‑5.1 now appears with Auto/Instant/Thinking options while GPT‑5 has been pushed under a Legacy section that OpenAI says will remain for about three months, and GPT‑4.1 is beginning a gradual retirement from ChatGPT. model list ui legacy selector gpt4-1 retired Power users report that GPT‑5.1 is already live for most Plus, Team and Pro accounts. rollout status
For coders, this also builds on GPT‑5 Codex mini, where we saw GPT‑5‑Codex‑Mini double tokens/sec, as Codex now adds a gpt-5.1-codex model definition in its config and an OpenAI engineer confirms you’ll be able to route GPT‑5.1 through Codex once the API ships. codex model def Codex pull request codex availability
The point is: if you’re standardizing on OpenAI, GPT‑5.1 will very quickly become the new default surface—both for end‑user ChatGPT usage and for API‑backed apps—while older 5.x and 4.x models get pushed into a legacy corner you probably don’t want to depend on long term.
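When the API lands, the mechanical part of that migration should be small; a minimal routing sketch, assuming the new model IDs slot into the existing Chat Completions interface unchanged (nothing here is verified against a live endpoint yet):

```python
# Hedged sketch: assumes gpt-5.1-chat-latest (Instant) and gpt-5.1 (Thinking)
# behave like current chat models once the API ships; not verified against
# live endpoints.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def ask(prompt: str, heavy: bool = False) -> str:
    # Route quick interactive traffic to Instant, heavier multi-step work to Thinking.
    model = "gpt-5.1" if heavy else "gpt-5.1-chat-latest"
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content

print(ask("Summarize this changelog in two bullets."))
print(ask("Plan a phased migration off GPT-4.1 across three services.", heavy=True))
```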
Builders say GPT-5.1 feels faster, less sycophantic and better at writing
Early testers are pretty aligned that GPT‑5.1 feels like a quality‑of‑life upgrade: "creative writing style is a LOT better" builder notes, it’s "smarter, much stronger vision capability, and fast, really fast" vision comment, and it now pushes back on obviously bad ideas instead of cheerleading everything—Daniel Mac notes it "won't tell you that selling poop popsicles is a good idea," a case where GPT‑4o would have played along. sycophancy example
On the benchmarks side, OpenRouter’s Polaris‑alpha model—now confirmed to be GPT‑5.1—takes the top spot on a Creative Writing v3 emotional‑intelligence benchmark with an Elo score around 1748, ahead of other frontier models in pairwise matchups. polaris mapping
That matches the anecdotal "this just reads better" feedback from people using it for essays, fiction and UX copy. launch recap writing comment
Instruction‑following also looks tighter. Users report that GPT‑5.1 respects fine‑grained style constraints like "never use em dashes" via custom instructions, where GPT‑5 would routinely drift, and testingcatalog shows a ChatGPT UI screenshot emphasizing this behavior. instruction screenshot instruction followup That’s a big deal if you’ve been fighting the model over formatting, house style or code conventions.
A lot of this is tied to the new personalization layer that ships alongside 5.1: you can now pick from eight base styles—Default, Professional, Candid, Quirky, Friendly, Efficient, Nerdy, Cynical—and OpenAI is experimenting with sliders for warmth and emoji frequency for a subset of users. styles announcement warmth emoji test The help center clarifies that these personalities steer tone and structure but don’t change core capabilities or safety rules. personality docs
People are already stress‑testing those personas. Ethan Mollick shows that different presets produce meaningfully different advice, even down to "completely different breathing patterns" in guided‑relaxation scripts, and wonders how much they bias judgment vs. just phrasing. personality impact He argues that what he really wants is roles (e.g., critic, coach) you can invoke briefly, rather than a persistent "cynic" or "quirky" friend. roles vs personality Omar Moqbel's side‑by‑side personality screenshots underline that the same factual question (e.g., "explain a government shutdown") yields noticeably different structure and emphasis across Candid, Efficient, Nerdy and others. personality examples
From a builder’s perspective, the model also seems to have a nicer "taste" for UI and front‑end work: one dev had GPT‑5.1 generate a TikTok‑style infinite Wikipedia scroller and noted that its layout and styling were cleaner than previous models’ output. vision comment Others say 5.1 "feels closer to the GPT‑4o vibe" for chat if you didn’t love GPT‑5’s personality, while still being more capable. model feeling
The net effect: GPT‑5.1 isn’t a raw IQ leap so much as a better default assistant—snappier, more opinionated when it should be, better at long‑form language and UI polish, and finally giving you levers to tune how it talks without fighting system prompts on every request.
GPT-5.1 Thinking adds adaptive reasoning and “short CoT”
GPT‑5.1’s Thinking variant changes how it spends tokens: OpenAI’s own chart shows it using about 57% fewer tokens than GPT‑5 on the easiest 10% of queries and about 71% more tokens on the hardest 10%, effectively re‑allocating compute toward genuinely hard problems instead of padding easy answers. reasoning chart
Under the hood this is framed as "adaptive reasoning": instead of a fixed depth chain‑of‑thought, the model dynamically decides when to think longer, which shows up as shorter, more direct responses at low difficulty percentiles (10th and 30th) and much longer traces at the 90th percentile. OpenAI release rollout summary For builders, that means fewer wasted tokens on trivial queries and deeper exploration when you actually need it.
Developers also note that GPT‑5.1 brings explicit "short CoT" behavior—doing quick, lightweight reasoning instead of full verbose chains by default—while still being able to switch into full chain‑of‑thought when the task demands it. short cot note Aidan Mclaughlin even calls GPT‑5.1 Instant "the first chat model to use CoT" in this more controlled, fast‑reasoning way, signaling that chain‑of‑thought is no longer restricted to heavy, slow models. welcome thread
So what? If you’re paying per token, the new Thinking/Instant stack gives you a more efficient default: you keep the depth where it matters (math proofs, multi‑step planning, gnarly debugging) without having to manually toggle a "deep thinking" setting or accept gratuitous reasoning on simple questions.
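As a back-of-envelope illustration, the sketch below applies OpenAI's published decile deltas (57% fewer tokens on the easiest 10%, 71% more on the hardest 10%) to invented baseline token counts; the baselines and the flat middle bucket are assumptions, so whether your bill goes up or down depends entirely on your real traffic mix:

```python
# Back-of-envelope sketch: the per-bucket baseline token counts (and the flat
# middle bucket) are invented; only the -57% / +71% decile deltas come from
# OpenAI's GPT-5.1 Thinking chart.
BASELINE_TOKENS = {"easiest_10pct": 400, "middle_80pct": 1500, "hardest_10pct": 6000}
DELTA = {"easiest_10pct": -0.57, "middle_80pct": 0.0, "hardest_10pct": 0.71}
SHARE = {"easiest_10pct": 0.10, "middle_80pct": 0.80, "hardest_10pct": 0.10}

def expected_tokens(per_bucket: dict) -> float:
    # Traffic-weighted average reasoning tokens per query.
    return sum(per_bucket[b] * SHARE[b] for b in per_bucket)

before = expected_tokens(BASELINE_TOKENS)
after = expected_tokens({b: t * (1 + DELTA[b]) for b, t in BASELINE_TOKENS.items()})
print(f"avg reasoning tokens/query: {before:.0f} -> {after:.0f} ({after / before - 1:+.1%})")
```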
🏭 AI datacenters, power, and superfactory notes
Anthropic’s $50B US DC build and Microsoft’s Fairwater 2 ‘AI Superfactory’ set the bar for 2026 capacity; power shortfall chatter resurfaces. Excludes GPT‑5.1 model details (covered as Feature).
Anthropic commits $50B to US AI datacenters in Texas and New York
Anthropic announced a $50 billion build‑out of its own AI datacenters in Texas and New York, targeting custom facilities optimized for frontier model training and inference coming online from 2026, and creating about 800 permanent and 2,400 construction jobs in the US.anthropic announcement The build partners with Fluidstack and is explicitly framed as supporting the Trump administration’s AI Action Plan to keep US AI leadership and domestic compute capacity, signaling that Anthropic is joining the hyperscaler tier instead of relying purely on third‑party clouds.anthropic blog
For AI engineers and infra leads, the takeaway is that a formerly "model‑only" lab is now vertically integrating into power and racks, much like OpenAI’s Stargate and Google’s in‑house campuses.commentary thread That likely means steadier access to high‑end GPUs for Claude‑class models, more room for aggressive long‑context and agent workloads, and a new competitor in the race to sign large enterprise and government AI contracts that want sovereign, US‑based compute.dc recap Expect Anthropic’s own stack (Claude Code, Claude agents, safety research) to be tuned tightly to these sites’ characteristics—multi‑GW power envelopes, high rack density, and efficiency targets—rather than generic cloud assumptions, which may influence how their APIs behave at scale (throughput ceilings, region availability, and pricing tiers).
Analysts warn of 44 GW US AI power shortfall as DC spend tops new oil
A Morgan Stanley note circulating on X estimates the US could face a 44 GW power shortfall for AI datacenters by 2028, even as AI loads are still only a few percent of total grid demand today.morgan power chart In parallel, a TechCrunch write‑up of new IEA data says global datacenter investment in 2025 is projected at about $580 billion, already exceeding the roughly $540 billion going into new oil supply, and forecasts data‑center electricity use roughly doubling to ~945 TWh by 2030 with AI machines accounting for a big share of that growth.iea analysistechcrunch article
Taken together, these updates sharpen earlier concerns about AI’s energy footprint raised when AI DC capex was pegged near $300 billion and ~1% of US GDP AI DC capex. The new angle is that it’s not just spend any more; grid connection queues, transformers, and gas turbines are now cited as bottlenecks that may slow campus build‑outs or push more operators toward on‑site generation and small modular reactors.iea analysis Practitioners like Daniel Mac argue that model efficiency gains (tokens per joule) will partially offset this, but his own thread notes that leaders such as Altman, Zuckerberg, and Nadella publicly downplay the risk, raising the question of whether they know something about upcoming efficiency curves—or are just projecting confidence.morgan power chartleaders attitude
For infra and strategy teams, the practical implication is clear: long‑term AI roadmaps now need real power plans attached—PPAs, behind‑the‑meter generation, or siting near surplus renewables—rather than assuming the grid will absorb another 40‑plus gigawatts without pushback. If you’re choosing regions or partners for 2026–2028 training runs, the constraints described in these notes are as important as GPU roadmaps.
Microsoft details Fairwater 2 AI Superfactory and 100k+ GB300s this quarter
Microsoft and Satya Nadella used a Dwarkesh podcast tour of Fairwater 2 to pitch it as "the most powerful AI datacenter in the world," and disclosed that more than 100,000 GB300 accelerators are coming online this quarter just for inference across the fleet.fairwater interview An accompanying infographic highlights Fairwater as an "AI Superfactory" with multi‑gigawatt campuses, hundreds of thousands of GPUs per region, a closed‑loop liquid cooling system with zero additional water use, short copper runs across thousands of racks to cut latency, and an AI‑app‑aware WAN that routes traffic based on model needs.ai superfactory details

For builders, Fairwater signals that Microsoft’s answer to OpenAI’s Stargate and other giga‑campuses is now concrete: they are designing everything—from cable length to backbone topology—around large‑batch LLM and multimodal inference, not generic cloud VMs.nadella ai strategy Satya also emphasized in the same conversation that Microsoft has access to all of OpenAI’s accelerator IP, which helps explain why he seems relaxed about Maia lagging Google’s TPUs; the bet is that a dense, AI‑specific fabric plus IP sharing beats raw chip specs in real workloads.openai ip comment If you’re planning for 2026‑era capacity, this is a strong hint that Azure regions attached to Fairwater‑style campuses will be the place to find the most consistent supply of GB200/GB300‑class instances and the lowest tail latency for AGI‑scale apps.
🛠️ Agentic coding stacks and IDE workflows
New live eval sandboxes and agent SDK demos land; IDEs tighten loops. Excludes GPT‑5.1 launch (Feature); focuses on tooling and workflows builders can use now.
LMArena’s new Code Arena brings live agentic coding evals and a WebDev leaderboard
LMArena has launched Code Arena, a new environment where frontier models build web apps step‑by‑step while users watch their planning, file edits, and debugging in real time. It already powers a refreshed WebDev leaderboard where Claude Opus 4.1, Claude Sonnet 4.5, GPT‑5, and GLM‑4.6 are the current top performers on real app builds.launch thread

In Code Arena, models like Claude, GPT‑5, GLM‑4.6, and Gemini get a full tool belt (file system, dev server, etc.), then are judged head‑to‑head by Arena users on functionality, code quality, and design rather than synthetic benchmarks.launch thread This shifts evals toward the workflows devs actually care about: multi‑file scaffolding, refactors, and iterative debugging instead of single‑shot code snippets. The new WebDev board exposes per‑model Elo across many such battles, with Opus 4.1 currently on top, followed by Sonnet 4.5 (thinking), Sonnet 4.5, GPT‑5, and GLM‑4.6.leaderboard update You can jump straight into the coding arena at the dedicated page,Code Arena page or inspect the WebDev rankings and matchups on the leaderboard.Webdev leaderboard
For builders, this is a good place to sanity‑check your default coding model against serious alternatives on real apps, and to see where agentic patterns (planning tools, file diffs, retries) actually pay off versus raw completion strength.
Claude Agent SDK demo shows multi-agent deep research workflow with file-based handoff
Anthropic published a "Deep Research" demo for the Claude Agent SDK where a main agent fans out web research to multiple sub‑agents, saves notes to disk, then spawns a report‑writer agent to synthesize everything into a final deliverable.demo thread Instead of one giant prompt, the system treats research as a multi‑step pipeline with explicit tools and file IO.
In the reference flow, the top‑level agent decomposes your question into 2–4 subtopics, then launches parallel "researcher" subagents, each using web search tools and writing their findings into files/research_notes.demo thread Once those land, a report‑writer agent reads the files and produces a consolidated report in files/reports, which could just as easily be a slide deck or an interactive site if you swap the final skill.report synthesis The whole thing is open‑sourced so you can inspect the agent graph and adapt it.Claude agent repo
A nice detail is that files double as a communication fabric: subagents write notes, the main agent searches across them to check coverage, and only then asks a writer to summarize.files explanation That pattern scales well to other agentic coding tasks too—think multi‑repo audits, competitive analysis, or long‑running experiments where you want durable, diffable intermediate state instead of ephemeral context blobs.
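This is not the Agent SDK's actual API; purely as a sketch of the file-as-handoff pattern, with hypothetical run_researcher/write_report stand-ins where the demo would invoke subagents:

```python
# Pattern sketch only: run_researcher() and write_report() are hypothetical
# stand-ins for subagent calls; the real demo drives these via the Claude Agent SDK.
from pathlib import Path

NOTES_DIR = Path("files/research_notes")
REPORTS_DIR = Path("files/reports")

def run_researcher(subtopic: str) -> str:
    # Placeholder for a web-searching subagent; returns markdown notes.
    return f"# Notes on {subtopic}\n- finding 1\n- finding 2\n"

def write_report(notes: list[str]) -> str:
    # Placeholder for the report-writer agent reading all notes at once.
    return "# Consolidated report\n\n" + "\n---\n".join(notes)

def deep_research(question: str, subtopics: list[str]) -> Path:
    NOTES_DIR.mkdir(parents=True, exist_ok=True)
    REPORTS_DIR.mkdir(parents=True, exist_ok=True)
    for topic in subtopics:  # the real demo fans these out in parallel
        (NOTES_DIR / f"{topic.replace(' ', '_')}.md").write_text(run_researcher(topic))
    notes = [p.read_text() for p in sorted(NOTES_DIR.glob("*.md"))]
    out = REPORTS_DIR / "report.md"
    out.write_text(write_report(notes))
    return out

print(deep_research("state of agent evals", ["live arenas", "static benchmarks", "agentic coding"]))
```

Because the notes live on disk, the main agent can grep them for coverage gaps before the writer runs, which is exactly the durable, diffable intermediate state described above.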
Claude Code expands from desktop into web and iOS for Team and Enterprise
Anthropic is expanding the Claude Code research preview so Team and Enterprise customers can now run coding tasks directly from a browser or the Claude iOS app, instead of being limited to the terminal‑first desktop flow.release note That means more of your org can poke at AI‑assisted refactors and bugfixes without installing extra tooling.
Teams are leaning on Claude Code for bug fixes, refactors, test generation, documentation, and running tasks in parallel across multiple repos, which maps well to the "agent that lives next to your code" mental model.release note Because the web/iOS version still exposes filesystem and process tools (within a controlled environment), you can keep the same scripts and habits you developed on the desktop client, but from a laptop or phone. Early power users say they value that it already lets them ship serious projects rather than waiting for future features.dev reflection
If you want to try this in your own stack, the Anthropic Academy course catalog now includes free modules on Claude Code, the Claude API, and MCP basics,Anthropic course list which can shorten the learning curve for engineers who haven’t lived in the CLI yet.
Code Arena now underpins a community-voted WebDev leaderboard for coding agents
Beyond the launch itself, LMArena has wired its new Code Arena into a persistent WebDev leaderboard where developers vote on which model produced the better app, not just the right answer.leaderboard update Claude Opus 4.1 currently leads, followed by Claude Sonnet 4.5 (thinking‑32k), Claude Sonnet 4.5, GPT‑5, and GLM‑4.6 as the top five models.leaderboard update
Each matchup asks models to build or modify a real web application—React dashboards, clones, utilities—then the Arena community judges functionality, code quality, and UX in side‑by‑side comparisons.launch thread That gives you a more grounded sense of how these coding agents behave under realistic workloads than single‑file benchmarks. Code Arena will also be the engine for future WebDev tasks like multi‑file projects and React apps with more advanced tooling.future plans
If you’re picking a model for your own agentic coding stack, this leaderboard is a handy reference that bakes in design and maintainability, not only pass/fail stats. You can explore live battles or spin up your own sessions from the WebDev view.Webdev leaderboard
Factory 1.10 gives Droid programmable hooks and completion sounds for long agent runs
Factory’s 1.10 release makes its Droid coding agent feel more like a programmable coworker: you can now configure custom completion sounds and define "Droid hooks" that run arbitrary shell commands at key lifecycle points.release overview The update also adds an interactive MCP Manager, but the big workflow shift for coders is tighter, scriptable control around long‑running tasks.
With custom completion sounds, you can assign different audio cues per task type or project, so a background refactor or test run can ping you with a distinct sound when it’s done instead of silently finishing in a tab.sound description That small thing matters once you trust the agent enough to handle multi‑minute jobs while you focus elsewhere. Droid hooks go further: they let you attach shell commands to moments like "plan ready", "patch applied", or "run finished", enabling automatic formatting, permission tweaks, logging, or even guardrails (e.g. blocking grep and nudging the agent toward ripgrep).hooks explanation
The MCP Manager UI—summoned via /mcp—lets you browse connected servers, inspect tools, and OAuth into services like Sentry, Stytch, Context7, and Jam without editing config files.mcp manager details That’s more an orchestration quality‑of‑life feature, but in combination with hooks and sounds it nudges Droid toward a full "agentic dev shell" where you can both observe and steer its behavior programmatically.

Full details and config examples live in the 1.10 changelog if you want to wire this into your dotfiles or CI.Factory changelog
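Factory's exact hook payload and config format aren't shown in these posts, so treat the following as a hypothetical guard script a pre-command hook could shell out to, blocking on a non-zero exit (the grep-to-ripgrep nudge mirrors the example above):

```python
#!/usr/bin/env python3
# Hypothetical hook guard: Factory's real hook wiring and config format may differ.
# Reads the proposed shell command from argv, blocks plain `grep`, suggests ripgrep.
import sys

def main() -> int:
    cmd = " ".join(sys.argv[1:])
    if cmd.split()[:1] == ["grep"]:
        print("blocked: use `rg` (ripgrep) instead of `grep` for repo searches", file=sys.stderr)
        return 1  # non-zero exit signals the hook to block and the agent to rethink
    return 0

if __name__ == "__main__":
    sys.exit(main())
```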
RepoPrompt MCP lets Composer‑1 route complex work to GPT‑5 via your ChatGPT sub
A new demo shows Cursor’s Composer‑1 using the RepoPrompt MCP server to spin up a separate ChatGPT session and have GPT‑5 plan and implement a complicated feature, effectively turning your personal ChatGPT subscription into an on‑demand backend agent from inside the IDE.composer workflow Composer stays focused on orchestration, while GPT‑5 does the heavy thinking and file edits via RepoPrompt.
Following up on Codex provider, where RepoPrompt added a Codex CLI provider and surfaced model reasoning traces, this example highlights a different angle: using MCP as a bridge to external, stateful model UIs rather than only raw APIs. In the video, the developer asks Composer to use the RP Chat tool; RepoPrompt then coordinates calls out to ChatGPT (with GPT‑5 behind it), streams back plans, and applies diffs to the local repo.composer workflow Because RepoPrompt already handles big‑repo memory and context tricks from the CLI versions, you get that same scaling inside Cursor without re‑implementing it.Claude integration
Practically, this means you can treat GPT‑5 as a remote "senior engineer" that Composer or other agents can delegate to, while keeping your local editor as the single source of truth. It’s a pattern worth watching if you want to mix best‑of‑breed models without waiting for every IDE to support every API directly.

Zed 0.212 focuses on Git worktrees, Pull (Rebase), and settings deep-links
Zed shipped v0.212, adding a few deceptively handy workflow improvements: deep‑linkable settings, a git: worktree command for jumping between worktrees, and a Pull (Rebase) option wired straight into the Git panel’s fetch button.release video
Deep links mean you can now copy a direct URL for any setting and paste it in chat or docs; when someone clicks, Zed opens straight to that preference, which cuts a lot of “where is that toggle?” back‑and‑forth—useful if you’re standardizing how AI tools run across a team.release video The new git: worktree command lets you spin up and hop between multiple checkouts of the same repo from the palette, making stacked PRs and parallel experiments less painful.worktree feature
On the collaboration side, the Git panel’s fetch button now includes a Pull (Rebase) option, bringing a common but error‑prone Git flow into a single click.git panel change They’ve also relaxed debugger breakpoint configuration, so you can add conditional or logpoint breakpoints before even starting a debug session, which aligns well with agent‑assisted debugging workflows where the AI suggests instrumentation first.git panel change If you want the full list or a download link, it’s all in the stable release notes.Zed release notes
For AI‑heavy teams, none of these features scream "AI", but together they reduce friction around Git hygiene and shared config—which is exactly where agentic coding often falls over when everyone’s environment is subtly different.

🧩 MCP interoperability and stability trade‑offs
Hands‑on MCP management and a community debate on protocols: evolving servers, pull‑based access, and promptable skills. Excludes GPT‑5.1 (Feature).
Factory 1.10 adds in-editor MCP Manager for browsing and auth
Factory shipped version 1.10 of Droid with a new /mcp Manager that lets you browse connected MCP servers, inspect their tools, and authenticate via OAuth (Sentry, Stytch, Context7, Jam.dev, etc.) without touching config files. Factory 1.10 post mcp manager details

For teams experimenting with many MCP backends, this turns what used to be hand-edited JSON/YAML into a discoverable UI, making it easier to see what tools an agent actually has and to rotate credentials safely. The same release also adds custom completion sounds and Droid lifecycle hooks, but the MCP Manager is the big interoperability win because it reduces config drift between agents and backends and lowers the barrier to trying new MCP servers. release followup Factory changelog
MCP’s "no stability" philosophy clashes with code-level integrations
Simon Willison points out that MCP servers are designed to be read dynamically by LLMs on every call, so they make no long-term stability promises about their schemas or tool sets. stability comment In his words, this means you can "constantly change the MCP server" because the client is supposed to rediscover capabilities from the spec each time.
In the same thread he contrasts this with Anthropic’s new "code execution with MCP" pattern, where you write scripts that call MCP tools directly from Python; if servers change under you, those scripts break instead of the LLM adapting. code execution note Anthropic mcp blog Framed against the earlier community "Does MCP suck?" discussion, MCP debate this sharpens the trade-off: MCP as a live, evolving contract is great for LLM-driven orchestration, but brittle for long-lived automation code unless you add your own versioning and compatibility layer on top.
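To make the trade-off concrete, here is a hedged sketch assuming an already-connected MCP client session exposing list_tools()/call_tool() as in the official Python SDK (session setup and the weather tool are illustrative):

```python
# Sketch of the stability trade-off; assumes an already-connected MCP client
# session exposing list_tools()/call_tool() (session setup omitted).
async def pinned_call(session, city: str):
    # Code-level integration: hard-codes a tool name, breaks if the server renames it.
    return await session.call_tool("get_weather", {"city": city})

async def rediscovered_call(session, city: str):
    # LLM-style integration: re-read the tool list on every run and pick by intent.
    tools = await session.list_tools()
    candidates = [t for t in tools.tools if "weather" in t.name.lower()]
    if not candidates:
        raise RuntimeError("no weather-ish tool exposed by this server today")
    return await session.call_tool(candidates[0].name, {"city": city})
```

The second path tolerates server churn the way an LLM client does; the first is what you implicitly sign up for when you bake MCP calls into long-lived scripts without a versioning layer.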
Builders use Claude Code + Skills to auto-tune MCP tool prompts
Omar El Gabry describes a pattern where he uses Claude Code together with Anthropic’s Skills system to optimize MCP tool prompts using real logs, filesystem state, and memories. Claude skills usage Instead of hand-authoring tool descriptions and call patterns, he lets Claude analyze prior tool runs and then rewrite prompts and skills to reduce unnecessary calls and improve selection.
He argues that, with enough code generation and execution in the loop, you may not need fancy static algorithms for tool orchestration—agents can synthesize those heuristics on the fly from context. skills followup For teams leaning into MCP, this suggests a practical workflow: treat your MCP schemas and prompts as code that your coding agent continuously refactors, using Skills or similar mechanisms to load only the design and UX constraints it needs. Claude skills blog
Engineers question MCP’s pull-only model and lack of eventing
Thomas (thdxr) argues that MCP is fundamentally pull-based today: the agent has to consciously "think" to call a tool, even when the server has relevant new data ready. pull-based critique He compares this to language server protocols, where diagnostics are pushed to the client as they occur, rather than polled.
In follow-up comments he notes that this limits how effective MCP-backed agents can be, because useful context (errors, new events, changed state) never reaches the model unless the prompt explicitly asks for it—which many generic loops will forget to do. actor model comment The implied direction for protocol designers is to consider adding push/event channels or subscription-style tools so agents can react to changes, not just query them, without turning every tool into a bespoke pub/sub hack.
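Nothing like this exists in MCP today; purely as a thought experiment, a push-style surface might look less like polling and more like an agent loop awaiting a queue of server events:

```python
# Purely hypothetical: MCP has no push/event channel today. This sketches what a
# subscription-style surface could look like so an agent reacts instead of polling.
import asyncio
from dataclasses import dataclass

@dataclass
class ServerEvent:
    kind: str      # e.g. "diagnostic", "file_changed"
    payload: dict

async def fake_server(queue: asyncio.Queue):
    # In a real protocol extension the server would push; here we fake two events.
    await queue.put(ServerEvent("diagnostic", {"file": "app.py", "error": "unused import"}))
    await queue.put(ServerEvent("file_changed", {"file": "schema.sql"}))

async def agent_loop(queue: asyncio.Queue):
    for _ in range(2):
        event = await queue.get()  # the agent reacts to pushed context
        print(f"react to {event.kind}: {event.payload}")  # instead of remembering to poll

async def main():
    q: asyncio.Queue = asyncio.Queue()
    await asyncio.gather(fake_server(q), agent_loop(q))

asyncio.run(main())
```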
📊 Evals: live web‑dev arena, coding agents, and attitudes
Live, interactive evals step beyond static leaderboards. This section excludes GPT‑5.1 launch; focuses on head‑to‑head outcomes and eval UX.
Claude Opus tops new WebDev leaderboard as Code Arena goes live
LMArena has turned its new Code Arena into a live WebDev leaderboard, where models build real React/Next.js apps and get ranked by developers on functionality, quality, and design.webdev leaderboard The first board shows Claude Opus 4.1 in first place, followed by Claude Sonnet 4.5 (thinking‑32k), Claude Sonnet 4.5, GPT‑5, and GLM‑4.6, giving teams a more realistic signal than static coding benchmarks.webdev leaderboard

Instead of one‑shot code dumps, Code Arena records multi‑step agent runs (plan → scaffold → debug) and lets users watch traces, then vote head‑to‑head on which app feels better to use.arena launch video Arena notes this experience will now power its WebDev leaderboard long‑term and is already exposing a public page where you can browse battles and spin up your own matches.arena explainer webdev leaderboard For AI engineers, this is an easy way to see how frontier models behave on full‑stack tasks under identical constraints and to calibrate which one to route for real product work today.
MiniMax M2 hits 48% on KingBench but trails GLM‑4.6 in real coding tests
On the new KingBench leaderboard, MiniMax M2 scores 48% and lands at #12, with GLM‑4.6 slightly ahead on the same competitive coding tasks.kingbench numbers In a 45‑minute hands‑on test building a Next.js CRM app with a SQLite backend, M2 successfully scaffolded the project but required more iterations, over‑thought simple edits, and sometimes broke already working features; GLM‑4.6 produced a cleaner result with fewer back‑and‑forth cycles.crm app video field notes

The takeaway for engineers choosing an agentic coder is that headline pass rates don’t tell the whole story: GLM‑4.6 currently looks like the safer bet for end‑to‑end webapp work, while M2 is promising but still prone to disruptive refactors late in a session.field notes
Multi‑model “attitude” evals show AIs disagree wildly on the same idea
Ethan Mollick ran a tongue‑in‑cheek eval where multiple AIs independently rated the viability of a "drone guacamole delivery" startup on a 1–10 scale, then plotted 10 trials with 95% confidence intervals; the models produced sharply different scores and spreads, showing that systems don’t just vary in capability but also in how optimistic or skeptical they are.guac setup rating plot He argues in a new essay that you should be "interviewing" models for the job you need, rather than assuming benchmark winners will share your risk tolerance.blog post
In parallel, he and others have been poking at ChatGPT 5.1’s new preset personalities: the same prompt run through styles like Quirky, Candid, Efficient, Nerdy, or Cynical yields not just tone shifts but different breathing patterns, presenter roles, and advice emphasis.personality examples That raises hard questions about whether personality and custom instructions are quietly biasing the substance of answers, not just the voice, with Mollick asking for clarity on whether certain personas are more or less sycophantic or risk‑averse.personality concern role vs personality
For anyone deploying AI into decision loops, the point is simple: treat "model attitude" and persona settings as parameters to test and benchmark, not as purely cosmetic choices.
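Replicating the attitude test on your own prompts is cheap; a minimal sketch with placeholder scores (not Mollick's data) that runs the same 1–10 prompt several times per model and compares means with 95% confidence intervals:

```python
# Minimal replication sketch of the "attitude" eval: scores below are placeholder
# values, not Mollick's data. Run the same 1-10 viability prompt N times per
# model (or model x persona), then compare means with 95% confidence intervals.
import statistics as st

trials = {  # hypothetical scores from 10 runs of the same prompt
    "model_a": [3, 4, 3, 5, 4, 3, 4, 4, 3, 5],
    "model_b": [7, 8, 8, 6, 7, 9, 8, 7, 8, 7],
}

for model, scores in trials.items():
    mean = st.mean(scores)
    sem = st.stdev(scores) / len(scores) ** 0.5
    half = 1.96 * sem  # normal approximation for a 95% CI
    print(f"{model}: {mean:.1f} +/- {half:.1f} (95% CI)")
```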
Vals AI debuts comparison view and pushes “fluid” language benchmarking
Vals AI has rolled out a comparison view that lets teams evaluate two models side‑by‑side on their own prompts and grading rubrics, with an example screenshot contrasting Claude Sonnet 4.5 and ChatGPT‑5 on the same task.comparison screenshot The tool is aimed at orgs that care less about MMLU and more about, say, "which model writes a better compliance email for our policies".comparison app
They’re also hosting a session with Valentin Hofmann (Allen Institute / UW) on "Fluid Language Benchmarking", an approach where test difficulty adapts to a model’s capability so you can spot meaningful gains instead of overfitting to static datasets.event invite paper mention For AI leads and analysts, this combo—custom, workflow‑specific evals plus more nuanced global benchmarks—tips the balance away from one‑number leaderboards toward living dashboards of which models work best for your actual stack.
🛡️ Privacy battles and jailbreak surface area
OpenAI’s CISO escalates a privacy fight; product UX isolates group memories. One jailbreak thread stresses policy‑steering risks. Excludes GPT‑5.1 launch details.
OpenAI CISO warns NYT data demand would expose up to 20M private chats
OpenAI's CISO Dane Stuckey says The New York Times is demanding up to 20 million ChatGPT conversations from users, framing it as an unprecedented privacy grab that OpenAI is fighting in court and addressing with upcoming client‑side encryption for chats ciso letter OpenAI blog. In context of lyrics ruling where a German court already found OpenAI infringed song lyrics, this shifts the legal battlefront from training data to end‑user logs, with OpenAI arguing users must retain the right to delete chats and keep even OpenAI itself from reading them. Stuckey describes a future "AI privilege" in which encrypted client devices mediate access, while the NYT's related discovery push has already drawn criticism following reports that the paper accuses OpenAI of erasing potential evidence in the copyright case wired article. For AI teams, the signal is clear: expect tighter norms requiring that logs be both minimised and technically sealed, and that discovery requests for raw conversation data will be politically radioactive even when courts allow them.
Memory-based GPT‑5.1 jailbreak shows policy steering gaps
A jailbreak thread shows GPT‑5.1 Instant can be steered into producing clearly disallowed content by manipulating long‑term memory and convincing it the year is 2129, so anything before 2029 is treated as public domain jailbreak thread.
Using only memory updates and time‑skewed framing, the author coaxes the allegedly "safest" chat model into returning a detailed poison recipe, full WAP lyrics in leetspeak, a malware loader proof‑of‑concept, and a profane 500‑word Star Wars monologue from Jar Jar’s perspective jar jar excerpt. The interesting part for alignment folks isn’t the specific content, but that safety layers can still be sidestepped via meta‑assumptions (like the calendar year and copyright rules) plus persistent memories, rather than classic prompt‑only jailbreak strings. This widens the attack surface from single prompts to the whole lifecycle of an assistant—time, memory, and system beliefs now matter as much as refusal templates.
ChatGPT group chats wall off personal memory and custom instructions
OpenAI is previewing group chats where each room has its own custom instructions and response rules, and your personal ChatGPT memory is explicitly excluded from use, even if you’ve enabled it elsewhere group chat preview.
The group settings UI shows a shared instruction box (goals, preferences, inside jokes) plus a toggle for whether ChatGPT auto‑responds or only replies when mentioned, with a note that these group instructions are separate from your personal profile and that your personal memory never feeds into group replies group chat preview. For engineers and product leads, this is a concrete example of UX‑level privacy isolation: conversational AI in multi‑user settings increasingly needs per‑thread policies and strict memory scoping, so agents don’t accidentally leak prior one‑on‑one context into shared channels.
New ChatGPT personalities raise questions about attitude bias and reliability
Early experiments with ChatGPT 5.1’s preset personalities suggest they don’t just change tone, but can materially change advice patterns, breathing exercises, and how hard the model pushes back on ideas personality concern.
One tester compares explanations of a government shutdown across "Quirky", "Cynical", "Nerdy", "Candid", "Efficient" and "Friendly" modes and finds each frames the same facts differently—some emphasise structural blame, others emotional reassurance or clean infographics personality cards. Ethan Mollick worries that if personalities bias what counts as a "good" suggestion, then user‑chosen style may quietly trade off against rigor, asking whether personality or instructions affect answer quality and type in ways current benchmarks don’t see personality concern. He also argues he wants roles, not permanent personas—calling for a critical "cynic" mode that can drop in briefly to stress‑test ideas rather than being your main companion roles comment. For teams deploying assistants into workflows (education, coaching, product advice), this turns personality sliders into another alignment and evaluation surface: you may need to test not just models, but model×persona combinations before trusting them with decisions.
🧠 Reasoning training and RL reproducibility
Fresh training recipes target determinism and harder coding tasks. This is about training/runtime agreement and RLVR curricula; not general evals or model launches.
Tencent’s DRIVE recipe boosts RL code models with 2‑stage hard‑focused data curation
Tencent’s Hunyuan team introduces DRIVE, a data‑curation‑first recipe for reinforcement learning with verifiable rewards (RLVR) on competitive programming, showing that how you pick prompts matters more than fancy new RL tricks drive paper summary. They start from a Qwen2.5‑32B SFT model, tag problems by difficulty, then run RL in two stages: Stage‑1 trains on a broad uniform set with 8 rollouts per prompt and relatively short generation windows to expand entropy and reduce truncation; Stage‑2 switches to a small pool of the toughest problems, spending 64 rollouts per prompt under a “hard‑focus” curriculum that always keeps the most difficult instances drive paper summary.
On held‑out LeetCode, Codeforces weekly contests, and the LiveCode benchmark, this two‑stage scheme consistently raises pass@k, with the largest jump a 58% relative gain on Codeforces over the same‑size baseline drive paper summary. Ablations show that skipping either stage or mixing in easier problems during Stage‑2 hurts performance, and that the approach scales to a larger internal MoE model, hinting that RLVR for coding may be bottlenecked by curriculum design and prompt selection, not just model size second drive summary. For teams training code models with unit‑test rewards, the practical playbook is clear: separate broad exploration from hard‑focused exploitation, aggressively upsample your most difficult problems, and treat easy samples during late‑stage RL as harmful noise rather than cheap extra data.
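A schematic of that playbook, with uniform sampling and 8 rollouts in Stage 1 and a hard-only pool with 64 rollouts in Stage 2; the sampler below is illustrative rather than Tencent's implementation, and the difficulty tags are toy data:

```python
# Illustrative only, not Tencent's code: schematic of DRIVE's two-stage RLVR
# curriculum (broad uniform exploration first, then hard-focused exploitation).
import random

def stage1_batch(problems: list[dict], batch_size: int):
    # Stage 1: uniform sampling over the full tagged pool, 8 rollouts per prompt.
    picks = random.sample(problems, batch_size)
    return [(p, 8) for p in picks]

def stage2_batch(problems: list[dict], batch_size: int, hard_tag: str = "hard"):
    # Stage 2: keep only the hardest problems and spend 64 rollouts on each;
    # per the ablations, mixing easier items back in here hurts.
    hard_pool = [p for p in problems if p["difficulty"] == hard_tag]
    picks = random.sample(hard_pool, min(batch_size, len(hard_pool)))
    return [(p, 64) for p in picks]

problems = [{"id": i, "difficulty": random.choice(["easy", "medium", "hard"])} for i in range(200)]
print(len(stage1_batch(problems, 16)), "prompts x 8 rollouts in stage 1")
print(len(stage2_batch(problems, 16)), "hard prompts x 64 rollouts in stage 2")
```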
TorchTitan+vLLM hit bitwise‑consistent RL with KL=0.0 on Qwen3‑1.7B
vLLM’s team reports the first bitwise‑consistent on‑policy RL run where training and inference numerics match exactly, using TorchTitan for training and vLLM for serving a Qwen3‑1.7B model on GSM8K, yielding KL divergence of 0.0 between online and offline policies rlvr run details. In context of vLLM MoE efficiency, which focused on speeding up MoE inference, this new result tackles the other half of the stack: determinism and stability for RL fine‑tuning.
They make vLLM batch‑invariant (same sequence → same output regardless of batching), reuse the exact same fused kernels for training forward passes as for inference, and implement matching custom backward passes in PyTorch so gradients align with those kernels rlvr run details. On Qwen3‑1.7B + GSM8K, the bitwise mode converges faster and reaches higher reward than a non‑batch‑invariant baseline, at the cost of about 2.4× slower RL throughput rlvr run details. The roadmap calls out unifying model code, adding torch.compile support, performance tuning, and expanding beyond the current SiLU‑MLP and RMSNorm+residual fused ops, which should matter to anyone trying to debug RLVR runs or reproduce alignment results across labs.
For engineers, the takeaway is you can finally aim for deterministic RLVR without custom infra, but you’ll trade raw tokens/sec for reproducibility and easier debugging; if you depend on vLLM today, it’s worth tracking when these batch‑invariant kernels and backward ops become the default rather than an experiment.
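The consistency check itself is simple to express: collect per-token logprobs for the same sampled sequences from the training forward pass and from the sampler, then require exact agreement, which makes the empirical KL exactly zero. The tensors below are stand-ins, not TorchTitan or vLLM outputs:

```python
# Sketch of the trainer-vs-sampler consistency check; the tensors are stand-ins
# for per-token logprobs collected from the training forward pass (TorchTitan)
# and the sampler (vLLM) on the same sequences.
import torch

def policy_kl(sample_logprobs: torch.Tensor, train_logprobs: torch.Tensor) -> float:
    # Monte-Carlo estimate of KL(sampler || trainer) over the sampled tokens.
    return (sample_logprobs - train_logprobs).mean().item()

def bitwise_consistent(a: torch.Tensor, b: torch.Tensor) -> bool:
    # Stronger than torch.allclose: every element must match exactly.
    return bool(torch.equal(a, b))

sample_lp = torch.randn(4, 128)   # stand-in sampler logprobs
train_lp = sample_lp.clone()      # bitwise-consistent numerics reproduce them exactly
print("KL:", policy_kl(sample_lp, train_lp), "bitwise:", bitwise_consistent(sample_lp, train_lp))
```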
Meta’s “Path Not Taken” dissects how RLVR updates weights off principal directions
A Meta‑affiliated team analyzes how RL with verifiable rewards (RLVR) actually moves weights in large language models and finds that the apparent sparsity of RL updates is a surface illusion: across runs, updates cluster in the same parameter regions regardless of dataset or recipe rlvr theory overview. The paper, “The Path Not Taken: RLVR Provably Learns Off the Principals”, proposes a Three‑Gate Theory: (1) a KL anchor forces small, distribution‑preserving steps; (2) model geometry steers updates into low‑curvature, spectrum‑preserving subspaces off principal directions; and (3) finite‑precision hides micro‑updates in non‑preferred regions, making changes look sparse when they’re actually structured rlvr theory overview.
Empirically, RLVR yields gains with minimal spectral drift, little rotation of the principal subspace, and updates aligned with off‑principal directions, while supervised fine‑tuning (SFT) directly targets principal weights and distorts the spectrum, sometimes underperforming RLVR on reasoning‑style tasks rlvr theory overview. The authors caution that many parameter‑efficient fine‑tuning methods (sparse adapters, off‑the‑shelf LoRA variants) were designed to work with SFT dynamics and can behave poorly when naively ported to RLVR, as their case studies show degraded performance or unstable training rlvr theory overview. For anyone designing RLVR pipelines or shipping PEFT‑based alignment, the message is: treat RLVR as a different optimization regime, measure geometry (eigenvalues, subspace rotation) rather than only loss curves, and be wary of dropping in SFT‑era adapter recipes without checking how they interact with KL‑anchored, low‑curvature updates.
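If you want those geometry measurements on your own checkpoints, a small diagnostic sketch follows; the toy matrices and perturbation scales are arbitrary stand-ins for pre- and post-fine-tuning weights:

```python
# Diagnostic sketch: compare a weight matrix before/after fine-tuning.
# Measures (1) drift of the top singular values and (2) rotation of the
# top-k left singular subspace. Toy matrices stand in for real checkpoints.
import torch

def spectral_diagnostics(w_before: torch.Tensor, w_after: torch.Tensor, k: int = 8):
    U0, S0, _ = torch.linalg.svd(w_before, full_matrices=False)
    U1, S1, _ = torch.linalg.svd(w_after, full_matrices=False)
    spectrum_drift = (S1[:k] - S0[:k]).abs().mean().item()
    # Cosines of principal angles between the two top-k subspaces; overlap near 1.0
    # means the principal subspace barely rotated.
    overlap = torch.linalg.svdvals(U0[:, :k].T @ U1[:, :k]).pow(2).mean().item()
    return spectrum_drift, overlap

w = torch.randn(512, 512)
w_rl = w + 1e-3 * torch.randn(512, 512)    # small perturbation, RLVR-style
w_sft = w + 5e-2 * torch.randn(512, 512)   # larger perturbation, more spectral distortion
print("RL-like:", spectral_diagnostics(w, w_rl))
print("SFT-like:", spectral_diagnostics(w, w_sft))
```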
📚 New findings: human‑aligned vision, grounding, JEPA, IPW
Mostly research: better concept alignment, desktop UI grounding, self‑supervised stability, and efficiency metrics. Not product launches or eval arenas.
DeepMind’s AlignNet makes vision models organize concepts more like humans
DeepMind introduced AlignNet, a training method that nudges vision models to cluster images in ways that match human concepts, rather than low‑level textures or colors.deepmind summary It’s evaluated on cognitive science tasks like triplet odd‑one‑out and multi‑arrangement, and on AI benchmarks like few‑shot learning and distribution shift, where the aligned models consistently beat their original counterparts.bar chart

The key move is to learn a representation that agrees with human similarity judgments, using datasets like THINGS and a teacher model that defines which pairs should be closer or farther in embedding space.paper recap That human‑aligned embedding then feeds into downstream tasks, which seem to gain both robustness and generalization: bars in the published charts show sizeable accuracy jumps across all four task types, not just the psych‑style ones.bar chart For people building multimodal systems, this is a concrete recipe for moving beyond “pixel‑math” perception toward models that share more of our intuitive concept space, which should help when you care about things like “animal vs tool” rather than “orange vs blue blobs”.
LeCun’s LeJEPA removes heuristics from self‑supervised vision, hits 79% on ImageNet
Yann LeCun and Randall Balestriero proposed LeJEPA, a new joint‑embedding predictive architecture that ditches the usual bag of self‑supervised tricks—no stop‑gradient, predictor heads, or teacher–student networks—replacing them with a single regularizer called Sketched Isotropic Gaussian Regularization (SIGReg).paper summaryarxiv paper On a ViT‑H/14, LeJEPA reaches about 79% top‑1 accuracy on ImageNet‑1K with a linear probe, while staying stable up to 1.8B parameters.
The theory side is that SIGReg shapes feature distributions into an approximately isotropic Gaussian cloud, which they prove minimizes average error across many downstream tasks; empirically, they show this keeps the spectrum well‑behaved and reduces principal‑subspace drift compared to supervised fine‑tuning.paper summary Training dynamics also simplify: there’s effectively one trade‑off hyperparameter instead of layers of hand‑tuned heuristics. For teams training their own vision backbones, this is a credible path toward white‑box self‑supervision—simpler to reason about, easier to scale, and better aligned with the properties you want in representations.
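SIGReg's actual machinery is a sketched statistical test for isotropic Gaussianity; purely to give the flavor (and explicitly not the paper's regularizer), a cruder penalty that pushes a batch of embeddings toward zero mean and identity covariance looks like this:

```python
# Flavor-only stand-in, not LeJEPA's SIGReg: a simple penalty pushing a batch
# of embeddings toward zero mean and an isotropic (identity) covariance.
import torch

def isotropy_penalty(z: torch.Tensor) -> torch.Tensor:
    # z: [batch, dim] embeddings from the encoder.
    mean = z.mean(dim=0)
    zc = z - mean
    cov = (zc.T @ zc) / (z.shape[0] - 1)
    eye = torch.eye(z.shape[1], device=z.device)
    return (cov - eye).pow(2).mean() + mean.pow(2).mean()

z = torch.randn(256, 64) * 0.5 + 1.0   # anisotropic-ish toy batch
print("penalty:", isotropy_penalty(z).item())
```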
GroundCUA dataset and GroundNext models push desktop UI grounding forward
A multi‑institution team (ServiceNow, Mila, McGill and others) released GroundCUA, a large desktop UI grounding dataset with 56K screenshots and over 3.56M human‑verified annotations across 87 apps and 12 categories, plus the GroundNext model family trained on it.paper tweet The task is to map natural language instructions like “click the Apply filter button in the toolbar” onto the right on‑screen element, which is crucial for reliable computer‑use agents.
GroundNext comes in 3B and 7B parameter sizes and hits state‑of‑the‑art results on five grounding benchmarks while using less than one‑tenth of the training data of prior work, thanks to that dense human supervision.arxiv paper The authors also show that a bit of RL fine‑tuning on executable tasks (e.g., OSWorld) meaningfully boosts end‑to‑end agent success rates, tying better grounding directly to higher task completion.paper explainer For anyone building desktop agents, this is both a new default dataset and a strong baseline model that should make it much easier to move from web‑only automation into real multi‑app workflows on PCs.
Intelligence per Watt paper shows 5.3× gain for local LMs since 2023
A Stanford‑led team proposed Intelligence per Watt (IPW) as a metric for how much useful work local LMs do per unit of energy, then ran a massive empirical study over 20+ small models, 8 accelerators, and 1M real single‑turn chat and reasoning queries.paper summarypaper page They find that modern local LMs can correctly answer about 88.7% of those queries, and that from 2023 to 2025 IPW improved by roughly 5.3×, with local query coverage rising from 23.2% to 71.3%.
The work compares laptop‑class chips (e.g., Apple M4 Max) against cloud GPUs and shows that, for the same model, local accelerators are still at least 1.4× worse in IPW than cloud accelerators—so there’s headroom both in model design and hardware.paper summary They also release an open profiling harness to standardize IPW measurements across model–hardware pairs. For engineers deciding what can safely move off the cloud, IPW gives a concrete way to answer “what percent of our traffic can we serve locally without wasting power?” instead of hand‑waving about “small models on devices”.
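The headline metric is easy to approximate on your own traffic; the sketch below uses an IPW-style ratio (correct answers per kilojoule) with invented accuracy and energy numbers, and the paper's exact normalization and profiling harness differ:

```python
# Sketch of an Intelligence-per-Watt style comparison; accuracy and energy
# numbers below are invented, not the paper's measurements, and the paper's
# exact normalization differs.
def ipw(correct_answers: int, total_queries: int, energy_joules: float) -> float:
    # "Useful work per unit energy": here, accuracy per kilojoule of accelerator energy.
    return (correct_answers / total_queries) / (energy_joules / 1_000)

local = ipw(correct_answers=887, total_queries=1_000, energy_joules=42_000)
cloud = ipw(correct_answers=908, total_queries=1_000, energy_joules=27_000)
print(f"local IPW={local:.3f}, cloud IPW={cloud:.3f}, cloud/local={cloud / local:.2f}x")
```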
Study finds reasoning LMs hold up, then abruptly fail as problem depth rises
A joint University of Washington/Purdue paper, “Reasoning Models Reason Well, Until They Don’t,” introduced the Deep Reasoning Dataset (DeepRD) to systematically stress‑test reasoning models by dialing up lookahead depth and branching factor in graph and proof‑planning tasks.paper writeuparxiv paper As they increase either the number of steps required or the number of options per step, large reasoning models stay accurate up to a threshold and then their accuracy collapses toward zero instead of degrading smoothly.
Crucially, this collapse appears even with branching factor 1 (no combinatorial explosion), which means you can’t blame context length or simple search width; models still invent steps, skip valid moves, or lose track of constraints as chains get longer.paper writeup When they compare this to real‑world graphs and proof datasets, most examples fall in the “easy” regime where models look great, but there’s a long tail of genuinely deep problems where current LLMs and LRMs are not reliable. If you’re designing agent workflows around “deep thinking,” this is a strong nudge to cap required reasoning depth, add external checkers, or break problems into shallower sub‑goals instead of assuming depth scales gracefully.
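To probe your own models in the same spirit (this is not the paper's generator), you can synthesize reachability questions whose answers require a controllable number of hops plus distractor edges, then watch where accuracy collapses:

```python
# Toy generator in the spirit of DeepRD-style stress tests, not the paper's code:
# build a reachability question needing `depth` hops, with `branching` distractor
# edges per step, then dial depth up until your model's accuracy falls off.
import random

def make_problem(depth: int, branching: int, seed: int = 0) -> str:
    rng = random.Random(seed)
    nodes = [f"n{i}" for i in range(depth * (branching + 1) + 1)]
    path = nodes[: depth + 1]                      # the one true chain
    edges = list(zip(path, path[1:]))
    pool = [n for n in nodes if n not in path]     # dead-end distractor targets
    for src in path[:-1]:
        for _ in range(branching):
            edges.append((src, rng.choice(pool)))
    rng.shuffle(edges)
    facts = ", ".join(f"{a}->{b}" for a, b in edges)
    # Note: this always yields a "yes" instance; a real eval also needs unreachable cases.
    return f"Edges: {facts}. Is {path[-1]} reachable from {path[0]}? Answer yes/no."

print(make_problem(depth=6, branching=2))
```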
💼 Agent web infra funding and enterprise moves
Capital flows to agent‑first web platforms; early customer logos and use cases. Not general DC capex (covered under Infrastructure).
Parallel raises $100M to make the web agent‑ready
Parallel (p0) closed a $100M Series A at roughly a $740M valuation to build web infrastructure aimed squarely at AI agents rather than human browsers, positioning itself as the “second user” layer of the web for search, extraction, and task execution.funding announcement Kleiner Perkins and Index co‑led the round, with Khosla, First Round, Spark, and Terrain re‑upping; the company already powers agentic workflows at customers like Clay and Sourcegraph by exposing search and Extract APIs tuned for LLM consumption instead of HTML.investor summary Parallel’s pitch is that high‑quality, fresh, structured web data is now a core dependency for production agents, and that naive scraping or generic search UIs leave too much hallucination risk and latency on the table, so they’re building dedicated tooling for agents to “search and think with the web” at scale.company blog A companion screenshot of their monitoring feed shows how they track real‑world usage spikes—like Reuters picking up their funding story—as signals for agent‑oriented traffic patterns.
🎨 Creative gen and 3D world models
World Labs’ Marble lands for editable 3D worlds; fal’s edit LoRAs and Tavus’ live agents push creator workflows. This is creative/media; not voice TTS or evals.
World Labs launches Marble, an editable 3D world model for text‑to‑scene creation
World Labs opened up Marble, a multimodal 3D world model that turns text, images, video, or rough 3D blockouts into coherent, editable worlds, offered in freemium and paid tiers roughly a year after the company emerged with $230M in funding Marble explainer. The model keeps a consistent internal 3D state over space and time, supports multi‑image/video prompting to stitch different views into one world, and adds Chisel so you can sketch a blocky layout then restyle it into, say, a spaceship or museum without losing geometry World Labs blog post.

Once a scene exists, Marble supports AI‑native editing (swap objects, restyle materials, expand the map) and can export Gaussian splats or triangle meshes for game engines, VFX tools, or robotics stacks, which makes it one of the first generally available “world models” that practical teams can actually download and wire into asset pipelines VR world demo.
fal.ai ships Qwen Image Edit Plus LoRA gallery and Editto for precise image/video edits
fal.ai rolled out two big creator tools: Editto for instruction‑driven video edits, and a Qwen Image Edit Plus LoRA gallery with nine specialized adapters for high‑control image editing workflows Editto launch thread.

The LoRA gallery covers tasks like Multiple Angles, Next Scene, Light Restoration, Remove Element, Shirt Design, Group Photo, Integrate Product, Face to Full Portrait, Add Background, and Remove Lighting, each demonstrated with before/after shots such as extending a beach scene to a fuller narrative frame Next scene example. Together, this means you can route a single model through very different, task‑tuned behaviors (e.g., compositing a product into lifestyle photos or turning a headshot into a full‑body portrait) without juggling separate endpoints or prompts, which is attractive for teams building creative UIs on top of fal’s API Background add example.
Tavus previews a five‑agent “AI office” for video and voice coworkers
Tavus shared an early preview of its upcoming “AI office,” where five human‑like agents can be addressed over text, voice, or video and execute tasks by connecting to work apps like email and SaaS tools AI office teaser.

A follow‑up clip shows a live, natural‑feeling back‑and‑forth between a person and a Tavus AI coworker, with synchronized lip‑synced video on the agent side that feels closer to Copilot Portraits than a static avatar Conversation demo. A separate onboarding screenshot hints at a PALS gallery of personalities and a beta "Welcome to the future" flow, suggesting Tavus wants engineers to think less about single chatbots and more about casting a small team of on‑screen agents that can each own different workflows Beta welcome modal.
Grok Imagine 1.0 now shows Midjourney‑level aesthetics in some prompts
Following up on Grok Mandarin, which focused on sharper text rendering, users now report that Grok Imagine 1.0’s overall visual style has improved to the point where its renders can look very close to Midjourney for some prompts Aesthetic comparison.
One shared example—a "photograph of an otherworldly dragon egg with starry patterns in a cosmic setting"—shows detailed lighting, texture, and composition that are a clear step up from Grok’s earlier flatter, more synthetic look Aesthetic comparison. For AI artists and product teams already integrating Grok for image gen, this suggests the model is converging not just on legibility (logos, text) but on competitive art direction quality, making it more viable as a primary creative engine rather than a novelty side‑tool.
🎙️ Voice AI: Live UX gains and licensed voices
Voice stacks get more natural and compliant routes for commercial content. This is voice UX and licensing; excludes creative video/image tools.
Google upgrades Gemini Live with accents, personas, and finer voice control
Google is rolling out a major Gemini Live refresh that makes its voice assistant sound more human, adding regional accents, better tone and nuance, and controls for pacing, language mixing, and in‑character responses.Gemini Live overview Developers and power users report it now feels “more natural and realistic,” with support for multi‑language conversations, role‑based voices, and story‑style narration.Gemini Live reaction

Under the new behavior you can:
- Ask Live to adopt specific personas or roles (e.g., Julius Caesar explaining the Roman Empire) and it will sustain that character with distinct dialogue and delivery.Persona examples
- Switch languages and even dialects mid‑conversation so it can double as a real‑time translator or pronunciation coach while keeping the flow natural.Multilingual description
- Tell it to "speed up" or "slow down" and it will adapt speaking rate on the fly, making long explanations more listenable.Pacing control
For builders, this matters because Gemini Live is starting to look like a serious AVM competitor: a single voice client with rich prosody and persona control that you can point at your own flows. The open question is what model is actually under the hood—some observers think this upgrade is quietly powered by a Gemini 3.0 checkpoint like "riftrunner," not the older 2.5 stack.Model speculation Either way, if you’re shipping voice UX, it’s worth testing whether Live now hits the bar for tutoring, coaching, or multilingual agents where tone used to be the weakest link.
ElevenLabs launches Iconic Marketplace with licensed celeb voices like Michael Caine and McConaughey
ElevenLabs unveiled its Iconic Marketplace, a curated catalog where creators and studios can legally license famous voices—headlined by Sir Michael Caine—for narration and other commercial work via ElevenLabs and the Eleven Reader app.Marketplace launchMarketplace blog The company also announced that Matthew McConaughey, already an investor, is now a customer, using ElevenLabs to power a Spanish version of his Lyrics of Livin’ newsletter in his own voice.McConaughey partnership

The marketplace is pitched as a two‑sided platform: companies request access to specific voices, while rights holders approve and manage how their voice is used.Marketplace details Early lineup includes Caine alongside 25+ other iconic figures like Maya Angelou, Alan Turing, Liza Minnelli, and Art Garfunkel, available for narration and storytelling through Iconic and Eleven Reader.Marketplace details
For AI teams in media, this addresses two long‑standing pain points at once: rights management and production quality. Instead of hand‑rolled contracts and bespoke voice clones, you get a single marketplace with pre‑cleared, high‑fidelity voices and a clear licensing model. The flip side is lock‑in: these are tied to the ElevenLabs stack, so if you standardize on them you’re implicitly tying some of your pipeline to their TTS and policy decisions.
LiveCaptions template turns ElevenLabs Scribe into end‑to‑end live event captions
Following up on Scribe launch, which introduced ElevenLabs’ ~150 ms realtime ASR, the team has now shipped LiveCaptions, an open‑source template that wires Scribe into a full live captioning stack for events. Template announcement LiveCaptions site It shows how to generate transcripts with Scribe, broadcast them via Supabase Realtime, and let Chrome’s on‑device AI handle translation in the viewer’s browser.

The reference app is a hosted site where a speaker runs Scribe locally or from a server, and attendees open a link to see low‑latency captions that update in real time, with the option to translate on the client. Template announcement Under the hood, the template:
- Uses Scribe’s streaming API for transcription, then pushes each chunk into a Supabase channel so hundreds or thousands of viewers can subscribe without custom WebSockets. LiveCaptions site
- Ships as a full OSS project (Next.js + Supabase) so teams can self‑host, customize branding, or embed into existing conference portals. Template announcement GitHub repo
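To make the glue concrete, here is a minimal sketch of the broadcast/subscribe pattern described above, using the supabase-js Realtime client. The channel name, event name, and payload shape are illustrative assumptions rather than code copied from the LiveCaptions repo, and the Scribe transcription call that produces each chunk is elided.

```ts
// Minimal sketch (not the template's actual code): fan out transcript chunks
// produced by Scribe to many viewers via Supabase Realtime broadcast.
// Channel/event names and the payload shape are illustrative assumptions.
import { createClient } from "@supabase/supabase-js";

const supabase = createClient(process.env.SUPABASE_URL!, process.env.SUPABASE_ANON_KEY!);

// Speaker side: join a channel once, then push each Scribe transcript chunk into it.
const speakerChannel = supabase.channel("live-captions");
speakerChannel.subscribe();

export async function publishCaption(text: string) {
  await speakerChannel.send({
    type: "broadcast",
    event: "caption",
    payload: { text, ts: Date.now() },
  });
}

// Viewer side: subscribe to the same channel and render each chunk
// (or hand it to on-device translation before display).
supabase
  .channel("live-captions")
  .on("broadcast", { event: "caption" }, ({ payload }) => {
    renderCaption(payload.text);
  })
  .subscribe();

function renderCaption(text: string) {
  console.log("caption:", text); // replace with your UI update / translation hook
}
```

Broadcast channels are a good fit here because caption chunks are ephemeral; if you also want late joiners to see recent history, you would persist chunks to a table and replay them on connect.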
For anyone who looked at Scribe and thought “great model, but I don’t have time to architect the glue,” this is the missing piece. You can now fork a repo, point it at your Scribe key, and have production‑ready multilingual captions for webinars, town halls, or meetups, while keeping voice data flow and translation behavior under your control.
🤖 Embodied AI: Claude‑assisted dog, humanoid sync, VLM SDKs
Practical autonomy updates and tooling for ‘Physical AI’. Separate from model launches or evals; focuses on robots and vision‑language stacks.
Anthropic’s Project Fetch shows Claude users program a robodog ~2× faster
Anthropic ran a controlled “robot dog” experiment where two internal teams with no robotics background had to gain control of a quadruped, make it fetch a ball, then run autonomously; the team allowed to use Claude completed 7 of 8 challenges in about half the time of the Claude‑less team, which managed 6 of 8. Project Fetch thread For AI engineers, this is an unusually clean uplift study: same hardware, similar engineer profiles, one key variable (access to a frontier model) that translated directly into task completion and wall‑clock speed.
Anthropic’s analysis shows the Claude team also wrote roughly 9× more code, explored more approaches in parallel, and reported feeling less confused or blocked, even though they sometimes went on distracting “side quests” like natural‑language controllers. code volume chart emotion stats The Claude side wasn’t flawless—their dog nearly ran into the other team at one point, and a vision pipeline started misclassifying the green ball as the green studio floor—but they recovered by iterating with Claude rather than debugging from scratch. failure anecdotes The write‑up and mini‑documentary frame this as an early real‑world test of how far a language model can go toward autonomous R&D on unfamiliar hardware, following earlier simulated quadruped work. Anthropic blog post
If you’re building embodied agents or toolchains for non‑roboticists, Project Fetch is a concrete datapoint that “LLM + reasonably motivated engineer” can get from a powered‑on dog to autonomous fetch behavior within a hackathon‑length timeframe. The broader point: the bottleneck is shifting from low‑level controls to model‑mediated problem‑solving, and Anthropic now has a repeatable methodology to track how future Claude versions change that curve.
XPENG IRON humanoids run in near‑perfect sync, reinforcing 2026 factory ambitions
XPENG released a new clip of two IRON humanoid robots running side‑by‑side in near‑perfect synchrony, prompting observers to describe them as “moving like they share one brain.” XPENG demo In context of XPeng IRON, where the company said it’s targeting mass production by 2026, this looks like a deliberate showcase of coordinated whole‑body control rather than a one‑off balance demo.

For embodied‑AI folks, the interesting part is dual‑robot coordination under what appears to be identical control policies and tightly matched calibration: you don’t get that kind of lockstep gait without good state estimation, latency management, and conservative safety margins. XPENG’s marketing spin talks about “dual precision,” but the engineering takeaway is that they’re already comfortable enough with IRON’s dynamics to run two units in formation at speed, indoors, on a smooth floor.
If you care about multi‑robot factories or warehouse work, this suggests XPENG is already stress‑testing synchronization and repeatability, not just single‑unit tricks. The next questions are whether they can show similarly tight coordination for manipulation and how quickly they can move from choreographed demos to closed‑loop, vision‑in‑the‑loop tasks on real assembly lines.
Perceptron ships Physical AI platform and SDK for Isaac‑0.1 and Qwen3VL‑235B
Perceptron announced its “Physical AI” platform is now live, exposing two heavy multimodal models—its own Isaac‑0.1 and Qwen3VL‑235B—over a hosted API and a Python SDK aimed at vision‑plus‑language robotics and perception apps. Perceptron launch Perceptron platform The pitch is fast, scalable multimodal inference for things that need to see and understand the world, rather than chat alone.

Developers get a provider‑agnostic SDK where you can send images or video frames plus language prompts and get back structured reasoning, detections, or descriptions, with docs and a Discord for support. Python SDK Perceptron docs From a practical standpoint, this lowers the barrier to plugging frontier‑ish VLMs into pipelines like inspection, navigation, or robot task planning without first standing up your own multimodal stack.
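The Perceptron docs are the authoritative source for the real interface (the SDK itself is Python); purely as an illustration of the "frame plus prompt in, structured output back" flow, here is a hypothetical sketch. The endpoint URL, request fields, and response schema below are invented placeholders, not Perceptron's actual API.

```ts
// Hypothetical sketch only: the endpoint, request fields, and response schema
// below are invented placeholders. Consult the official Perceptron docs for
// the real (Python) interface.
type Detection = { label: string; box: [number, number, number, number]; confidence: number };

export async function describeFrame(frameBase64: string, prompt: string): Promise<Detection[]> {
  const res = await fetch("https://api.perceptron.example/v1/infer", { // placeholder URL
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.PERCEPTRON_API_KEY}`, // hypothetical env var
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      model: "isaac-0.1",   // or "qwen3vl-235b"; identifier format is assumed
      image: frameBase64,   // assumed field name
      prompt,               // e.g. "locate the red valve and describe how to grasp it"
    }),
  });
  if (!res.ok) throw new Error(`inference failed: ${res.status}`);
  const data = await res.json();
  return (data.detections ?? []) as Detection[]; // assumed response shape
}
```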
This isn’t a full autonomy suite—you still have to wire it to control, memory, and safety layers—but it’s another sign that “Physical AI” is coalescing into a recognizable tooling category: hosted perception cores with SDKs that look more like LangChain‑style LLM clients than traditional CV libraries. If you’re experimenting with camera‑centric agents or robotics backends, this is one more off‑the‑shelf option to put on the shortlist and benchmark against your current VLM.
Musk says future Optimus robots could shadow people and host “mind snapshots”
Elon Musk used a recent appearance to sketch a very aggressive vision for Tesla’s Optimus humanoid, claiming it could one day follow individuals around to prevent them from committing crimes and, within “less than 20 years,” host an uploaded “snapshot” of a person’s mind. crime quote

Nothing in today’s stack makes that real yet, but it’s a clear signal of where he wants the product narrative for embodied AI to go.
From an engineering and policy angle, this raises obvious questions long before any “mind upload”: what it means to have a general‑purpose mobile robot continuously monitoring a human, who controls that data, and how much autonomy such a system should have to intervene. These comments lump Optimus together with Neuralink and xAI as a future “self‑replicating, self‑improving” triad, but there’s still no technical detail on perception, planning, or safety architectures that would make a crime‑prevention role credible. Optimus roadmap slide
If you work in embodied AI, the practical takeaway is not to copy this roadmap, but to expect regulators and the public to increasingly conflate humanoid platforms with always‑on surveillance and behavioral control. That means: design for consent, auditability, and clear mode boundaries now, before expectations get anchored by marketing decks rather than deployed systems.
Reachy Mini teaser shows tabletop humanoid chatting and switching languages
A new Reachy Mini preview shows the small humanoid robot holding natural‑sounding conversations while smoothly switching between languages, with synchronized gestures and head motion. Reachy Mini demo It’s a short clip, but it compresses a lot of embodied‑agent work—speech recognition, TTS, language routing, and motion control—into a single unified behavior.
For teams working on human‑robot interaction, this is a reminder that the UX win often comes from gluing standard components (LLM, ASR, TTS, simple motion primitives) into a tight realtime loop, not from novel models alone. Reachy Mini looks like it’s using the kind of LLM‑driven controller you could build today with a Claude/GPT/Gemini backend and a small set of semantic motion APIs.
If you’re prototyping front‑of‑house or education robots, this kind of demo is a useful north star: short latency, expressive but not over‑the‑top motion, and fluid language switching. The open question—as usual in embodied AI—is how robust this remains outside the lab, under noisy audio and unpredictable users.
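To make the "LLM plus semantic motion APIs" idea concrete, here is a minimal sketch of such a controller loop against an OpenAI-compatible chat backend with tool calling. The motion primitives (nod, look_at), system prompt, and model name are assumptions for illustration; the ASR/TTS plumbing and the real robot API are left out.

```ts
// Minimal sketch, assuming an OpenAI-compatible tool-calling backend and a
// couple of invented motion primitives. Not any vendor's actual robot API.
import OpenAI from "openai";

const client = new OpenAI(); // reads OPENAI_API_KEY from the environment

const tools = [
  {
    type: "function" as const,
    function: {
      name: "nod",
      description: "Nod the head to acknowledge the user",
      parameters: { type: "object", properties: {}, required: [] },
    },
  },
  {
    type: "function" as const,
    function: {
      name: "look_at",
      description: "Turn the head toward a named direction",
      parameters: {
        type: "object",
        properties: { direction: { type: "string", enum: ["left", "right", "user"] } },
        required: ["direction"],
      },
    },
  },
];

export async function handleUtterance(transcript: string): Promise<string | null> {
  // transcript would come from your ASR; the returned text would go to TTS.
  const res = await client.chat.completions.create({
    model: "gpt-4o-mini", // any tool-calling chat model works here
    messages: [
      { role: "system", content: "You control a small desktop robot. Use gestures sparingly." },
      { role: "user", content: transcript },
    ],
    tools,
  });
  const msg = res.choices[0].message;
  for (const call of msg.tool_calls ?? []) {
    if (call.type === "function") {
      // Dispatch to the robot's motion API here (hypothetical primitives).
      console.log("motion primitive:", call.function.name, call.function.arguments);
    }
  }
  return msg.content;
}
```

The point of the sketch is the shape of the loop, not the specific model: keeping the tool set small and semantic ("nod", "look_at user") is what makes the latency and the motion quality manageable in a realtime demo like Reachy Mini's.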
🚀 Frontier model watch (non‑OpenAI)
Sightings and public access outside today’s Feature: Gemini 3 RC hints and a free OpenRouter VLM. Excludes GPT‑5.1 items covered in Feature.
Gemini 3 ‘riftrunner’ shows up on LMArena as a likely RC
Google’s next Gemini checkpoint, codenamed “riftrunner,” is now selectable as a hidden model on LMArena, with testers attributing SVG outputs and controller drawings to a Gemini 3.0 Pro‑class system rather than the current 2.5 family arena sighting controller render. Early side‑by‑side gifs and SVGs show noticeably tighter structure and shading than previous Gemini variants on tasks like pixel art pelicans and stylized Mona Lisa icons, while still occasionally missing prompt details checkpoint comparison svg mona test.
For builders, the signal is that a more capable Gemini generation is being actively evaluated in the wild: if you use arena data for routing or model selection, expect “riftrunner” traces to start appearing in logs even before any official Gemini 3 release or pricing details land.
Nemotron Nano 12B 2 VL is free on OpenRouter as NVIDIA ships 49B reasoner
Following up on Nemotron paper describing NVIDIA’s long‑context VLM design, OpenRouter now exposes Nemotron Nano 12B 2 VL as a free, 128k‑context multimodal model with video inputs, meaning you can hit it over a single API for OCR, chart reasoning, and long‑form video/doc understanding at $0 per million tokens openrouter listing model card. In parallel, NVIDIA released Llama‑3.3‑Nemotron‑Super‑49B‑v1.5‑NVFP4, a 49B reasoning‑tuned model that upgrades Meta’s Llama‑3.3‑70B‑Instruct with a NAS‑optimized architecture aimed at running heavy agent workloads on a single H200 while maintaining strong tool use and a 128k‑token context 49b overview.
Together this gives teams a neat pairing: a zero‑cost 12B VLM they can hammer for video/doc tasks, and a higher‑end 49B reasoner they can self‑host from Hugging Face when they need frontier‑ish CoT and tool calling without touching OpenAI/Anthropic.
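For teams who want to try the free VLM, the call is a plain OpenAI-style chat completion against OpenRouter's endpoint; here is a minimal sketch. The model slug is a guess from the listing name, so confirm the exact ID (and the :free suffix) on openrouter.ai before wiring it into anything.

```ts
// Sketch of calling the free Nemotron VLM via OpenRouter's OpenAI-compatible
// chat completions endpoint. The model slug is assumed from the listing name.
async function askNemotron(imageUrl: string, question: string): Promise<string> {
  const res = await fetch("https://openrouter.ai/api/v1/chat/completions", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.OPENROUTER_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      model: "nvidia/nemotron-nano-12b-v2-vl:free", // assumed slug; verify on openrouter.ai
      messages: [
        {
          role: "user",
          content: [
            { type: "text", text: question },
            { type: "image_url", image_url: { url: imageUrl } },
          ],
        },
      ],
    }),
  });
  const data = await res.json();
  return data.choices[0].message.content;
}

// Example: an OCR/chart-style question against a hosted image.
askNemotron("https://example.com/chart.png", "Summarize the trend shown in this chart.")
  .then(console.log);
```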
Kimi K2 Thinking edges toward mainstream via Perplexity and infra tuning
Strings in Perplexity’s UI and leaked docs show hooks for “Kimi K2 Thinking” as an upcoming model option, alongside copy like “Try 3 Pro to create images with the newer version of Nano Banana”, together hinting that Perplexity will soon route some deep‑reasoning and image tasks through newer models, including Moonshot’s open‑source long‑horizon agent, rather than only its current proprietary lineup perplexity leak perplexity article. In parallel, Baseten’s perf team reports they’ve heavily optimized K2 Thinking on their serving stack, claiming it’s now “faster and just as smart as GPT‑5” for many workloads after INT4 and routing work baseten note.
If you care about non‑US, non‑Big‑3 model options, this is a useful inflection: K2 Thinking looks increasingly like a first‑class choice in hosted products (Perplexity) and infra platforms (Baseten), not just a GitHub curiosity, and you should expect it to start showing up in multi‑model routing and benchmarks alongside Claude and GPT‑5.
Perceptron serves Isaac‑0.1 and Qwen3VL‑235B for vision‑language robotics
Perceptron launched a Physical AI platform that wraps two heavy multimodal models—its in‑house Isaac‑0.1 and Qwen3VL‑235B—behind a single API and Python SDK, aimed squarely at robotics and embodied‑AI teams that need grounded vision‑language reasoning rather than generic chat platform launch modal partnership. The service runs on Modal’s infra, and examples show clean Python primitives for sending images or video frames plus text prompts, then getting structured outputs for detection, description, and affordance‑style reasoning suitable for manipulation or navigation.

This gives smaller labs and startups a way to prototype “agent that sees and acts” stacks without pre‑training their own VLMs or wiring up Qwen3VL manually, and it’s worth a look if you’re already experimenting with Project Fetch‑style robot‑dog workflows or world‑model tooling but don’t want to run 200B‑class models yourself.