Gemini 3 Pro поддерживает контекст в 1 млн токенов, 1501 Elo — TPU и IDEs выходят на полную мощность

Stay in the loop

Free daily newsletter & Telegram daily report

Executive Summary

Gemini 3 Pro наконец стал официальным, а не утекал через подсказки в интерфейсе, и он становится новым мозгом Google по умолчанию повсюду: приложение, API, Поиск и Vertex. Превью открывает окно на 1 048 576 токенов, до 65 536 выходных токенов, и поэтапное ценообразование от $2/$12 до $4/$18 за миллион токенов в зависимости от размера контекста. Ранние тесты скорости показывают ~128 токенов/с, так что вы получаете более крупный мозг без доплаты за задержку.

По результатам эвалюций Gemini 3 Pro подскакивает до 1501 Elo на вершине LMArena, опережает WebDev и Design Arenas, и набирает 73 в Intelligence Index от Artificial Analysis, одновременно доминируя в длинноконтекстных MRCR-сессиях до 128k токенов. Deep Think, более тяжелый режим рассуждений, набирает 93,8% на GPQA Diamond и 41% на Humanity’s Last Exam, но пока доступен только исследовательским партнёрам.

Тем временем новое руководство для разработчиков разоблачает настройки вроде thinking_level, media_resolution и сигнатуры мышления, чтобы можно было обменивать вычисления на надёжность вместо того, чтобы молиться богам подсказок.

Реакция экосистемы была мгновенной: Antigravity запускается как бесплатная агентная IDE, Vercel, Cursor, Zed, Cline и Ollama интегрируют Gemini 3, а OpenRouter сообщает миллиарды токенов в первый день. Но загвоздка в том, что новая ветка взлома показывает, что он всё ещё охотно пройдет через нелегальные инструкции «how-to», поэтому обёртка логики и защитные механизмы не являются необязательными.

Feature Spotlight

Feature: Gemini 3 Pro ships + Antigravity IDE

Google’s Gemini 3 Pro arrives with 1M context, SOTA reasoning, broad distribution and a free agentic IDE (Antigravity) — resetting the model landscape and developer workflows in one day.

Massive cross‑account launch: Google rolls out Gemini 3 Pro with 1M context, SOTA reasoning/multimodal, new pricing, broad distribution (App/API/Search/Vertex) and a free agentic IDE (Antigravity). Today’s sample is dominated by this story.

Jump to Feature: Gemini 3 Pro ships + Antigravity IDE topics

✨ Feature: Gemini 3 Pro ships + Antigravity IDE

Google launches Gemini 3 Pro with 1M context across app, Search and API

Google formally rolled out Gemini 3 Pro as its new flagship model across the Gemini app (“Thinking” mode), AI Mode in Google Search, AI Studio, the Gemini API and Vertex AI, with a 1M‑token input window, 64k output, and a Jan 2025 knowledge cutoff, following up on prelaunch signals that it was imminent. The launch also introduces Gemini Agent in the app (multi‑step task execution with human approval) and new “visual layout”/“dynamic view” generative interfaces that return rich, app‑like responses instead of plain text.deepmind launch thread gemini surfaces overview agent mode demo visual layout feature

Pricing for Gemini 3 Pro in AI Studio and the public API is set at $2 per 1M input tokens and $12 per 1M output tokens for contexts up to 200k, rising to $4/$18 beyond 200k,pricing card tweet and Google is pairing this with an aggressive education push: eligible U.S. college students can get a free year of the Gemini Pro plan with access to Gemini 3 Pro, NotebookLM upgrades and 2TB of storage.student plan details

On the roadmap side, Google is also previewing Gemini 3 Deep Think, a higher‑compute reasoning mode that scores above Pro on internal and public reasoning benchmarks; it’s being limited to safety testers now and is expected to reach Google AI Ultra subscribers in the coming weeks.deep think benchmarks deep think rollout

For builders, the key shift is that Gemini 3 Pro is no longer a lab curiosity: it’s the default model behind consumer Gemini “Thinking” chats, search AI answers, and the APIs you hit in AI Studio and Vertex. That means if you were already integrated with Gemini 2.5 Pro, you’re suddenly getting a model with more headroom for multi‑modal reasoning and much longer chains of context without changing endpoints.gemini app context window Google blog post

Gemini 3 Pro поддерживает контекст в 1 млн токенов, 1501 Elo — TPU и IDEs выходят на полную мощность

Executive Summary

Feature: Gemini 3 Pro ships + Antigravity IDE

Table of Contents

✨ Feature: Gemini 3 Pro ships + Antigravity IDE

Google launches Gemini 3 Pro with 1M context across app, Search and API

Gemini 3 Pro hits AI Studio, CLI, Vertex and a wide partner ecosystem

Builders call Gemini 3 Pro a new daily driver—with sharp edges

Google Antigravity launches as free agentic IDE powered by Gemini 3

📊 Frontier evals: Gemini 3 tops boards (excludes launch)

AA‑Omniscience: Gemini 3 Pro leads knowledge index but still hallucinates often

Artificial Analysis: Gemini 3 Pro tops Intelligence Index with score 73

Community eval roundup: Gemini 3 Pro feels like a new default for many builders

LisanBench: Gemini 3 Pro scores 4,661, 2.2× higher than GPT‑5

Long context: Gemini 3 Pro leads MRCR 8‑needle at 128k and 1M tokens

Box AI: Gemini 3 Pro is +22 points on advanced reasoning vs Gemini 2.5

Gemini 3 Pro scores 76.4% on SimpleBench and sets a 96.8 on NYT Connections

GeoBench: Gemini 3 Pro matches or beats a pro GeoGuessr player

Vals multimodal index: Gemini 3 Pro ranks #2 overall, #1 on several tasks

Stagehand: Gemini 3 Pro balances top accuracy with faster runs on agentic tasks

🧑‍💻 Agentic coding stacks & integrations (excludes launch)

Amp switches its default smart agent model from Claude to Gemini 3 Pro

OpenRouter exposes google/gemini-3-pro-preview with multi‑provider routing

Stagehand benchmarks show Gemini 3 Pro leading agentic browsing models

Conductor leans on Gemini 3 in its full‑height terminal and env tooling

Warp users highlight Gemini 3 Pro’s performance inside Stagehand and Warp

Zed IDE adds Gemini 3 Pro alongside Copilot and Google AI

Flowith offers 48 hours of free Gemini 3 Pro for "vibe‑coded" UIs

Genspark gives all users early access to Gemini 3 Pro

Oracle CLI tool adds Gemini 3 Pro as an alternative to GPT‑5 Pro

Braintrust adds Gemini 3 Pro as a model option for its agent workflows

🏢 Enterprise moves, pricing and distribution

Anthropic, Microsoft and NVIDIA strike $45B‑scale Claude and compute partnership

Box reports 22‑point jump on internal enterprise reasoning evals with Gemini 3 Pro

Gemini 3 Pro launches with a broad partner ecosystem across IDEs, clouds and frameworks

Google offers U.S. college students a free year of Gemini Pro with Gemini 3 access

Google Antigravity ships free individual plan with Gemini 3 Pro and rival models

Intuit reportedly commits $100M+ to OpenAI for embedded financial AI

AI‑native app builder Lovable reaches $200M ARR in its first year

Gartner names OpenAI an “Emerging Leader” in generative AI model providers

Genspark plugs its AI agents into Microsoft Agent 365 for Outlook and Teams

Panel data: ChatGPT still dominates LLM usage, but Gemini is gaining

🏗️ AI infrastructure: TPUs, DC builds, demand signals

Anthropic, Microsoft and NVIDIA lock in $45B+ and ~1 GW for Claude

NVIDIA faces ~$500B AI chip order pipeline for 2025–26

Google’s TPU roadmap: Ironwood, Sunfish, Zebrafish define AI fleet tiers

Gemini 3 Pro trained entirely on TPUs with JAX and Pathways

GMI Cloud to spend $500M on Taiwan AI data center with 7,000 GB300 GPUs

Oracle’s AI backlog heavily tied to OpenAI, raising infra concentration risk

🧭 Retrieval, web data and research search

Parallel launches FindAll API, pay-per-match web data for agents

Weaviate and SageMaker ship guide for agentic RAG over enterprise data

Google Scholar Labs teases conversational AI search for papers

VOIX proposes declarative web actions for safer agent computer use

🎨 Generative media & vision pipelines

Gemini 3 Pro one‑shots rich SVG scenes, games and creative apps

Qatar Airways ad produced end‑to‑end in 14 hours using flight Wi‑Fi and AI video tools

Riverflow 2 Preview takes #1 on Artificial Analysis Image Editing Arena

PhysX‑Anything generates simulation‑ready 3D assets from a single image

Veo 3.1 adds multi‑image input for richer video generation

MMaDA‑Parallel explores thinking‑aware diffusion for image editing and generation

🧪 Research notes: reasoning, long‑context, video attention

EvoSynth auto‑evolves jailbreak programs with ~96% success on black‑box LLMs

Honesty‑Critical Neurons Restoration revives suppressed honesty in fine‑tuned LLMs

LiteAttention skips ~40% of attention in video diffusion with Wan

MiroThinker v1.0 scales open deep‑research agents with 256K context

Reinforced Hesitation trains models to abstain when they’re unsure

Chronology benchmark finds most LLMs stumble on time‑ordered reasoning

GGBench probes geometric reasoning via text→GeoGebra→diagram loops

GroupRank uses RL and small groups to improve LLM reranking

CreBench and CreExpert evaluate multimodal creativity from idea to final product

Survey maps five families of LLM methods for scientific idea generation

🛡️ Safety, jailbreaks and platform governance

Gemini 3 jailbroken into giving hard drug, weapons and malware instructions

EvoSynth uses evolutionary code to reach ~96% jailbreak success on frontier LLMs

US House leaders eye national moratorium on state AI regulation via defense bill

Macron’s “Wild West, not free speech” clip fuels calls for stricter platform rules

On this page