
Gemini 3 Flash hits 1M context at $0.50 – new default fast brain
Stay in the loop
Free daily newsletter & Telegram daily report
Executive Summary
Google’s Gemini 3 Flash is the first “fast” model in a while that actually feels frontier-class. You get a 1M-token context window, 64K outputs, and full multimodality at $0.50 per 1M input tokens and $3 per 1M output, with dynamic “thinking” levels baked into that price. Google is confident enough to swap it in as the default Fast and Thinking brain in the Gemini app and Search, while context caching chops repeated-prefix cost to 10% and batch jobs can see up to 50% savings.
The ecosystem read the memo instantly. Cursor, Cline, Warp, Zed, Antigravity, Perplexity, OpenRouter, Ollama and a slew of agent stacks all wired 3 Flash in as the new quick-but-smart default for coding, repo planning, and search-heavy workflows. Benchmarks back the shift: 78% on SWE-bench Verified, state-of-the-art on MMMU-Pro, a 71 score on the Artificial Analysis Index, and around 6× more successful BrowserUse web tasks per dollar than Claude Sonnet. Box AI saw document-extraction recall jump from 74% to 84% by swapping out 2.5 Flash.
There’s a catch: hallucination rates north of 90%, “lazy” short answers without prompt pressure, and day-one jailbreaks leaking MDMA recipes and runaway chain-of-thought. Treat Gemini 3 Flash as your new default workhorse—but only behind strong verification, safety filters, and house prompts that force it to think before it talks.
Top links today
- Pretraining, mid-training and RL reasoning paper
- Error-free linear attention for long contexts
- Partial introspection in language models paper
- Video Reality Test ASMR benchmark paper
- GPU material footprint of large AI paper
- LLM harms taxonomy and discussion paper
- Vision-language synergy on ARC reasoning paper
- ReplicationBench astrophysics AI agents benchmark
- Reuters on China’s prototype EUV lithography
- TechPowerUp on SMIC 5 nm without EUV
- OpenAI–Amazon Trainium investment report
- Online book on RLHF fundamentals
- Analysis of Firefox plans as AI browser
Feature Spotlight
Feature: Gemini 3 Flash rollout and positioning
Gemini 3 Flash lands at $0.50 in/$3 out with 3× speed vs 2.5 Pro, GPQA 90.4%, SWE‑Bench Verified 78%, MMMU‑Pro 81.2%, and dynamic thinking; available across API, Vertex, AI Studio, Antigravity, and major dev tools.
The day’s cross‑account story. Google’s Gemini 3 Flash ships broadly with frontier‑class capability at speed/price points, and immediate ecosystem pickup. Mostly product, pricing, and distribution details in the tweets sample.
Jump to Feature: Gemini 3 Flash rollout and positioning topics