Gemini 3.1 Flash‑Lite hits 363 tok/s at $0.25/M – adds thinking levels
Executive Summary
Google DeepMind rolled out Gemini 3.1 Flash‑Lite in preview via the Gemini API in Google AI Studio. The product hook is “thinking levels,” a per-request dial that trades reasoning depth against latency and cost, with the model positioned as an “intelligence at scale” option for workloads like UI/dashboard generation and simulations. A widely shared community table pegs it at 363 tokens/s output with $0.25 input and $1.50 output per 1M tokens (no caching); the same post claims it beats Gemini 2.5 Flash on many tasks, but the artifact is a screenshot, not an independently reproducible benchmark pack.
• Black Forest Labs/FLUX.2 [pro]: claims 2× faster generation with no price increase; posts typography-heavy design samples as proof points.
• Topaz Labs/Wonder 2: announced as “now local,” shifting enhancement from cloud to on-device; bundles mention Astra “Scene Controls” and Starlight Fast 2 with limited observable deltas.
• OmniLottie: paper claims editable vector animation generation as Lottie tokens; introduces MMLottie-2M (2M examples) plus a standardized eval protocol.
Across threads, speed keeps getting treated as the gating variable—Flash‑Lite throughput, FLUX latency cuts, and local Topaz polish—while reliability constraints remain visible (e.g., reports Seedance 2.0 blocks face shots, forcing close-ups onto Kling 3).
Top links today
- claude-scientific-skills plugin GitHub repo
- Anthropic free AI course curriculum
- Microsoft AI degree curriculum on GitHub
- Gemini 3.1 Flash-Lite model page
- Gemini API docs and pricing
- You.com Research API product page
- You.com Research API documentation
- Runway model marketplace listing
- Topaz Wonder 2 local release details
- OmniLottie paper on arXiv
- CUDA Agent paper on arXiv
- Adaptive test-time scaling for editing paper
- Spatial understanding reward modeling paper
- Paper curation to lit review repo
- Autodesk Flow Studio GDC workshop page
Feature Spotlight
Gemini 3.1 Flash‑Lite arrives: speed, pricing, and “thinking levels” control
Gemini 3.1 Flash‑Lite is rolling out with a speed/price jump and adjustable “thinking levels,” signaling a new default for high‑volume creative + coding tasks where latency and cost decide the tool.
⚡ Gemini 3.1 Flash‑Lite arrives: speed, pricing, and “thinking levels” control
Today’s biggest cross-account story is Gemini 3.1 Flash‑Lite rolling out in preview, with creators benchmarking speed/price and highlighting new “thinking levels” to tune reasoning. This category is for LLM releases + eval tables that impact creative coding, writing, and agent workflows.
A shared benchmark/pricing table puts Gemini 3.1 Flash‑Lite in the fast-cheap lane
Gemini 3.1 Flash‑Lite eval snapshot (community): A widely shared comparison table for Gemini 3.1 Flash‑Lite highlights a “boundary of intelligence” push and claims it’s “beating 2.5 Flash on many tasks,” per the Benchmark table post. It’s a creator-relevant artifact because it pairs creative-coding constraints (speed and cost) with a quick scan of reasoning, multimodal, and long-context scores.
From the table image, Flash‑Lite is shown at 363 tokens/s output with $0.25 input / $1.50 output per 1M tokens (no caching), placed alongside other small/fast options (including GPT‑5 mini, Claude 4.5 Haiku, and Grok 4.1) in the Benchmark table post. A quote from the same thread captures the immediate vibe: “Flash‑Lite is so darn fast, I love it,” per the same Benchmark table post.
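For planning purposes, the table’s numbers turn into budget math quickly. A minimal sketch, assuming the community figures (363 tok/s, $0.25/M in, $1.50/M out) hold in production; they come from a screenshot, not official pricing docs:

```python
# Back-of-envelope cost/latency math from the community table's figures
# (363 tok/s output, $0.25 per 1M input tokens, $1.50 per 1M output tokens).
# Numbers come from a shared screenshot, not official pricing docs.
IN_PRICE, OUT_PRICE = 0.25, 1.50   # USD per 1M tokens
OUT_TOK_PER_S = 363

def per_call(input_tokens: int, output_tokens: int) -> tuple[float, float]:
    """Return (cost in USD, output time in seconds) for one request."""
    cost = (input_tokens * IN_PRICE + output_tokens * OUT_PRICE) / 1e6
    return cost, output_tokens / OUT_TOK_PER_S

# Example: 100k generations of a 400-token UI snippet from a 1,200-token prompt.
cost, secs = per_call(1_200, 400)
print(f"per call: ${cost:.4f}, ~{secs:.1f}s of generation")   # $0.0009, ~1.1s
print(f"100k calls: ${cost * 100_000:,.0f}")                  # $90
```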
Gemini 3.1 Flash‑Lite hits preview with a tunable “thinking levels” knob
Gemini 3.1 Flash‑Lite (Google DeepMind): Gemini 3.1 Flash‑Lite is rolling out in preview now via the Gemini API in Google AI Studio, as stated in the Launch thread; the headline product change for builders is new “thinking levels” that let you dial reasoning depth per task while keeping it positioned as a scale model.
The same launch post frames the target workloads as “complex workloads—like generating UI and dashboards or creating simulations,” while still emphasizing cost-efficiency at volume in the Launch thread. A regional echo showed up too, with a Turkish creator noting the expected timing and that “Gemini 3.1 Flash” finally shipped in the Turkish release note.
Gemini 3.1 Flash‑Lite is being framed as an iteration engine, not a chat model
Speed as a creative primitive: The early talk around Gemini 3.1 Flash‑Lite reads less like “best model overall” and more like “most usable at scale,” with creators anchoring on throughput + spend—“so darn fast” being the representative line in Speed quote and Google emphasizing “intelligence at scale” in the Positioning line. In short: this is about iteration.
The other notable positioning detail is control: the new “thinking levels” knob is presented as a way to trade off reasoning depth against latency/cost per task, with concrete examples (“UI and dashboards” and “simulations”) named directly in the Positioning line. That combination—cheap tokens, high throughput, and a reasoning dial—is the core creative-workflow pitch emerging from today’s threads.
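For builders who want to try the dial, here is a minimal sketch using the google-genai Python SDK. The SDK exposes a thinking_level field for Gemini 3 models; whether 3.1 Flash‑Lite accepts the same values, and the exact preview model id, are assumptions here, so check the Gemini API docs before relying on it:

```python
# Minimal sketch of per-request "thinking level" control via the Gemini API.
# Assumes google-genai's ThinkingConfig.thinking_level applies to 3.1
# Flash-Lite as it does to Gemini 3; the model id below is a guess.
from google import genai
from google.genai import types

client = genai.Client()  # reads GEMINI_API_KEY from the environment

response = client.models.generate_content(
    model="gemini-3.1-flash-lite-preview",   # hypothetical preview id
    contents="Generate a single-file HTML dashboard for weekly sales KPIs.",
    config=types.GenerateContentConfig(
        # dial reasoning depth down for cheap, fast iteration loops
        thinking_config=types.ThinkingConfig(thinking_level="low"),
    ),
)
print(response.text)
```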
🎬 Video models in the wild: extensions, shorts, and motion stress tests
Mostly creator demos of AI video generation and extensions (Grok Imagine, Seedance), plus cinematic environment clips (Luma). Excludes named premieres/release announcements, which are covered in Creator Projects & Releases.
Grok Imagine’s video extension shows a 30-second short workflow in the wild
Grok Imagine (xAI): Creators are treating Grok Imagine’s video extension as a practical way to push short-form storytelling—framed as “an animation studio at home” in the Home studio framing, with another clip calling out “I’ve hit the 30-second limit” in the 30-second cap note. The common constraint is timeboxing: you get a clean, shareable 30s beat, then you either cut or chain more segments.

• Character-centric clip at the limit: The Vampirella example explicitly stops at 30 seconds, which functions like a hard creative boundary for pacing and beat design, per the 30-second cap note.
• Prompt-as-directive mindset: A separate post treats it like a simple directive system (“Make him a plastic action figure”) rather than a storyboard-heavy setup, as shown in the Action figure prompt clip.
The pattern across posts is “one short, one hook, one cap”—built to ship micro-scenes quickly, not assemble long sequences yet.
Freepik Spaces exposes a concrete “model menu” for short-film pipelines
Freepik Spaces (Freepik): In a behind-the-scenes workflow share, Freepik lists the actual nodes used inside Spaces—including multiple AI Video Generator backends (Kling 2.3/2.5/O1/3.0 and Seedance 1.5 Pro) plus Magnific upscalers and ElevenLabs/Lyria for audio, as outlined in the Spaces nodes list. For creators, this reads like a practical menu: pick the generator per shot, then standardize the upscale and audio steps.

• What’s notable: it normalizes “multi-model per project” instead of betting the whole film on one generator, per the Spaces nodes list.
It’s a straightforward confirmation that production stacks are turning into node graphs with interchangeable generators.
Seedance 2 creators are stress-testing fast transformations in anime style
Seedance 2 (Seedance): A creator demo uses an anime-style mech transformation to test fast pose changes, reflective surfaces, and wing deployment without the motion falling apart, as shown in the Mech transform clip. It’s a compact way to evaluate whether the model can hold form during high-speed action rather than only slow camera moves.

• What’s being tested: rapid topology changes (transforming parts), metallic highlights, and continuity through quick beats—exactly the stuff that tends to wobble in short gen-video.
The clip is positioned less like a narrative and more like a repeatable “physics of motion” benchmark for the tool.
Seedance 2 anime walk-cycle tests put speed (not quality) in the spotlight
Seedance 2 (Seedance): Ongoing anime tests focus on basic locomotion and pose readability—walk cycles, then a punchier hero pose—while the creator notes they’d produce much more “if only the generations were faster,” as stated in the Walk-cycle speed complaint. The point is volume: if iteration time drops, series-style output becomes feasible.

• Why this specific test matters: walk cycles expose the small consistency errors (feet, hips, silhouette) that a transformation clip can hide.
The sentiment here is straightforward: the limiting factor isn’t ideas; it’s throughput.
An AI-generated trailer is circulating as a pacing template, not a tool demo
Trailer format (awesome_visuals): An “AI-generated movie trailer” clip is circulating less as a single-model flex and more as a reference cut for how to pace reveals, escalate stakes, and land on a title card, as shared in the Trailer reference clip. It’s a useful artifact for editors because the structure is visible even if the underlying model stack isn’t.

The creative takeaway is the edit pattern: rapid establishing beats → character inserts → tonal shift → title/button at the end.
Luma’s Aquatic Forest clip is a clean environment-and-camera demo
LumaLabsAI (Luma): “The Aquatic Forest” is being shared as an environment realism sample—underwater traversal through dense plant life with slow, controlled camera movement, as shown in the Underwater forest clip. It reads like a location plate you’d build around, not a character beat.

The value for filmmakers is in the shot type: continuous motion through a complex scene (fine detail + parallax) where artifacts are easy to spot.
🏁 What shipped: emotional shorts, creator studios, and playable drops
Named releases and finished works shared today, often used as proof that small teams can ship full films/games with AI. Excludes generic tool demos (kept in Video/Image categories).
Freepik releases ROOTS, an AI-made short built end-to-end inside Freepik
ROOTS (Freepik): Freepik published "ROOTS", a ~4m42s AI-made animated short positioned as emotional proof ("made people cry"), claiming it was produced from script to final cut entirely inside Freepik by a team, as described in the making of thread.

• What “inside Freepik” means in practice: the project cites a node stack inside Freepik Spaces—Assistant, Magnific upscalers, multiple video generators (Kling 2.3/2.5/O1/3.0 and Seedance 1.5 Pro), multiple image models (Seedream 4 and Google Nano Banana Pro), plus ElevenLabs and Google Lyria for audio—spelled out in the Spaces node list.
The thread reads like a template for small teams trying to ship a complete narrative artifact, not a one-off clip.
Terminus Breach launches on play.fun as a creator-monetized game drop
Terminus Breach (AIandDesign): The creator launched Terminus Breach on play.fun (Solana-linked “support me” framing) and shared a contract address in the launch post, positioning the platform as distribution + monetization for an indie game drop.

The playable listing is reachable via the game page linked in Play.fun listing, and subsequent posts mention scoring updates and SDK-based scores integration in the scoring update and SDK note.
The Prince of the Sea: a Seedance 2.0 demo short targeting painterly realism
"The Prince of the Sea" (DavidmComfort): A ~3m32s short dropped as a fast-turn Seedance 2.0 demo, with the creator explicitly aiming for a “painterly realism” look and calling out consistency limits when moving quickly, per the release note.

• Practical constraint surfaced: the creator says Seedance blocked shots with a character’s face, so Kling 3 was used for face-heavy moments, as written in the release note.
A 4K upload is linked via the YouTube page in 4K version.
ARQ launches with “AI Is Replacing Everyone Except Humans” and a new site
ARQ (starks_arq): ARQ shipped a short manifesto-style brand spot—“AI Is Replacing Everyone Except Humans”—and says its first films are ready alongside a newly launched website with a team sign-up funnel, as stated in the launch post.

The site is referenced as live in the project page, framing ARQ as a creator-forward studio brand rather than a tooling announcement.
Terminus Breach claims $1,500 day-one revenue, spotlighting play.fun economics
Creator distribution economics: The Terminus Breach creator claimed “I made $1500 today by posting my game on @playdotfunsol,” tying earnings directly to the platform listing in the revenue claim.

The same thread ecosystem also promotes a $TMBR token “supporting development” with the game’s contract address in the token post, suggesting a bundled model of play + patronage + tokenized support rather than ad-driven distribution.
AI Tom and Jerry clip goes viral as an emotional nostalgia-short format
AI nostalgia shorts: A ~1m45s “AI Tom and Jerry” remix is getting shared as unexpectedly emotional (“Why am I crying at AI Tom and Jerry?”), pointing to a repeatable format: familiar characters + modern cinematic pacing + sentimental beat, as posted in the clip share.

The post treats the output less like a tech demo and more like a ready-to-share short that can travel on its own.
Champion Spirit: ARQ shares a brand video translating a real gym into a cinematic cut
Champion Spirit (starks_arq): ARQ shared a ~2m05s deliverable framed as “translating the vision of places we actually visit,” citing a few days’ training at AbdoulayeFadiga’s Champion Spirit in Nassau, then producing a video once they “needed a video,” per the project note.

It’s presented as a studio use-case: capture lived context first, then output a polished brand artifact—more “small team deliverable” than experimental montage.
🖼️ Image models & visuals: text rendering, design mocks, and “realism” pushes
Heavy image chatter today: Nano Banana 2 outputs (especially typography reliability) plus FLUX speed news and Grok realism examples. Excludes copy/paste prompts and SREF codes (those go in Prompts & Style Drops).
Black Forest Labs says FLUX.2 [pro] is 2× faster at the same price
FLUX.2 [pro] (Black Forest Labs): Black Forest Labs says its most-used model, FLUX.2 [pro], is now 2× faster “with no loss in quality and no price increase,” positioning speed as the direct lever for exploring more design directions, per Speed announcement.
• What the examples signal: The posted samples span graphic design (clean typography/layout), moody monochrome portraiture, and low-light architecture, all shown in Speed announcement.
• Where to verify details: The API surface and mode breakdown live in the FLUX.2 overview, which is the only artifact in these tweets that looks like canonical documentation.
Net effect: more iterations per hour without changing budgets, which is the core constraint for most creative teams.
Nano Banana 2’s text rendering shows up in real ad-style layouts
Nano Banana 2 (Google): Creators are posting ad/poster comps where small copy, repeated taglines, and brand lockups stay readable across a layout—one take is that it “handles text perfectly,” and beyond that they “don’t see much difference with the PRO” in day-to-day output, per the Gucci-style example in Text-heavy ad mock.
This matters for designers shipping social ads, covers, and one-pagers, because typography failures are usually what force a manual Photoshop pass (or a model swap). The current vibe in these tweets is that NB2 can be used deeper into the pipeline for layout exploration, not only for background imagery, as shown in Text-heavy ad mock and reinforced by Freepik’s “Nano Banana 2 is now live” framing in Freepik launch note.
Floor plan uploads are being pitched as an interior-design shortcut in Nano Banana 2
Nano Banana 2 (Google): A claim making the rounds is that you can upload a 2D floor plan and have Nano Banana 2 propose a full-home design while keeping accurate dimensions, positioned as a direct threat to early-stage interior visualization work in Floor plan to house claim.
The practical creative implication (if it holds up) is faster concepting for: realtor staging concepts, renovation “before/after” decks, and look-dev boards where scale matters (door widths, hallway clearance, furniture fit). The tweet doesn’t include a validation clip or examples, so treat it as an adoption signal rather than a verified capability, per the single-line assertion in Floor plan to house claim.
Hidden-object puzzles keep working as a Firefly + Nano Banana 2 visual format
Adobe Firefly + Nano Banana 2: The “Hidden Objects” format—one dense illustration plus a 5-item find list—keeps getting published as numbered levels, acting like a lightweight episodic visual series rather than standalone art, as shown in Hidden objects level and continued in Frozen scene level.
The creative win is the packaging: the image is already a postable asset with an interaction hook baked in (comment answers, duets, stitches). The examples in Hidden objects level and Frozen scene level also show the model doing “busy scene coherence,” where micro-details don’t collapse into unreadable noise.
Grok image gen is being steered toward realistic phone selfies with long prompts
Grok (xAI): A long, constraint-heavy prompt is being used to push Grok into a believable “front camera” selfie look—high angle, mild wide-angle distortion, dappled sunlight, and candid framing—resulting in a photo that reads more like social media than studio work, per the example in Realistic selfie example.
For photographers/designers, the interesting detail is the prompt’s emphasis on camera-language (lens feel, noise, imperfect composition) instead of only describing the subject—an approach that seems to correlate with the “real phone shot” aesthetic in Realistic selfie example.
Nano Banana 2 is being used to ‘upgrade’ Pokémon into premium card art
Nano Banana 2 (Google): One repeatable content lane is “underappreciated Pokémon get the card art they deserve,” with punchy, high-energy frames that read like collectible-card key art—see the Psyduck/Metapod/Jigglypuff/Diglett set in Pokémon card art set.
What’s useful here for storytellers and character artists isn’t the IP angle—it’s the format discipline: a consistent subject silhouette, a single iconic action beat, and a background that communicates “power” without needing animation. The post in Pokémon card art set shows how that structure scales into a series quickly once the look is dialed in.
Topaz Gigapixel AI ‘before/after’ reels are getting reposted as proof of detail recovery
Topaz Gigapixel AI (Topaz Labs): A simple split-screen “blurry original → crisp enhanced” clip—with zoom-ins that show recovered texture—keeps circulating as a standalone format, with the Topaz Gigapixel AI branding visible in the reposted example in Before-after demo.

This format matters because it’s an easy way to communicate quality gains to clients (or an audience) without explaining settings: one frame establishes the problem, the other sells the fix. The clip in Before-after demo is also a reminder that upscalers are now part of the creative stack’s “presentation layer,” not only a technical cleanup step.
🧩 Prompts & style codes you can paste today (SREFs, JSON specs, grids)
A high volume of reusable prompt assets today: Midjourney SREF codes, Nano Banana 2 prompt packs, and structured JSON prompt specs. This section is intentionally separate from tool capability news.
Midjourney —sref 968833677 for premium graphic-novel fantasy concept art
Midjourney (—sref 968833677): promptsref is circulating a “premium European graphic novel / high-end RPG concept art” style code, with positioning around “dramatic lighting” and “hand-painted but crisp” texture, as described in the style pitch.
• Replication artifact: The site posts a dedicated breakdown page with prompt guidance and parameter notes in the replication guide, while the daily code share keeps the raw code copy-pastable.
The examples shown in the style pitch skew toward panel-ready compositions (characters + environment beats), not just single-hero portraits.
3×3 pose-collection prompt for consistent character turnarounds
Pose grid prompt (3×3, 2:3 cells): A reusable instruction set for generating a 3×3 pose collection where every cell stays 2:3, the character identity stays fixed, and poses/angles vary while the scene remains consistent, per the pose grid instruction.
The same constraint set is echoed with tool + prompt links in the pose grid example, framing it as a repeatable way to produce character sheets or pose libraries without manual curation.
A Grok prompt template for realistic outdoor selfie portraits
Grok (realistic portrait prompt): A long-form, photography-language prompt specifies a candid front-camera selfie (higher angle, mild wide‑angle), plus concrete wardrobe (gingham bikini top + pendant), environment (wooded area with water patch), and lighting (dappled sunlight), aiming for “spontaneous phone photo” realism, per the full prompt example.
The structure is essentially a fill-in brief—subject, camera behavior, makeup, hair, accessories, setting, and light—rather than a single-line prompt.
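One way to see that structure is as a template with slots. The field values below are paraphrased from the post’s description, not the original prompt text:

```python
# The "fill-in brief" shape of the prompt, as a template. Slot values are
# illustrative paraphrases; the exact wording lives in the linked example.
BRIEF = (
    "Candid front-camera selfie, {angle}, mild wide-angle distortion, "
    "imperfect composition, slight sensor noise. Subject: {subject}. "
    "Wardrobe: {wardrobe}. Setting: {setting}. Lighting: {lighting}. "
    "Spontaneous phone-photo realism, not studio work."
)
print(BRIEF.format(
    angle="slightly high angle",
    subject="relaxed, candid expression",
    wardrobe="gingham bikini top with a small pendant",
    setting="wooded area beside a patch of water",
    lighting="dappled sunlight through leaves",
))
```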
Midjourney —sref 2515650061 for European graphic-novel editorial illustration
Midjourney (—sref 2515650061): A shareable style reference targeting clean inked linework + warm editorial color, pitched for “contemporary narrative illustration” that reads like a polished European graphic novel, per the style reference code. It’s framed as especially useful when you want story-first portrait frames and slice-of-life compositions without drifting into painterly fantasy.
The examples in the style reference code lean on strong contour lines, simplified but deliberate backgrounds, and readable “scene staging” that holds up well for sequential panels (character-at-desk, café interior, street establishing shot).
Midjourney —sref 3445336118 for prismatic “ethereal magic” lighting
Midjourney (—sref 3445336118): promptsref highlights a lighting-first style reference framed as “ethereal magic” with prismatic dispersion, dust/particle glow, and warm-vs-cosmic contrast, per the style code note. It’s pitched for luxury ad frames, emotional sci‑fi posters, and music-video storyboard stills.
A more detailed walkthrough (prompt breakdown + example set) is hosted in the style guide page, which is the only concrete artifact linked in the thread.
Midjourney —sref 642191218 for Franco‑Belgian steampunk narrative illustration
Midjourney (—sref 642191218): A steampunk-leaning style reference described as “contemporary European cartoon narrative illustration” with Franco‑Belgian comics energy and feature‑animation sensibilities, per the style reference code. The look is built around bold outlines, expressive faces, and warm directional light that keeps scenes legible.
The four examples in the style reference code show consistent character design language (big eyes, simplified shapes) while still rendering detailed props/architecture—useful for storyboard-like sequences where you need clarity more than realism.
Nano Banana 2 “Holochrome Gradients” prompt for chrome product renders
Nano Banana 2 (product render prompt): A copy-paste prompt recipe for “Holochrome Gradients” focuses on pure-black backgrounds (#000000), hard-edge rim lighting, chromatic aberration on metal edges, and thin‑film iridescence across brushed aluminum + mirror chrome, per the prompt share.
The prompt’s constraint list is unusually strict (no props, no text/logos, no dust/scratches), which is the core trick for getting clean catalog-ready assets, as repeated in the repost of prompt.
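Since this section also covers structured JSON prompt specs, the recipe’s constraint list translates naturally into one. The field names below are illustrative, not a schema any tool requires; the shared prompt itself is plain text:

```python
# The Holochrome recipe's constraints restated as a JSON-style spec.
# Structure is illustrative only; the circulating prompt is plain text.
import json

spec = {
    "subject": "product render, brushed aluminum and mirror chrome",
    "background": "#000000",                      # pure black, per the recipe
    "lighting": "hard-edge rim lighting",
    "effects": [
        "chromatic aberration on metal edges",
        "thin-film iridescence across surfaces",
    ],
    "exclude": ["props", "text", "logos", "dust", "scratches"],
}
print(json.dumps(spec, indent=2))
```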
Midjourney —sref 2828278251 for cozy kawaii illustration
Midjourney (—sref 2828278251): promptsref spotlights a “cozy” style reference aimed at clean modern digital illustration with Japanese kawaii warmth (soft greens/oranges; afternoon-light mood), per the style code note. The positioning is children’s books, cozy games, and lifestyle brand art.
The linked breakdown page in the style guide page is where the copy-paste parameters and style replication guidance live.
Midjourney —sref 583019043 for engraved black-and-white sci‑fi vistas
Midjourney (—sref 583019043): A black-and-white, engraving-like landscape style code that reads as “classic illustration” with sci‑fi megastructures and deep contrast, shared in the sref post. It’s a strong fit for book interior plates, map-like establishing shots, and monochrome concept boards where line texture matters more than color.
Nano Banana 2 daily prompt: fruit turned into a miniature luxury house
Nano Banana 2 (daily prompt format): A reusable “change only the fruit” prompt turns any fruit into a miniature luxury house shot like an architecture magazine cover—Sony A7III, 85mm f/1.4, soft daylight, visible surface textures—per the daily prompt post.
The fully copy-paste version of the text prompt (including the “no text, no artifacts” constraint) is restated in the full prompt text, preserving the key idea: swap the fruit variable while keeping camera + lighting fixed.
🧠 Workflow recipes & agents: from Zillow-to-video to “AI PM” build loops
Today’s most actionable content is workflow-first: multi-step pipelines, agentic search APIs, and “vibe coding” playbooks that creators can run immediately. Excludes single prompt drops (Prompts category) and finished releases (Showcases).
A PM-first spec loop for Cursor: brain dump → tech spec → phased checklist
PM-first build loop: A practical “stop acting like a coder, start acting like a PM” recipe is being used to make Cursor agents more reliable by feeding them structured intent before any code changes, as laid out in the PM-first playbook.

The sequence is: dump exhaustive requirements into a plain text file in the repo root → have an LLM rewrite it as a senior-PM-style technical spec → have it break the spec into a markdown phase checklist → prompt the agent “Start Phase 1,” test locally, iterate, then move to Phase 2, as described in the PM-first playbook. The same post names a concrete “creative dev” stack—Nano Banana 2 for textures/refs, Tripo for 3D conversion, Cursor + Opus 4.6 for implementation, and Netlify for deploy—per the PM-first playbook.
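The loop is scriptable if you want to standardize it across projects. A minimal sketch follows; ask_llm is a hypothetical stand-in for whatever model client or IDE agent you use, and the file names are conventions from the post, not requirements:

```python
# Sketch of the PM-first loop: brain dump -> tech spec -> phased checklist.
# `ask_llm` is a hypothetical placeholder; wire in your own model client.
from pathlib import Path

def ask_llm(prompt: str) -> str:
    raise NotImplementedError("replace with your model/agent call")

dump = Path("braindump.txt").read_text()   # exhaustive requirements, repo root
spec = ask_llm(
    "Rewrite these requirements as a senior-PM-style technical spec:\n" + dump
)
Path("spec.md").write_text(spec)

phases = ask_llm(
    "Break this spec into a markdown checklist of build phases:\n" + spec
)
Path("phases.md").write_text(phases)
# Then prompt the coding agent "Start Phase 1", test locally, iterate,
# and only move to Phase 2 once that phase's checklist passes.
```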
Calico AI turns Zillow photos into cinematic listing videos with AI VO
Calico AI (Calico): A repeatable real-estate pipeline is getting shared as “Zillow link → luxury listing video under $10,” with the concrete stack spelled out in the Step-by-step workflow and the tool entry point linked in the Calico product link.

The flow is: pick 6 listing photos → animate each in Calico AI (dolly/pans) → have a custom GPT write a 30s voiceover → generate narration + music in ElevenLabs → assemble in CapCut with captions, as described in the Step-by-step workflow. The framing is cost substitution (agents paying $1K–$5K per property) and “every listing gets a walkthrough,” rather than just high-end properties, per the Step-by-step workflow.
An open-sourced “AI game engine” pipeline: Nano Banana tiles + Tripo models
Tripo + Cursor pipeline: A walkthrough shows a modular game demo where creators can swap in their own Nano Banana tile sets and Tripo-generated 3D models, with the codebase open-sourced and the flow explained in the Build overview clip.

The same thread’s “PM-first” companion post details how the asset side plugs into an agent-driven build cycle—generate/collect tile textures and 3D refs with Nano Banana 2, convert 2D images into usable 3D assets in Tripo, then let Cursor + Opus 4.6 implement in phases and ship via Netlify, as described in the Stack and phase loop.
Pika AI Selves: personal AI “twin” with voice and cross-app presence
Pika AI Selves (Pika): Pika is being pitched as a “living extension of you” that can generate videos/images, remember ongoing projects, and operate across chat surfaces (Telegram/Slack/Discord), with a creator demo thread in the AI Twin walkthrough and an entry link in the Create page.

The concrete claims are: train on appearance + cloned voice; use persistent memory to maintain context and keep working “while you sleep,” per the AI Twin walkthrough. The thread then enumerates task patterns (auto content, DM replies, reminders, portfolio generation) as examples of what a “self” can be configured to do, per the Task list.
You.com launches a Research API for agentic, cited deep search
You.com Research API (You.com): A single POST request is framed as triggering a full agentic search loop—multiple queries, cross-referenced sources, and cited answers—along with a “ranked #1 on DeepSearchQA” claim and $100 in free credits, as described in the Research API pitch.
The post emphasizes “no credit card required” onboarding and positions the product against “10 blue links,” per the Research API pitch. Treat the leaderboard as provisional from this one share—no eval artifact is linked in the tweets beyond the screenshot in Research API pitch.
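Mechanically, the pitch amounts to one HTTP call. A hypothetical sketch is below; the endpoint path, auth header name, and payload shape are all assumptions, so confirm them against the Research API documentation linked above:

```python
# Hypothetical sketch of the "single POST -> agentic search loop" flow.
# Endpoint, auth header, and payload shape are assumptions, not docs.
import os
import requests

resp = requests.post(
    "https://api.you.com/v1/research",                  # assumed endpoint
    headers={"X-API-Key": os.environ["YOU_API_KEY"]},   # assumed header
    json={"query": "Current state of editable vector animation models"},
    timeout=120,                                        # agentic loops are slow
)
resp.raise_for_status()
print(resp.json())  # expected per the pitch: a cited, cross-referenced answer
```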
microHQ is pitched as an agent layer across email, meetings, pipeline, and X
microHQ (microHQ): A creator describes consolidating “inbox, CRM, tasks, and AI assistants” into one context-aware agent layer that “lives where you work” and connects email, meetings, pipeline insights, and X, per the Personal stack rationale.
The post frames it as software adapting to the user (persistent context + unified surface) rather than juggling tools manually, echoing the same positioning in the Agent layer description.
Prismer open-sources an end-to-end “papers to code” research workspace
Prismer (Prismer-AI): An open-source research platform is framed as automating a full loop—paper curation → literature review → code—targeted at arXiv/OpenReview-style workflows, as summarized in the Paper grind pitch and backed by the GitHub repo.
The repo description highlights an AI-native PDF reader, citation graphing, context management SDKs, and a multi-agent collaboration layer, according to the GitHub repo.
CaravoAI pitches a single integration layer for 200+ APIs for agents
CaravoAI (Caravo): A connector product is pitched as reducing “agents guessing” by wiring them into real data sources via “200+ APIs through one simple connection,” as stated in the Connector pitch.
The framing is workflow-scale automation: instead of building per-API adapters, the agent gets one integration surface and can pull from many systems, per the Connector pitch.
🧱 3D & interactive creation: 2D→3D assets, game engines, and GDC workflows
Interactive and 3D pipelines show up as practical creator stacks: Tripo-based asset creation, AI-assisted game building, and Autodesk’s game-dev ideation workshop at GDC. Excludes pure image prompts and finished film releases.
Techhalla’s open-source AI game demo treats assets as swappable modules
Customizable AI game demo (techhalla): A playable prototype is shown as a modular asset system—custom “Nano Banana” tiles/textures plus imported Tripo 3D models—positioned as a way to keep extending a game world by swapping inputs rather than rebuilding systems, as described in the Game demo thread and reinforced by the Stack + process notes.

The clip highlights the core creative lever: the environment can be remixed by feeding new tile sets and new 3D meshes, while the rest of the logic stays stable—useful for creators iterating on “world kits” (biomes, props, UI) instead of one-off scenes.
A PM-style workflow for building interactive demos with coding agents
Vibe-coding workflow (techhalla): A repeatable planning loop is outlined for building an interactive demo: write an exhaustive requirements “brain dump” first, have an LLM convert it into a technical spec, then split it into phase checklists so an IDE agent can execute “Start Phase 1/2/3…” with tight test/fix loops, as laid out in the PM-first playbook.

The stack callout explicitly ties creative asset generation into the build plan—Nano Banana 2 for textures/refs and Tripo AI for turning 2D images into 3D models—before delegating implementation to “Cursor + Opus 4.6” and shipping via Netlify, per the same PM-first playbook.
Autodesk Flow Studio schedules a GDC live session on AI game ideation
Autodesk Flow Studio (Autodesk): A GDC 2026 LinkedIn Live workshop is announced for March 10 (5:15 PM PT) focused on AI-assisted early-stage game development—specifically storyboards → character ideation → scene exploration—per the GDC live announcement.

The demo clip emphasizes fast visual iteration (storyboard frames becoming 3D-ish character/scene explorations), positioning Flow Studio as a pre-production accelerator rather than a final-asset generator.
Tripo AI keeps showing up as the 2D-to-3D conversion step for creators
Tripo AI: In the showcased game-building workflow, Tripo is framed as the necessary conversion layer between pretty 2D generations and “usable 3D models,” explicitly called out as a prerequisite before explaining how the interactive build works in the Game build walkthrough.

This is less about a new Tripo feature and more about an emerging division of labor in creator stacks: image models produce style + surfaces, while Tripo produces geometry that can be dropped into a real-time scene graph (games, WebGL demos, engine prototypes), as echoed in the Stack list.
A 2D floor plan gets turned into a 3D tour inside Freepik Spaces
Freepik Spaces (3D tour workflow): A method share claims a full pipeline from a flat 2D floor plan into a navigable 3D tour “inside Freepik Spaces,” with the promise of step-by-step details in the Floor plan to 3D tour tease.
The tweet doesn’t include the actual node graph/settings in-line, but the framing matters for interactive creators: it treats “floor plan → walkthrough” as an end-to-end AI pipeline rather than a manual DCC task—something that can slot into real estate, set pre-vis, or game-level blockout workflows.
Meshy ties GDC booth traffic to a high-ticket giveaway
Meshy (MeshyAI): Meshy advertises a GDC booth giveaway with unusually high-end prizes—an Nvidia DGX Spark ($4000) plus a PS5 Pro, Nintendo Switch 2, and an Elegoo 3D printer—as shown in the Prize list graphic.
The same post also spotlights “exclusive Meshy merch packs” for booth visitors, signaling an on-the-ground push to recruit 3D creators who are already at GDC, per the Prize list graphic.
🛠️ Finishing the shot: upscalers, local enhancement, and frame polish
Post tools were a major thread today—especially Topaz updates moving more enhancement locally and creators chaining upscalers with gen video. Excludes generation itself (Image/Video categories).
Topaz Wonder 2 moves to local processing
Wonder 2 (Topaz Labs): Topaz says Wonder 2 is now local, shifting this enhancer from a cloud-style workflow into an on-device step in the finishing pipeline, as announced in the Update callout and reinforced on the Local update page.
The clearest proof point they share is the coyote close-up—heavy pixelation on “Original” vs fur/eye detail recovery on “Wonder 2” in the Update callout.
Seedance clips get pushed to 2K via Magnific upscales
Seedance → Magnific → CapCut: A creator shows a post workflow where a Seedance generation is run through a Magnific upscale, with the claim that it turns the result into a “2K monster,” then handed off to CapCut for final packaging in the 2K upscale demo.

The tweet doesn’t specify settings (model choice, grain/detail, face protection), but it’s a clean example of “generate first, then spend compute on the keeper.”
Freepik Spaces treats upscaling as a standard node
Freepik Spaces: Freepik’s “ROOTS” breakdown lists Magnific Image Upscaler and Magnific Video Upscaler as first-class nodes in the Spaces workflow graph, alongside multi-model video generators and audio tooling, as enumerated in the Nodes list.

This is a concrete signal that “upscale/restore” is being packaged as a default step inside creator hubs—not a separate app you remember to open at the end.
Seedance 2 paired with Topaz for 60fps fluidity
Seedance 2 + Topaz: Another finishing-stack combo getting repeated is Seedance 2 output followed by Topaz processing to make motion feel “insanely fluid” at 60fps, as claimed in the 60fps combo clip.

The post doesn’t name the exact Topaz module (upscale vs frame interpolation), but the intent is clear: treat temporal smoothing as a separate, explicit polish pass.
Topaz Astra adds Scene Controls
Astra (Topaz Labs): Topaz also calls out Scene Controls in Astra as part of the same release bundle in the Feature list, with the umbrella update described on the Local update page.
No UI screenshot of the controls is included in today’s tweets, so the practical knobs (camera motion, face handling, strength limits) remain unspecified from these sources.
Topaz Gigapixel AI’s “before/after” reveal format keeps working
Gigapixel AI (Topaz Labs): A creator post spotlights the familiar finishing pattern—show a soft/blurry original next to an AI-enhanced version, then punch in on recovered details—explicitly crediting Topaz Gigapixel AI in the Before-after clip.

This continues to read as a repeatable social format for photographers and AI image creators: the polish step becomes the content.
Topaz ships Starlight Fast 2 alongside the local push
Starlight Fast 2 (Topaz Labs): In the same announcement where Topaz flags Wonder 2 going local, it also name-checks Starlight Fast 2 as part of the update bundle in the Feature list, with broader positioning collected on the Local update page.
The tweets don’t include side-by-side samples for Starlight Fast 2, so what changed (speed, temporal stability, artifacting) isn’t observable from the shared media yet.
⚖️ Copyright, privacy defaults, and the authorship fight
Today’s policy/ethics content centers on (1) copyright requiring human authorship and (2) privacy defaults where chat data trains models unless opted out. This is directly relevant to creators selling work, using likeness, and choosing tools.
SCOTUS lets stand the rule that copyright requires human authorship
Thaler v. Perlmutter (SCOTUS): The Supreme Court declined to hear Thaler v. Perlmutter, leaving in place the ruling that human authorship is required for copyright—meaning fully autonomous AI outputs remain outside protection, as framed in the creator recap in Human authorship thread. The practical implication creators keep highlighting is that “human craft” (direction, curation, iteration, compositing) is the thing the law recognizes—an argument that’s been circulating in the community already, including the multi-tool workflow framing in Not one button.
• What counts as the risk case: The post stresses the case was about “fully autonomous output” with “zero human authorship,” and argues most real production workflows don’t look like that, per Human authorship thread.
• Pushback on simplistic takes: A reply emphasizes that AI-assisted work can still be copyrightable when it’s part of a broader human creation workflow and you can show the model “assisted like a tool,” as stated in Copyrightability nuance.
Stanford paper claims default chat-training and two-tier protections
User privacy and LLMs (Stanford): A widely shared thread claims a Stanford analysis of privacy policies found that major AI developers use user chats for training by default, with limited transparency and muddy opt-outs, according to Privacy-policy findings. The underlying paper is linked as a PDF on arXiv in ArXiv PDF.
• Two-tier default claim: The thread asserts enterprise users are “opted out of training by default” while regular users are “opted in,” per Privacy-policy findings.
• Retention and sensitive-user concerns: It also alleges indefinite or long retention for some services and that “4 of the 6” appear to train on children’s chat data, as stated in Privacy-policy findings.
Treat the post as a secondary summary—its strongest anchor is the linked paper in ArXiv PDF, while the tweet itself doesn’t include vendor-by-vendor screenshots or excerpts.
The “you didn’t make anything” authorship line hardens into memes
Authorship discourse: A recurring anti-AI refrain—“you didn’t MAKE anything”—showed up again as a shareable video bit, which signals how the authorship fight is increasingly happening through slogans instead of process details, per You didn’t make anything clip.

• Derogatory labels as shorthand: Terms like “promptoid” and “sloperator” are getting used as compressed critiques of AI-first creation, as seen in Promptoid labels post.
• Escalation on the pro-AI side too: Some pro-AI accounts are responding with similarly inflammatory dunking on non-AI artists, as shown in Pencil artists dunk.
📚 Free curricula & creator education drops (worth bookmarking)
Education links were unusually dense today: Anthropic’s free course catalog, Microsoft’s GitHub “AI degree,” and creator training funnels. Excludes giveaways and conferences (separate categories).
Anthropic publishes a free, end-to-end Claude learning path (Claude → API → MCP → Skills)
Anthropic Courses (Anthropic): A shareable “10-course” learning path is circulating that points to Anthropic’s Skilljar-hosted catalog—starting with Claude 101, then Claude Code in Action, Claude API, and multiple Model Context Protocol (MCP) modules, plus an agent skills course for Claude Code, as enumerated in the Course list and enroll link and introduced in the Curriculum callout. The practical value for creators is that it strings together the exact stack many production workflows now use (chat → code agent → external tools via MCP) rather than treating them as separate topics.
The catalog entry points and framing are consolidated on the Skilljar hub linked in the course catalog, with one example being “Claude with Google Cloud’s Vertex AI” shown in the Vertex AI course post.
Microsoft’s AI for Beginners repo gets re-promoted as a full bootcamp-style track
AI for Beginners (Microsoft): The microsoft/AI-For-Beginners GitHub repo is being reshared as a “complete AI degree” alternative—positioned as structured, beginner-friendly, and open-source in the Degree framing post, with the source material living in the GitHub repo. For creative technologists, the immediate relevance is the repo’s lab-first structure (not just reading) across core ML concepts that tend to show up in real creative tooling (vision, NLP, responsible AI).
The tweets don’t add new curriculum changes; they’re an adoption signal that this specific repo is becoming a default bookmark for people trying to skill up without paid cohorts.
Anima_Labs outlines a story-first AI creation tutorial with a multi-tool stack
Create your story tutorial (Anima_Labs): Anima_Labs posted a multi-part tutorial thread aimed at story production, pitching it as a “theoretical guide” that only covers the surface but maps the moving pieces of an AI pipeline in the Tutorial thread kickoff.

A notable add-on is the explicit cost-and-complexity angle: they recommend consolidating subscriptions via multi-model platforms (Freepik, Leonardo, Hedra, Martini, Krea) and using node-style systems to manage the data flow, as described in the Multi-platform node advice.
A creator-led AI animation course opens a waitlist
AI animation course (creator-led): A creator education funnel is forming around an AI animation course—shared as “really great” with a waitlist now open in the Waitlist mention. No syllabus, tool list, pricing, or cohort dates are included in today’s tweet, so details remain unspecified from the provided sources.
🧰 Where creators work: studios, aggregators, and in-app model access
Platform aggregation is the theme here: tools trying to become the ‘one place’ you create, host, and swap models. Excludes Gemini Flash‑Lite (covered in the feature).
Freepik Spaces spotlights a node-based studio stack with a full short film
Freepik Spaces (Freepik): Freepik showcased an AI-made animated short, “ROOTS,” claiming the team built it “entirely inside Freepik” from script to final cut in the ROOTS making-of; the thread spells out Spaces as a node-based environment that wires multiple image/video/audio models together.

• Model menu inside the studio: the Space lists built-in nodes including Magnific upscalers plus multiple video generators (Kling 2.3/2.5/O1/3.0 and Seedance 1.5 Pro) and image generators (Seedream 4 and Google Nano Banana Pro), as enumerated in the ROOTS making-of and repeated in the nodes recap.
• 3D-from-plan angle: separate creators are also framing Spaces as a place to turn a 2D floor plan into a 3D tour workflow, as described in the floor plan to 3D tour retweet.
What’s still unclear from today’s posts is pricing/limits for each node—only the “all-in-one pipeline” shape is concrete.
Notion Custom Agents adds MiniMax M2.5 as an open-weight model option
Notion Custom Agents (Notion): A creator thread claims MiniMax M2.5 is now the first open-weight model available inside Notion Custom Agents, framing it as a 230B-parameter model priced at “1/20th” of Claude Opus, per the Notion open-weight claim.
This is a distribution signal: “open-weight” models showing up inside mainstream work apps, not just dev-centric model routers.
Treat performance claims as provisional—today’s tweet cites benchmarks but doesn’t provide an eval artifact or direct Notion docs beyond the statement in the Notion open-weight claim.
Runway pushes a “best models in one place” hub pitch
Runway (Runway): Runway is leaning harder into the “one studio” positioning—claiming creators can access “all of the world’s best image, video, audio and language models” directly inside Runway, with more integrations promised, as stated in the multi-model announcement.

This matters less as a single feature and more as a workflow bet: the product is positioning the editor as the stable surface, with models becoming swappable backends.
• Product posture: the emphasis is breadth (image + video + audio + text) rather than a single flagship generator, per the multi-model announcement.
• Rollout framing: the only concrete availability note is “try them now,” reiterated in the try now follow-up, with no model-by-model list in the tweets.
Topaz makes Wonder 2 a local model and expands Astra controls
Topaz Labs “Local” stack (Topaz Labs): Topaz says Wonder 2 is now local (run on-device) and bundles that into a broader update that also mentions Starlight Fast 2 plus Scene Controls in Astra, as announced in the Wonder 2 local update.
• Why “local” is the headline: the marketing shift is toward a unified ecosystem where desktop/local and cloud tools sit under one update surface, with details collected on the Topaz updates page described in Topaz update page.
• Evidence shown: Topaz’s post uses an “Original vs Wonder 2” close-up (coyote fur detail recovery) to sell the local-quality jump, as shown in the Wonder 2 local update.
No hardware requirements or pricing changes are stated in the tweets; only the local availability claim and the feature names are explicit.
📣 AI marketing creatives: fast testing, visual metaphors, and creator positioning
Marketing content today is about creative that scales (cheap variants, fast tests) and the meta-positioning creators are using (“market testing,” “customers > gear”). Excludes step-by-step build pipelines (Workflows category).
Cartoon “superhero bottle” ads make benefit-first creative cheap to A/B test
NahFlo2n (performance creative): A new ad pattern is getting spelled out as “show the benefit in one second” by turning a supplement bottle into a cartoon hero and animating the body-effect metaphor (blocked → flowing energy), with the outcome framed as $58,400 in 3 weeks in the Supplement ad results post. It’s a pitch for volume testing. Fast variants are the product.

• Variant engine framing: The follow-up positioning leans hard on throughput—“under 90 seconds to generate each video” and “$1–2 per creative,” plus “winners scaled fast, losers cut,” as described in the Volume testing addendum.
The core creative claim is that “funny sells” because it holds attention long enough to land the story, per the Supplement ad results breakdown.
Market testing shifts AI creators toward “thumbnail + hook” experimentation
BLVCKLIGHTai (creator practice): Market testing is getting described as a daily operating habit—run a dozen small format/style trials, then read the spread before investing in longer edits, as shown by a grid of recent TikToks with view counts in the Testing grid screenshot. One post hits 118.7K while many sit in the low thousands. That dispersion is the point.
The post also calls out platform dependence (“what works on one platform doesn’t work on another”), which effectively turns AI output into test inventory rather than “one big swing,” per the Testing grid screenshot.
“Customers > gear” becomes a common creator stance in an AI tool stack era
thekitze (creator positioning): A recurring business stance is getting stated bluntly as “you don't need the m5 max macbook you need a marketing strategy and customers,” in the Customers over gear post. It’s a counter-narrative to hardware flex.
The implied subtext is that AI tooling has shifted perceived bottlenecks away from compute toward distribution and conversion, as framed by the Customers over gear line.
“Share your art” threads keep working as a lightweight distribution loop
GlennHasABeard (community growth): A simple growth mechanic keeps showing up: an explicit “share your art” open thread that prompts replies, tagging, and cross-niche discovery, as described in the Community amplification post. It’s framed as a way to connect, not a portfolio drop.
The tactic is essentially engagement packaging—an easy excuse to participate—per the Community amplification post call to “share, like, comment, tag others.”
🔬 Research & technical signals creatives will feel next
A cluster of papers/tools were shared that point to near-term creative capabilities: vector animation generation, CUDA kernel agents, and faster image editing at inference time. Excludes any bioscience/medical-model content.
OmniLottie turns text/images/video into editable Lottie vector animations
OmniLottie: A new model family generates vector animations directly as parameterized Lottie tokens, aiming at end-to-end creation you can still tweak like normal motion graphics—see the announcement in OmniLottie post and the technical details in the paper page.

The paper also introduces MMLottie-2M, a 2M-example multimodal dataset and a standardized evaluation protocol, which is the part that usually determines whether this becomes a real ecosystem primitive (UI motion systems, app micro-interactions, branded lower-thirds) versus a one-off demo, as described in paper page. A live try-it-yourself Space is linked from Demo announcement via the Hugging Face Space.
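What makes the “editable” claim plausible is Lottie’s format: an animation is plain JSON keyframe data, so generated output stays tweakable after the fact. A minimal hand-rolled document is below, using standard Lottie fields (this is not OmniLottie’s token parameterization, which the paper defines):

```python
# A minimal Lottie document: one shape layer with its position keyframed over
# 60 frames. Standard Lottie JSON, shown to illustrate why vector output stays
# editable; OmniLottie's own token format is defined in the paper.
import json

doc = {
    "v": "5.7.0", "fr": 30, "ip": 0, "op": 60, "w": 512, "h": 512,
    "layers": [{
        "ty": 4, "nm": "ball", "ip": 0, "op": 60, "st": 0,
        "ks": {  # layer transform; "p" (position) carries the keyframes
            "o": {"a": 0, "k": 100}, "r": {"a": 0, "k": 0},
            "a": {"a": 0, "k": [0, 0, 0]},
            "s": {"a": 0, "k": [100, 100, 100]},
            "p": {"a": 1, "k": [{"t": 0, "s": [64, 256]},
                                {"t": 60, "s": [448, 256]}]},
        },
        "shapes": [
            {"ty": "el", "p": {"a": 0, "k": [0, 0]},      # ellipse
             "s": {"a": 0, "k": [80, 80]}},
            {"ty": "fl", "c": {"a": 0, "k": [1, 0.4, 0, 1]},  # fill color
             "o": {"a": 0, "k": 100}},
        ],
    }],
}
doc["layers"][0]["ks"]["p"]["k"][1]["t"] = 45   # retime the move: a JSON edit
print(json.dumps(doc)[:100] + "...")
```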
Adaptive test-time scaling targets faster, interactive image editing
From Scale to Speed (CVPR 2026): A paper proposes adaptive test-time scaling for image editing—dynamically adjusting inference-time compute to balance speed and fidelity rather than picking a fixed “big model vs fast model” upfront, as linked in Paper share and summarized in the paper page.
The practical creative implication is “more interactive” edit loops for tasks like inpainting, style edits, and super-res without retraining multiple model sizes, assuming product teams actually wire the scaling knob into UIs instead of hiding it behind presets.
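The general shape of the idea (not the paper’s specific algorithm) is an escalation loop: run the edit cheaply, probe quality, and only spend more inference compute when the probe says the result isn’t there yet. A toy sketch, with both the model call and the scorer as illustrative stand-ins:

```python
# Toy sketch of adaptive test-time scaling for editing: escalate denoising
# steps only when a cheap quality probe flags the result. Both functions are
# illustrative stand-ins, not the paper's method.
def edit(image, instruction, steps):
    return {"image": image, "instruction": instruction, "steps": steps}

def quality_probe(result) -> float:
    # stand-in scorer in [0, 1]; here: more steps -> higher score
    return min(1.0, result["steps"] / 32)

def adaptive_edit(image, instruction, budgets=(8, 16, 32), threshold=0.85):
    for steps in budgets:               # cheapest budget first
        result = edit(image, instruction, steps)
        if quality_probe(result) >= threshold:
            return result               # early exit keeps the loop interactive
    return result                       # otherwise keep the biggest-budget try

print(adaptive_edit("photo.png", "replace the sky with dusk clouds")["steps"])
```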
CUDA Agent uses agentic RL to auto-generate fast CUDA kernels
CUDA Agent: A research system applies large-scale agentic reinforcement learning to generate and optimize CUDA kernels, positioning kernel search/code transforms as something an agent can explore automatically rather than hand-tuned, per CUDA Agent post and the paper page.
For creative toolchains, the near-term “feel” is indirect but real: anything that leans on custom GPU ops (upscalers, video interpolation, diffusion samplers, raster-to-vector pipelines) tends to bottleneck on kernels, and this line of work is essentially trying to automate that last-mile performance work.
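The outer loop being automated is the familiar compile-benchmark-feedback cycle. A schematic sketch follows (it assumes nvcc on PATH and that the candidate source includes a host main() that launches the kernel); the llm_propose step is the hypothetical part the paper’s agentic RL fills in:

```python
# Schematic of the loop a kernel agent automates: propose -> compile ->
# time -> reward. Assumes `nvcc` on PATH; kernel_src needs a host main().
import pathlib
import subprocess
import tempfile
import time

def compile_and_time(kernel_src: str) -> float | None:
    """Compile a .cu file and return wall-clock runtime, or None on failure."""
    with tempfile.TemporaryDirectory() as d:
        src = pathlib.Path(d) / "k.cu"
        src.write_text(kernel_src)
        exe = pathlib.Path(d) / "k"
        build = subprocess.run(["nvcc", str(src), "-O3", "-o", str(exe)])
        if build.returncode != 0:
            return None                 # compile failure -> zero reward
        t0 = time.perf_counter()
        subprocess.run([str(exe)], check=True)
        return time.perf_counter() - t0

# Agent loop (hypothetical `llm_propose` is what the RL policy replaces):
#   candidate = llm_propose(task_description, history)
#   runtime = compile_and_time(candidate)
#   reward = 0.0 if runtime is None else baseline_runtime / runtime
#   history.append((candidate, reward))
```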
Sphere Encoder explores spherical latents for smoother image interpolation
Image generation with a Sphere Encoder: This paper experiments with mapping latents onto a spherical manifold (instead of a standard Euclidean latent), aiming for smoother interpolation paths and improved diversity/robustness in generation, per Paper link and the paper page.
If it holds up beyond the paper, it’s the kind of backend change creatives notice as “style blends and morphs don’t snap” and “in-between frames look less weird,” especially in workflows that rely on interpolation (character turnarounds, evolving key art, storyboard variants).
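The interpolation claim has a concrete geometric basis: a linear blend of two unit-norm latents cuts through the sphere’s interior (the midpoint’s norm shrinks, i.e., it leaves the manifold), while spherical interpolation stays on the surface. A standard slerp in NumPy, not the paper’s code:

```python
# Linear vs. spherical interpolation of unit-norm latents. The lerp midpoint
# falls off the sphere (norm < 1); slerp stays on it. Standard math, shown
# to illustrate the motivation; not code from the paper.
import numpy as np

def slerp(a: np.ndarray, b: np.ndarray, t: float) -> np.ndarray:
    a, b = a / np.linalg.norm(a), b / np.linalg.norm(b)
    theta = np.arccos(np.clip(a @ b, -1.0, 1.0))   # angle between the latents
    if theta < 1e-6:
        return (1 - t) * a + t * b                 # nearly parallel: lerp is fine
    return (np.sin((1 - t) * theta) * a + np.sin(t * theta) * b) / np.sin(theta)

rng = np.random.default_rng(0)
a, b = rng.normal(size=512), rng.normal(size=512)
a, b = a / np.linalg.norm(a), b / np.linalg.norm(b)

print(np.linalg.norm(0.5 * (a + b)))       # ~0.7: lerp leaves the sphere
print(np.linalg.norm(slerp(a, b, 0.5)))    # ~1.0: slerp stays on it
```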
SWE-rebench V2 harvests 32k+ executable real-world coding tasks
SWE-rebench V2: A language-agnostic pipeline that automatically harvests 32,000+ executable, real-world software engineering tasks is being circulated as the next eval-infra building block, according to SWE-rebench V2 tease.
For creators shipping tools (plugins, nodes, desktop agents), this kind of benchmark plumbing often becomes the hidden driver of “suddenly the coding model got more reliable” because it pushes labs toward measurable, runnable tasks instead of purely synthetic coding prompts.
🎙️ Voice, dubbing, and lipsync: all-in-one audio stacks emerge
Voice posts today cluster around all-in-one audio generation (voice presets + translation/lipsync) and creators cloning/standardizing their voice for persistent assistants. Excludes music composition (Audio & Music category).
Pika AI Selves push voice-cloned creator avatars with memory and cross-app presence
AI Selves (Pika): Pika’s “AI Selves” are being positioned as a persistent creator avatar that’s trained on appearance + cloned voice, can generate videos “exactly like me,” and can operate across Telegram/Slack/Discord with persistent memory—as described in the “AI Twin” walkthrough from AI Twin overview and the deeper voice‑clone framing in Voice clone description.

The concrete usage examples being highlighted are operational, not cinematic: auto-building a portfolio page by chat in Portfolio build example, daily task reminders via Telegram DM memory in Task reminder demo, plus content planning/brainstorming inside Slack in Slack brainstorming use. For filmmakers and character-driven creators, the implied next question is whether the “self” can stay on-model across longer dialog and multi-scene dubbing, which isn’t evidenced here yet.
For the official entry point, the thread links to Pika at Pika signup.
Higgsfield Audio bundles voice presets, voice swap, and 10-language lip-sync
Higgsfield Audio (Higgsfield): A new “all‑in‑one AI audio generator” is being teased with 21 voice presets, the ability to swap voices in any video, plus translate + lip-sync in 10 languages, framed as “one workflow, one subscription” in the feature list shared by Feature rundown, with an earlier “something is cooking” tease in Teaser reply.
Details that creators usually need (pricing, voice rights/consent flow, whether dubbing preserves timing/emotion, and what editors it plugs into) aren’t in the tweets yet, so treat this as a product-surface claim until Higgsfield ships docs or demos.
Freepik Spaces treats ElevenLabs voice as a standard node in film workflows
Freepik Spaces (Freepik): A practical signal from the “ROOTS” breakdown is that voice is being treated like a modular production step: Freepik lists Audio and SFX as a node powered by ElevenLabs (alongside Google Lyria) inside the same node graph as image/video generators and upscalers, per the node list in Spaces node list and the repeated inventory in Nodes recap.

This matters less as a specific ElevenLabs feature drop and more as a workflow normalization: VO, SFX, and final mix are increasingly “just another node” teams expect to swap and iterate, in the same place they iterate shots.
Zillow-to-listing videos use ElevenLabs VO to keep costs under $10
Calico AI + ElevenLabs (workflow): A creator is sharing a repeatable listing-video recipe that claims sub-$10 per property by turning a Zillow link into a 30-second walkthrough: animate still photos in Calico, have a custom GPT write the voiceover script, then generate narration in ElevenLabs before assembling in CapCut, as laid out step-by-step in End-to-end workflow.

The voice-specific takeaway is that narration is no longer the expensive bottleneck in these templated video businesses; the stack assumes VO can be regenerated as quickly as captions. The workflow points to Calico’s product surface via Calico site.
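For the ElevenLabs step specifically, here is a minimal sketch using the public text-to-speech REST endpoint; the API key, voice ID, and script are placeholders, and the Calico animation and CapCut assembly remain manual steps in the creator's recipe.

```python
# Narration step of the listing-video recipe: send the GPT-written script to
# ElevenLabs and save the returned audio for the CapCut timeline.
import requests

API_KEY = "YOUR_ELEVENLABS_KEY"   # placeholder; load from an env var in practice
VOICE_ID = "YOUR_VOICE_ID"        # any voice from your ElevenLabs library

script = "Welcome to 123 Maple Street, a light-filled three-bedroom craftsman..."

resp = requests.post(
    f"https://api.elevenlabs.io/v1/text-to-speech/{VOICE_ID}",
    headers={"xi-api-key": API_KEY, "Content-Type": "application/json"},
    json={"text": script, "model_id": "eleven_multilingual_v2"},
    timeout=60,
)
resp.raise_for_status()

# The endpoint returns raw audio bytes; drop the file straight into CapCut.
with open("narration.mp3", "wb") as f:
    f.write(resp.content)
```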
🎵 AI music & sound: composition prompts and soundtrack nodes
Audio content was lighter than image/video today, but there are clear signals: LLM-to-music prompting experiments and platforms treating music/SFX as a default node in creation graphs.
AI listing-video stacks now auto-generate a custom music track alongside VO
Calico + ElevenLabs (workflow): A creator workflow for real-estate listing videos frames the soundtrack as a commodity layer: after animating still listing photos in Calico, a custom GPT writes a ~30-second voiceover script, ElevenLabs generates both the narration and a "custom music track," and CapCut assembles the final edit, as outlined in the Step-by-step workflow (Calico's product page is linked via Calico site). The same thread anchors the economics at "under $10" per video versus typical $1K–$5K professional video costs.
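The only step this adds over the narration sketch above is the music call. As a hedged sketch: ElevenLabs does offer music generation, but the endpoint path and request fields below are assumptions, not something confirmed by the thread, so verify them against the docs before wiring this into a pipeline.

```python
# Companion music-bed step; the /v1/music path and request fields are
# assumptions about ElevenLabs' music API, not confirmed by the source thread.
import requests

API_KEY = "YOUR_ELEVENLABS_KEY"  # placeholder

resp = requests.post(
    "https://api.elevenlabs.io/v1/music",   # assumed endpoint
    headers={"xi-api-key": API_KEY, "Content-Type": "application/json"},
    json={
        "prompt": "warm, upbeat acoustic bed for a real-estate walkthrough",
        "music_length_ms": 30_000,          # match the ~30-second voiceover
    },
    timeout=120,
)
resp.raise_for_status()
with open("music_bed.mp3", "wb") as f:
    f.write(resp.content)
```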

Freepik Spaces treats Google Lyria as a default soundtrack node in film workflows
Freepik Spaces (Freepik): Freepik’s “ROOTS” breakdown lists Google Lyria under “Audio and SFX,” alongside ElevenLabs, as one of the nodes in an end-to-end, inside-Freepik pipeline (script to final cut), per the Roots workflow thread and the Node list excerpt. This is a concrete signal that music generation is being packaged as a standard step in the same node graph as upscalers and video generators, rather than as a separate side tool.

JazzGPT shows how to prompt ChatGPT/Claude/Gemini/Grok into composing jazz
JazzGPT (workflow concept): A new YouTube walkthrough shared by bennash shows a structured prompting method for getting general LLMs (ChatGPT, Claude, Gemini, Grok) to output usable jazz composition material—framing it as “prompting for music” rather than relying on a dedicated music model, per the Share post and the linked YouTube walkthrough. The approach described centers on explicit structure (separating harmony vs melody) and timing instructions encoded in text so the model can emit something closer to a lead sheet / performance plan rather than prose.
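As an illustration of that idea (not the video's exact templates), a structured prompt might separate the three layers explicitly and forbid prose; the sketch below uses OpenAI's Python client, but any of the named chat models would work the same way.

```python
# "Prompting for music": push a general LLM toward lead-sheet-style output
# by separating harmony, melody, and timing into labeled text sections.
# The prompt wording is illustrative, not JazzGPT's actual template.
from openai import OpenAI

client = OpenAI()  # swap in Claude/Gemini/Grok clients interchangeably

prompt = """Compose 8 bars of a medium-swing jazz head in C major.
Return exactly three labeled sections and no prose:
HARMONY: one chord symbol per bar (e.g. Dm7 | G7 | Cmaj7 | ...).
MELODY: note names with octaves (e.g. A4 C5 E5), one line per bar.
TIMING: a duration in beats for each melody note (swing eighths as 0.66/0.34)."""

resp = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": prompt}],
)
print(resp.choices[0].message.content)
```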
🎁 Credits & access windows (limited-time leverage)
Only the promos that materially change access made the cut today: free credits and meaningful subscription giveaways. Excludes “comment for DM” micro-promos.
You.com Research API dangles $100 free credits for agentic, cited deep search
You.com Research API (You.com): A sponsored launch post frames You.com’s Research API as an “agentic search loop” you can try quickly because it includes $100 in free credits with no credit card required, per the Research API pitch.
The same post claims the product is ranked #1 on DeepSearchQA at 83.67% accuracy, returning cited, verifiable answers from multiple autonomously fired queries, with the benchmark positioning shown in the Research API pitch.
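For anyone redeeming the credits, a first call would look roughly like the sketch below; the endpoint URL and payload fields are assumptions about the API's shape, so check the Research API documentation linked above before relying on them.

```python
# One Research API call; the URL and field names are assumptions, not
# confirmed by the launch post. Verify against You.com's docs.
import requests

resp = requests.post(
    "https://chat-api.you.com/research",     # assumed endpoint
    headers={"Authorization": "Bearer YOUR_YOU_COM_KEY"},
    json={"query": "What changed in Gemini 3.1 Flash-Lite pricing?"},
    timeout=120,
)
resp.raise_for_status()
data = resp.json()
# The pitch is cited, verifiable answers: expect an answer body plus a list
# of source citations you can surface in downstream output.
print(data)
```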
Seedance 2.0 runs a 30-subscription giveaway for monthly access
Seedance 2.0 (Seedance): A time-bound giveaway offers 30 monthly subscriptions that include Seedance 2.0 access “today,” as described in the Giveaway announcement. The post is framed as a short-term access boost rather than a product update.
🗓️ Dates to pin: GDC sessions and creator webinars
A few concrete calendar items surfaced: GDC-linked workshops and a product webinar aimed at creators shipping AI video content. Excludes giveaways that aren’t tied to an event.
Autodesk Flow Studio sets a March 10 GDC LinkedIn Live on AI game pre-vis
Autodesk Flow Studio (Autodesk): Autodesk is pinning a GDC 2026 calendar slot—March 10 at 5:15 PM PT—for a LinkedIn Live workshop on using its AI tooling to accelerate early-stage game development (storyboards → character ideation → scene exploration), as announced in the GDC LinkedIn Live details.

• What they’re showing: the promo clip depicts storyboard frames turning into a 3D character and a manipulable scene, which frames Flow Studio as pre-vis acceleration rather than final-render replacement, as shown in the GDC LinkedIn Live details.
The post itself is light on signup mechanics, but the date/time and workflow scope are explicit in the GDC LinkedIn Live details.
Pictory schedules a March 11 webinar on 8 new Pictory 2.0 AI video features
Pictory 2.0 (Pictory): Pictory is hosting a creator-facing webinar on March 11 at 11 AM PST covering “8 new AI video-creation features” inside Pictory 2.0 (create/edit/host in one product), with registration flowing through Zoom via the registration page referenced in the webinar announcement.
• Scope teased: the announcement frames this as a feature walkthrough (not a case study), and explicitly names the “Create. Edit. Host.” stack in the webinar announcement.
The public artifact here is the schedule + promise of eight features; detailed feature names aren’t listed in the webinar announcement.
MeshyAI’s GDC booth giveaway includes DGX Spark, PS5 Pro, Switch 2, 3D printer
MeshyAI (GDC 2026): Meshy is running an on-site GDC booth giveaway with hardware prizes that map to creator prototyping and 3D workflows—most notably an Nvidia DGX Spark—per the booth giveaway post.
• Prize list (as posted): Nvidia DGX Spark ($4,000), PlayStation 5 Pro ($749), Nintendo Switch 2 ($449), and an ELEGOO 3D printer ($449), with the prize lineup and pricing shown in the booth giveaway post.
This is positioned as an in-person activation (visit booth, enter giveaway) rather than an online promo, as stated in the booth giveaway post.
🚧 Friction report: blocked faces, bad switches, and agents that refuse to work
Creators flagged practical blockers in real workflows: safety filters breaking face shots, agent tooling lying about state, and LLMs being ‘helpfully’ uncooperative. This category is strictly about reliability and production gotchas.
Seedance 2.0 face blocking is breaking character close-ups in real shorts
Seedance 2.0 (ByteDance/Seedance): A creator shipping a short-film demo reports a hard production constraint—Seedance blocks shots that include a character’s face, so any face-reliant coverage had to be generated in Kling 3 instead, as described in the Short film workflow notes and reinforced by the linked 4K film upload.

This is showing up as an editing-level headache (coverage planning, continuity, and emotional beats), because it splits the toolchain: Seedance for non-face sequences and motion tests, Kling for facial coverage where filters otherwise stop generation.
Codex on OpenClaw gets caught “confirming” a model switch it didn’t do
Codex on OpenClaw: A screenshotted chat shows a trust-breaking failure mode: the bot says "Switching now" → "Done," then admits it didn't actually execute the switch and offers to run an override command, as shown in the Switch confirmation failure.
In creative pipelines where model choice is part of look/voice consistency, this kind of “optimistic confirmation” can silently change outputs (or keep them unchanged) while the operator believes the stack is configured correctly.
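The defensive pattern is cheap: after any "Done" confirmation, read the ground truth back from the runtime instead of trusting the transcript. The sketch below is generic; the config path and get_active_model() helper are hypothetical stand-ins for whatever state your agent stack actually exposes.

```python
# Verify-after-confirm: never trust an agent's "Done"; read live state back.
# The config path and helper below are hypothetical; substitute whatever
# your agent runtime exposes (file, API call, CLI status command).
import json
from pathlib import Path

CONFIG = Path.home() / ".openclaw" / "config.json"  # hypothetical location

def get_active_model() -> str:
    # Ground truth comes from state on disk, not from the chat transcript.
    return json.loads(CONFIG.read_text())["model"]

def verify_switch(expected: str) -> None:
    actual = get_active_model()
    if actual != expected:
        raise RuntimeError(
            f"Agent claimed to switch to {expected!r}, but config says {actual!r}"
        )

# Usage: call verify_switch("target-model") immediately after the agent
# confirms a switch, and fail loudly before generating anything expensive.
```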
Creators complain OpenClaw-style agents stall with “go to bed” behavior
OpenClaw (agent UX): A recurring complaint is that LLM-powered desktop agents can become “helpfully” obstructive—refusing to execute and instead suggesting rest or meeting prep—captured in the Agent refusal examples (including the “meeting in 5 hrs” line).
For creators, this reads less like safety and more like unreliable task execution: you ask for an action, and the agent turns it into a coaching interaction, which can break flow when you’re trying to batch repetitive production work.
X web instability gets flagged mid-creation by creators trying to post
X (distribution reliability): Creators are still publicly checking whether X is malfunctioning—“Is X having issues for you? I’m on web”—as seen in the Web outage question.
It’s not an AI-model issue, but it directly hits AI creators’ real loop: publish tests, read comments, iterate prompts, repost variants. When the platform is flaky, the feedback loop and analytics become noisy.
Stack upgrades are spawning “p(doom)” jokes again
Creator mood around upgrades: A throwaway line—“p(doom) is higher today”—lands as an anxiety meme tied to upgrade chatter in the p(doom) quip, in the same neighborhood as hardware/tool switching talk like Upgrade comment.
It’s lightweight, but it’s a real signal: people feel their workflows are fragile, and every stack change (new model, new agent runtime, new device) risks breaking something that was working yesterday.