Google launched Gemini 3.1 Flash Live in AI Studio, the API, and Gemini Live with stronger audio tool use, lower latency, and 128K context. Voice-agent teams should benchmark quality, latency, and thinking settings before switching.

Google's launch details describe Gemini 3.1 Flash Live as its fastest native realtime model for building agents, with 70 languages, video streaming, audio transcriptions, 128K context, and generated audio watermarked with SynthID. The same post points developers to the Live API docs and shows the SDK surface for client.aio.live.connect using gemini-3.1-flash-live-preview with audio response modalities.
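Based on the SDK surface the post shows, a minimal connection sketch with the google-genai Python SDK might look like the following. The model name and audio response modality come from the launch material; everything inside the session (the greeting turn, the receive loop) is illustrative, not the announced API surface.

```python
# Minimal sketch of opening a Live API session with the google-genai SDK.
# MODEL and CONFIG come from Google's launch post; the session body is an
# illustrative assumption.
MODEL = "gemini-3.1-flash-live-preview"
CONFIG = {"response_modalities": ["AUDIO"]}

async def talk() -> None:
    # Deferred import so the sketch parses even where google-genai isn't installed.
    from google import genai

    client = genai.Client()  # assumes GEMINI_API_KEY is set in the environment
    async with client.aio.live.connect(model=MODEL, config=CONFIG) as session:
        await session.send_client_content(
            turns={"role": "user", "parts": [{"text": "Hello there"}]}
        )
        async for message in session.receive():
            if message.data:  # raw audio bytes streamed back by the model
                pass  # hand off to your audio playback pipeline

# To run for real: asyncio.run(talk()) with a valid GEMINI_API_KEY.
```

The connect call is an async context manager, so the WebSocket session closes cleanly when the block exits.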
The product pitch is not just lower latency. DeepMind's announcement thread says the model is "better at completing tasks," handles noisy environments better, and can "follow long conversations" so users do not need to repeat themselves. The consumer Gemini app announcement adds that conversations can run through "2x longer" exchanges and that response length and tone adjust dynamically in session.
Google's benchmark post claims a "step function improvement in quality, reliability, and latency," and the biggest visible delta is audio tool use. Its chart shows 90.8% on ComplexFuncBench Audio versus 71.5% for Gemini 2.5 Flash Native Audio (12-2025) and 66.0% for the 09-2025 version. On speech reasoning, the same launch material shows 95.9% on Big Bench Audio with high thinking, behind only Step-Audio R1.1 at 97.0% and ahead of Grok Voice Agent at 92.9%.
Artificial Analysis' benchmark fills in the operational cost of that gain. With thinking set to high, it measured 95.9% on Big Bench Audio and 2.98 seconds time to first audio (TTFA); with minimal thinking, the model drops to 70.5% but improves to 0.96 seconds TTFA, which its speed data calls the sixth-fastest result on the speech-to-speech leaderboard. Artificial Analysis also says pricing stayed flat versus Gemini 2.5 Flash Native Audio Dialog at $0.35 per hour of audio input and $1.38 per hour of audio output, excluding reasoning tokens.
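At those rates, per-session cost is easy to estimate. The sketch below uses the quoted $0.35/hr input and $1.38/hr output audio pricing and excludes reasoning tokens, matching the quote; the 50/50 split of talk time between user and model is an illustrative assumption.

```python
# Rough session cost at the per-hour audio rates Artificial Analysis quotes
# for Gemini 3.1 Flash Live. Reasoning tokens are excluded, as in the quote.
INPUT_RATE_PER_HOUR = 0.35   # user audio in
OUTPUT_RATE_PER_HOUR = 1.38  # model audio out

def session_cost(total_minutes: float, user_share: float = 0.5) -> float:
    """Estimated dollar cost of a voice session of total_minutes of audio."""
    input_hours = total_minutes * user_share / 60
    output_hours = total_minutes * (1 - user_share) / 60
    return input_hours * INPUT_RATE_PER_HOUR + output_hours * OUTPUT_RATE_PER_HOUR

# A 10-minute call split evenly works out to roughly $0.14.
print(f"${session_cost(10):.4f}")
```

Because output audio costs roughly 4x input audio, sessions where the model does most of the talking will skew noticeably more expensive than this even-split estimate.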
Availability landed immediately across Google's own surfaces and partner tooling. LiveKit's support announcement says this is the first Gemini 3 native audio model on the Live API and highlights better instruction following, improved tool calling, reduced speaker drift, and support for 70-plus languages inside its agents framework.
Developers also spotted the model in AI Studio on launch day. The AI Studio listing labels gemini-3.1-flash-live-preview as a low-latency audio-to-audio model optimized for realtime dialogue with "acoustic nuance detection, numeric precision, and multimodal awareness," while TestingCatalog separately reported rollout across AI Studio, APIs, and Gemini Live. Together, that makes this a same-day launch across Google's consumer app, developer API, and at least one major voice-agent integration path.
Lyria 3 Pro and Lyria 3 Clip are now in the Gemini API and AI Studio, with Lyria 3 Pro priced at $0.08 per song and able to structure tracks into verses and choruses. That gives developers a clearer path to longer-form music features, with watermarking and prompt design built in.
Breaking: Anthropic said free, Pro, and Max users will hit 5-hour Claude session limits faster on weekdays from 5am to 11am PT, while weekly caps stay the same. Shift long Claude Code jobs off-peak and watch prompt-cache misses.
Release: OpenAI rolled out Codex plugins across the app, CLI, and IDE extensions, with app auth, reusable skills, and optional MCP servers. Teams should test plugin-backed workflows and permission models before broad rollout.
Release: Cline launched Kanban, a local multi-agent board that runs Claude, Codex, and Cline CLI tasks in isolated worktrees with dependency chains and diffs. Teams can use it as a visual control layer for parallel coding agents on repo chores that split cleanly.
Release: Mistral released open-weight Voxtral TTS with low-latency streaming, voice cloning, and cross-lingual adaptation, and vLLM Omni shipped day-0 support. Voice-agent teams should compare quality, latency, and serving cost against closed APIs.
Google has released Gemini 3.1 Flash Live Preview, achieving #2 in our Big Bench Audio Speech to Speech model benchmark, and now features configurable thinking levels. With thinking level set to high, it scores 95.9% on Big Bench Audio, making it the second-highest scoring speech…
Try it out today: docs.livekit.io/agents/models/…
Say hello to Gemini 3.1 Flash Live. 🗣️ Our latest audio model delivers more natural conversations with improved function calling – making it more useful and informed. Here's what's new 🧵
We just launched Gemini 3.1 Flash Live! Our fastest, most natural real-time voice AI model for building Agents. - Scores 90.8% on ComplexFuncBench Audio for tool use. - 70 languages, video streaming, audio transcriptions, 128K context. - Comes with Agent Skill for building live…
Gemini 3.1 Flash live is now on AI Studio
gemini-3.1-flash-live-preview appears to now be available on Vertex! 3.1 Flash Live before 3.1 Flash though? lol