breakingMarch 16, 2026

Artificial Analysis ranks Nemotron 3 VoiceChat at 77.8% conversational dynamics

Artificial Analysis published results for NVIDIA's Nemotron 3 VoiceChat, putting the 12B model at the open-weight pareto frontier across conversational dynamics and speech reasoning. Consider it for open voice agents, but compare against proprietary systems that still lead the category by a wide margin.

Voice Agents Realtime AI Benchmarks

3 min read

Artificial Analysis ranks Nemotron 3 VoiceChat at 77.8% conversational dynamics

TL;DR

Artificial Analysis says NVIDIA's Nemotron 3 VoiceChat is a ~12B open-weight speech-to-speech model that now sits on the open-model Pareto frontier for both conversational dynamics and speech reasoning.
On Artificial Analysis' benchmarks, the benchmark thread puts Nemotron 3 VoiceChat at 77.8% on conversational dynamics and 29.2% on Big Bench Audio speech reasoning, making it the only open model in its comparison set that lands near the top on both axes.
The same comparison post says open speech-to-speech models still trail proprietary systems badly, with Step-Audio R1.1 at 96% on Big Bench Audio and Grok Voice Agent and Gemini 2.5 Flash (Thinking) both at 92%.
NVIDIA appears to be positioning the model as more than a quiet research drop: Artificial Analysis linked NVIDIA's early access