Mistral shipped Mistral Small 4, a 119B MoE model with 6.5B active parameters, multimodal input, configurable reasoning, and Apache 2.0 weights. Teams running SGLang or vLLM can deploy it into existing stacks immediately, since both added day-one support.

In Mistral's API, the mistral-small-latest alias now maps to "Mistral Small 4," and the launch post links to the official model release. The model arrived after pre-release signs in a Hugging Face integration PR, which surfaced the core packaging before launch. That material described a "powerful hybrid model" that "unifies" Instruct, Reasoning, and Devstral-style capabilities in one model, with 128 experts, 4 active experts, 119B total parameters, and 6.5B active per token, as shown in the pre-release PR leak and the architecture screenshot.
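The 128-expert, 4-active routing described above can be sketched as a toy top-k gating step. This is illustrative only (random logits, stdlib-only softmax), not Mistral's implementation; the constants come from the PR leak.

```python
import math
import random

NUM_EXPERTS = 128   # total experts per MoE layer (from the PR leak)
TOP_K = 4           # experts activated per token

def route_token(logits, k=TOP_K):
    """Pick the top-k experts for one token and softmax-normalize
    their gate weights, as in standard top-k MoE routing."""
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    exps = [math.exp(logits[i]) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

random.seed(0)
logits = [random.gauss(0, 1) for _ in range(NUM_EXPERTS)]
chosen = route_token(logits)
print(chosen)  # 4 (expert_index, weight) pairs; weights sum to 1
```

Because only 4 of 128 experts fire per token, serving cost tracks the 6.5B active-parameter figure rather than the 119B total, which is what makes a model this size viable on modest hardware.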
The shipped model keeps a broad feature surface for one checkpoint: multimodal input with text output, configurable reasoning effort per request, native function calling, JSON output, multilingual support, and a 256K context window. Mistral is also releasing it as open weights under Apache 2.0, and the Hugging Face collection makes clear this is a family of checkpoints rather than a single artifact.
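Per-request reasoning effort means clients pick a mode at call time instead of switching checkpoints. A minimal sketch of an OpenAI-style chat payload follows; the `reasoning_effort` knob and `response_format` field names are assumptions made for illustration and may differ from Mistral's actual API.

```python
import json

def build_request(prompt: str, effort: str = "low") -> dict:
    """Build a chat-completion payload with a per-request reasoning
    knob. `reasoning_effort` is an assumed field name, shown only to
    illustrate the configurable-reasoning idea."""
    assert effort in {"low", "medium", "high"}
    return {
        "model": "mistral-small-latest",
        "messages": [{"role": "user", "content": prompt}],
        "reasoning_effort": effort,                  # assumed knob name
        "response_format": {"type": "json_object"},  # JSON output mode
    }

payload = build_request("Summarize this contract.", effort="high")
print(json.dumps(payload, indent=2))
```

The practical upshot: one deployed checkpoint can serve both cheap instruct traffic and expensive reasoning traffic, routed by a single request field.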
SGLang shipped day-one support, and LMSYS's post includes a concrete server command that uses mistralai/Mistral-Small-4-119B-2603 plus --tool-call-parser mistral and --reasoning-parser mistral, so existing tool-calling pipelines need no custom glue to expose the model's agentic and hybrid reasoning modes. In the same announcement, LMSYS claims "3× more RPS vs Mistral Small 3," framing the release as a throughput play as much as a capability upgrade.
vLLM also added day-one support, with its launch note calling out MLA attention, tool calling, and configurable reasoning mode, verified on NVIDIA GPUs. The example container config exposes the operational knobs engineers actually care about for rollout: a 262,144-token max model length, the Flash Attention MLA backend, tensor parallel size 2, automatic tool choice, and batching settings up to 16,384 tokens and 128 sequences.
Mistral is positioning Small 4 as a consolidation release, not just a smaller checkpoint. The comparison chart in the launch materials shows separate instruct and reasoning results for the same model, with reasoning mode lifting GPQA Diamond from 59.1 to 71.2, MMLU-Pro from 73.5 to 78.0, IFBench from 35.7 to 48.0, and MMMU-Pro from 46.3 to 60.0; Arena Hard improves more modestly, from 55.8 to 58.3. A reposted chart shows the same numbers.
That launch landed alongside Mistral's new NVIDIA partnership, which the announcement thread framed as co-developing “frontier open-source AI models.” In practice, that gives Small 4 more than a model-card moment: it shipped with immediate availability in Mistral Playground via the model picker and immediate support in two popular open serving stacks, which is the part most likely to matter for engineering teams evaluating it this week.
Vercel Emulate added a programmatic API for creating, resetting, and closing local GitHub, Vercel, and Google emulators inside automated tests. That makes deterministic integration tests easier to wire into CI and agent loops without manual setup.
OpenClaw shipped version 2026.3.22 with ClawHub, OpenShell plus SSH sandboxes, side-question flows, and more search and model options, then followed with a 2026.3.23 patch. Teams get a broader plugin surface, but should patch quickly and review plugin trust boundaries as the ecosystem grows.
Cursor shipped Instant Grep, a local regex index built from n-grams, inverted indexes, and Bloom filters that drops large-repo searches from seconds to milliseconds. Faster candidate retrieval shortens the coding-agent loop, especially when ripgrep-style scans become the bottleneck.
ChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.
Epoch AI says GPT-5.4 Pro elicited a publishable solution to one 2019 conjecture in its FrontierMath Open Problems set, with a formal writeup planned. Treat it as an early milestone worth reproducing, not blanket evidence that frontier models can already automate math research.
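The n-gram trick behind Cursor's Instant Grep can be illustrated with a tiny trigram index: extract the trigrams of a literal query, intersect per-trigram posting lists of file IDs, and only regex-scan the surviving candidates. This is a generic sketch of the technique, not Cursor's code; in practice a Bloom filter per file can answer the "does this file contain this trigram" test even more cheaply than the sets used here.

```python
import re
from collections import defaultdict

def trigrams(s: str):
    """All 3-character substrings of s."""
    return {s[i:i + 3] for i in range(len(s) - 2)}

class TrigramIndex:
    """Map each trigram to the set of file IDs containing it, so a
    search only regex-scans files that hold every query trigram."""
    def __init__(self):
        self.postings = defaultdict(set)
        self.files = {}

    def add(self, file_id, text):
        self.files[file_id] = text
        for g in trigrams(text):
            self.postings[g].add(file_id)

    def search_literal(self, query):
        grams = trigrams(query)
        if not grams:   # query too short to filter; scan everything
            candidates = set(self.files)
        else:           # intersect posting lists to prune candidates
            candidates = set.intersection(*(self.postings[g] for g in grams))
        pat = re.compile(re.escape(query))
        return sorted(f for f in candidates if pat.search(self.files[f]))

idx = TrigramIndex()
idx.add("a.py", "def instant_grep(): pass")
idx.add("b.py", "print('hello world')")
print(idx.search_literal("grep"))  # ['a.py']
```

The speedup comes from the candidate set shrinking with each intersected posting list: on a large repo, most files never reach the regex engine at all.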
Mistral Small 4 is out huggingface.co/collections/mi…
🎉 Congrats on launching Mistral Small 4 from @MistralAI, day-0 support is now live in SGLang! Mistral Small 4 is a 119B MoE model that unifies Instruct, Reasoning, and Agentic capabilities into a single model. ⚡️ Efficient MoE: 3× more RPS vs Mistral Small 3 🧠 Hybrid…
🎉 Congrats to @MistralAI on releasing Mistral Small 4 — a 119B MoE model (6.5B active per token) that unifies instruct, reasoning, and coding in one checkpoint. Multimodal, 256K context. Day-0 support in vLLM — MLA attention backend, tool calling, and configurable reasoning…
We're so back Mistral has announced a partnership with NVIDIA to develop frontier open source models... But also released Mistral Small 4 🔥 - 100% open source - 6.5B activated parameters - Reasoning & non-reasoning mode - Beat the previous (closed) medium And it's multimodal
🚀Announcing a strategic partnership with NVIDIA to co-develop frontier open-source AI models, combining Mistral AI’s frontier model architecture and full-stack AI offering with NVIDIA’s leading compute infrastructure and development tools.
BREAKING 🚨: Mistral AI and NVIDIA joining forces to co-develop frontier open-source AI models! Release of Mistral Small 4 has been mentioned in the blog post as well. “As part of this commitment, today Mistral AI is releasing Mistral Small 4 to empower developers,…”
BREAKING 🚨: Mistral AI is preparing Mistral 4 models for the upcoming release, as a new PR has been opened to Huggingface. Mistral-Small-4-119B-2603 is on the horizon!
Mistral 4 is coming - "Mistral 4 is a powerful hybrid model with the capability of acting as both a general instruction model and a reasoning model. It unifies the capabilities of three different model families - Instruct, Reasoning (previously called Magistral), and Devstral -