Mistral Small 4 — Signal Lens

Overview

Freshness note: Model capabilities, limits, and pricing can change quickly. This profile is a point-in-time snapshot last verified on April 13, 2026.

Mistral Small 4 is Mistral AI’s latest small-model open-weight release, aimed at teams that want a stronger quality-per-dollar envelope without moving into the cost and infrastructure footprint of large frontier systems. In practical terms, it extends Mistral’s “small but production-usable” lane rather than replacing the large-model tier.

For Signal Lens readers, it matters because it is one of the clearest current examples of an open-weight model that can plausibly sit in real assistant and coding workflows instead of only acting as a local curiosity.

Capabilities

Mistral positions Small 4 as a general long-context model that is still useful for coding, assistant work, and enterprise-style retrieval flows. It is not the replacement route for deprecated Devstral 2 in Mistral’s current docs; that role now belongs to Mistral Medium 3.5. Small 4 is still meant to hold up in developer workflows where cost and deployment flexibility matter.

This makes it attractive for high-volume internal copilots, private knowledge assistants, or self-hosted team helpers where a good-enough open model beats a more expensive proprietary default.

Technical Details

Official Mistral docs list Mistral Small 4 as a 26.03 open release with a 256K context window. The same docs publish the model as a 119B-parameter architecture with 6.5B active parameters, which is the most important technical detail because it explains why the model can aim above older “small model” expectations without paying full dense-model cost at inference time.

As with several Mistral open releases, Mistral’s public docs foreground context-window size rather than a distinct completion ceiling. This profile therefore records maxOutput as the same 256K snapshot limit for consistency with the repo’s comparison model rather than as a separately published provider guarantee.

Pricing & Access

Published Mistral API pricing is:

Input: $0.15 per 1M tokens
Output: $0.60 per 1M tokens

The model is also available as open weights, which matters more here than it would for many premium proprietary releases. Access choice is part of the value proposition.

Best Use Cases

Use Mistral Small 4 for long-context internal assistants, low-cost code review helpers, multilingual document pipelines, and privacy-sensitive deployments where API spend or data residency is a hard constraint.

It is a better fit for “deploy broadly and cheaply” than for “squeeze every last drop of frontier reasoning performance out of a hard problem.”

Comparisons

Mistral Small 3.2 (Mistral AI): Small 4 is the documented replacement and newer higher-capability small-model route with a much larger published context window.
Mistral Medium 3.5 (Mistral AI): Medium 3.5 is the stronger current Mistral route for agentic coding; Small 4 is the broader efficient generalist.
Qwen3.5 (Alibaba): Qwen3.5 remains strong for bilingual and APAC-oriented deployments; Small 4 is a strong Western open alternative for global or EU-oriented teams.