Grok 4.1 Fast — Signal Lens

Overview

Freshness note: Model capabilities, limits, and pricing can change quickly. This profile is a point-in-time snapshot last verified on July 10, 2026.

Grok 4.1 Fast is now a deprecated xAI tool-calling route. According to xAI’s May 2026 retirement guide, both grok-4-1-fast-reasoning and grok-4-1-fast-non-reasoning were retired from the API on May 15, 2026 at 12:00 PM PT, and requests to either ID now redirect to Grok 4.3.

This page is retained so teams can identify old integrations and migrate deliberately. xAI routes the reasoning variant to Grok 4.3 with low reasoning effort and the non-reasoning variant to Grok 4.3 with reasoning_effort set to none.
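The redirect described above can be made explicit in client code rather than left to the server. The following is a minimal sketch, not xAI's implementation: the replacement ID "grok-4.3" is a placeholder assumption, and the parameter name reasoning_effort with the values "low" and "none" is taken from the retirement guidance as summarized on this page.

```python
# Retired model IDs named in the retirement guide, mapped to explicit replacements.
# "grok-4.3" is a placeholder (assumption); confirm the real identifier before use.
REPLACEMENT_MODEL = "grok-4.3"

REDIRECTS = {
    "grok-4-1-fast-reasoning": {
        "model": REPLACEMENT_MODEL,
        "reasoning_effort": "low",
    },
    "grok-4-1-fast-non-reasoning": {
        "model": REPLACEMENT_MODEL,
        "reasoning_effort": "none",
    },
}


def migrate_request(params: dict) -> dict:
    """Return a copy of params with a retired model ID replaced explicitly."""
    redirect = REDIRECTS.get(params.get("model"))
    if redirect is None:
        return dict(params)  # not a retired ID; leave the request unchanged
    # Any reasoning_effort already present in params is overwritten here.
    return {**params, **redirect}
```

Pinning the replacement in code makes the migration visible in review and avoids depending on a server-side redirect that could change.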

Capabilities

Historically, the official launch and pricing pages highlighted this capability profile:

Strong tool-calling performance for server-side agents.
Large 2M-token context window for long-horizon workflows.
Agentic search and research with web, X, code execution, file search, and MCP-style tool support.
Better long-context stability than earlier xAI fast models, with explicit emphasis on multi-turn performance.
Practical fit for enterprise tasks where speed, cost, and tool use all matter together.

Grok 4.1 Fast is no longer the xAI model line to choose for new production agents. Use this page to track redirects; for new work, use Grok 4.3 with a reasoning effort suited to the task.

Technical Details

xAI’s launch materials described Grok 4.1 Fast as a tool-calling model and emphasized reinforcement learning in simulated tool environments. The retired public API offering had these characteristics:

Two variants: reasoning and non-reasoning.
2M-token context window.
Native fit with xAI’s Agent Tools API, including web search, X search, code execution, collections search, and MCP support.

xAI’s current retirement documentation does not publish a separate maximum output-token limit for these deprecated variants. In this profile, maxOutput: 0 means “not separately published,” not zero output capacity. Developers should therefore enforce explicit generation limits and latency budgets in their own applications.
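One way to apply that advice is to treat maxOutput: 0 as "unknown" and always send an application-chosen cap. This is a sketch under stated assumptions: the default cap, the default timeout, and the key names max_tokens and timeout_seconds are all hypothetical and not taken from xAI documentation.

```python
from typing import Optional

DEFAULT_MAX_OUTPUT_TOKENS = 4096  # application-chosen cap (assumption)
DEFAULT_TIMEOUT_SECONDS = 60.0    # application-chosen latency budget (assumption)


def published_limit(max_output: int) -> Optional[int]:
    """Interpret the profile's maxOutput field: 0 means 'not separately published'."""
    return None if max_output == 0 else max_output


def request_limits(
    max_output: int,
    requested: Optional[int] = None,
    timeout_seconds: float = DEFAULT_TIMEOUT_SECONDS,
) -> dict:
    """Build explicit limits for one request, never exceeding a published limit."""
    cap = requested if requested is not None else DEFAULT_MAX_OUTPUT_TOKENS
    if cap <= 0 or timeout_seconds <= 0:
        raise ValueError("generation cap and timeout must be positive")
    limit = published_limit(max_output)
    if limit is not None:
        cap = min(cap, limit)
    # Key names are hypothetical; map them onto whatever client is in use.
    return {"max_tokens": cap, "timeout_seconds": timeout_seconds}
```

With maxOutput set to 0, the application default is used; if a limit is later published, the same function clamps requests to it.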

Pricing & Access

xAI’s public pricing page may still list the historical variant names, but the retirement guide supersedes it for new planning. The historical list prices were:

Input: $0.20 per 1M tokens
Output: $0.50 per 1M tokens
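For documenting historical cost assumptions, the two rates above reduce to simple arithmetic. The sketch below covers base token cost only; it deliberately ignores cached-input discounts, separately billed tool calls, and large-context premiums, because this page does not give figures for them. The function name is illustrative.

```python
INPUT_USD_PER_MTOK = 0.20   # historical input price per 1M tokens
OUTPUT_USD_PER_MTOK = 0.50  # historical output price per 1M tokens


def estimate_base_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate base token cost in USD, excluding discounts, premiums, and tool fees."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000
```

For example, a request with 1,000,000 input tokens and 200,000 output tokens comes to roughly $0.30 at these rates ($0.20 for input plus $0.10 for output).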

Additional notes:

Cached input is discounted.
Tool calls are billed separately.
Large-context requests are priced at a premium.

Historical access options:

xAI API
xAI Enterprise API
Agent Tools API-based workflows

Best Use Cases

Use this page for:

identifying old grok-4-1-fast-* request paths
testing Grok 4.3 replacement behavior
checking whether non-reasoning callers should set reasoning_effort to none
documenting historical price and context assumptions
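To identify old grok-4-1-fast-* references, a simple text scan over configuration files or logs is often enough as a first pass. This is a minimal sketch: the function name is illustrative, and the pattern assumes model IDs use lowercase letters, digits, and hyphens, as the two IDs on this page do.

```python
import re

# Matches the two retired IDs and any other grok-4-1-fast-* suffix.
RETIRED_ID = re.compile(r"grok-4-1-fast-[a-z0-9-]+")


def find_retired_ids(text: str) -> list[tuple[int, str]]:
    """Return (line_number, model_id) pairs for each retired ID found in text."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for match in RETIRED_ID.finditer(line):
            hits.append((lineno, match.group(0)))
    return hits
```

Running it over the contents of each config file or log gives a line-level list of call sites to migrate; IDs assembled at runtime from string fragments will not be caught by a scan like this.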

Do not start new deployments on Grok 4.1 Fast. The redirect target remains Grok 4.3; for new difficult coding and agentic work, also evaluate Grok 4.5.

Comparisons

Grok 4.3 (xAI): Recommended replacement for both reasoning and non-reasoning 4.1 Fast variants.
Grok 4.5 (xAI): Current flagship for difficult coding and agentic knowledge work.
Grok 4.20 (xAI): 1M-context preview lane, not the default replacement for old fast callers.
Claude Sonnet 4.6 (Anthropic): Stronger general production baseline outside the xAI ecosystem.