Grok 4.1 Fast
xAI · Grok
Deprecated xAI fast tool-calling route whose reasoning and non-reasoning slugs now redirect to Grok 4.3.
Overview
Freshness note: Model capabilities, limits, and pricing can change quickly. This profile is a point-in-time snapshot last verified on May 16, 2026.
Grok 4.1 Fast is now a deprecated xAI tool-calling route. xAI’s May 2026 retirement guide states that both grok-4-1-fast-reasoning and grok-4-1-fast-non-reasoning were retired from the API on May 15, 2026 at 12:00 PM PT and now redirect to Grok 4.3.
This page is retained so teams can identify old integrations and migrate deliberately. xAI routes the reasoning variant to Grok 4.3 with low reasoning effort and the non-reasoning variant to Grok 4.3 with reasoning_effort set to none.
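The redirect behavior described above can be written down as a small lookup table when auditing old integrations. The following is a minimal sketch in Python, not xAI's implementation: the legacy slugs and reasoning_effort values come from the retirement guide as summarized here, while the replacement identifier grok-4.3 and the helper name migrate_request are assumptions made for illustration, so confirm the exact model ID against xAI's current model list.

```python
from typing import Any

# Assumed identifier for the replacement model; confirm the exact slug with xAI.
REPLACEMENT_MODEL = "grok-4.3"

# Legacy slug -> reasoning effort applied by the redirect, per the retirement guide.
LEGACY_REDIRECTS = {
    "grok-4-1-fast-reasoning": "low",
    "grok-4-1-fast-non-reasoning": "none",
}


def migrate_request(params: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of request params with a legacy slug made explicit.

    Requests for any other model are returned unchanged (as a copy).
    """
    migrated = dict(params)
    effort = LEGACY_REDIRECTS.get(params.get("model", ""))
    if effort is not None:
        migrated["model"] = REPLACEMENT_MODEL
        # Keep an effort the caller already set; otherwise mirror the redirect.
        migrated.setdefault("reasoning_effort", effort)
    return migrated
```

Making the mapping explicit in client code means the request no longer depends on the redirect staying in place, and it keeps the chosen reasoning effort visible in code review.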
Capabilities
Historically, the official launch and pricing pages highlighted this capability profile:
- Strong tool-calling performance for server-side agents.
- Large 2M-token context window for long-horizon workflows.
- Agentic search and research with web, X, code execution, file search, and MCP-style tool support.
- Better long-context stability than earlier xAI fast models, with explicit emphasis on multi-turn performance.
- Practical fit for enterprise tasks where speed, cost, and tool use all matter together.
This is no longer the xAI model line to choose for new production agents. Use this page for redirect tracking, and use Grok 4.3 with an appropriate reasoning effort for new work.
Technical Details
xAI’s launch materials described Grok 4.1 Fast as a tool-calling model and emphasized reinforcement learning in simulated tool environments. The retired public API variants had these characteristics:
- Two variants: reasoning and non-reasoning.
- 2M-token context window.
- Native fit with xAI’s Agent Tools API, including web search, X search, code execution, collections search, and MCP support.
As with other xAI entries in this repo, maxOutput is stored as the documented context ceiling for comparison convenience. Developers should enforce explicit generation limits and latency budgets in their applications.
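One way to enforce explicit generation limits and latency budgets is to cap the requested output size and track a wall-clock deadline in application code. The sketch below is illustrative only: the max_tokens parameter name, the GenerationBudget type, and the example numbers are assumptions, not values documented by xAI.

```python
import time
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class GenerationBudget:
    max_output_tokens: int  # hard cap on generated tokens per request
    timeout_seconds: float  # wall-clock budget for the whole call


def apply_budget(params: dict, budget: GenerationBudget) -> dict:
    """Return a copy of params whose max_tokens never exceeds the budget."""
    capped = dict(params)
    requested = capped.get("max_tokens", budget.max_output_tokens)
    capped["max_tokens"] = min(requested, budget.max_output_tokens)
    return capped


def remaining_seconds(
    started_at: float, budget: GenerationBudget, now: Optional[float] = None
) -> float:
    """Seconds left in the latency budget, floored at zero."""
    current = time.monotonic() if now is None else now
    return max(0.0, budget.timeout_seconds - (current - started_at))
```

A caller would record time.monotonic() before sending the request, pass the capped parameters to its HTTP client, and use remaining_seconds as the client timeout for any retries.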
Pricing & Access
xAI’s public pricing page may still expose historical variant names, but the retirement guide supersedes them for new planning. The historical list prices were:
- Input: $0.20 per 1M tokens
- Output: $0.50 per 1M tokens
Additional notes:
- Cached input is discounted.
- Tool calls are billed separately.
- Large-context requests are priced at a premium.
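For documenting historical cost assumptions, the base token prices above translate into simple arithmetic. The sketch below covers only the two list prices quoted on this page; it deliberately ignores the cached-input discount, separately billed tool calls, and the large-context premium, because this page does not give figures for them. The function name estimate_base_cost is hypothetical.

```python
from decimal import Decimal

# Historical list prices quoted on this page, in USD per 1M tokens.
INPUT_USD_PER_MTOK = Decimal("0.20")
OUTPUT_USD_PER_MTOK = Decimal("0.50")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_base_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Base token cost in USD, excluding cache discounts, tool calls, and premiums."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        Decimal(input_tokens) * INPUT_USD_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_USD_PER_MTOK
    ) / TOKENS_PER_MTOK
```

For example, a request with 100,000 input tokens and 20,000 output tokens comes to $0.02 + $0.01 = $0.03 at these rates.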
Historical access options:
- xAI API
- xAI Enterprise API
- Agent Tools API-based workflows
Best Use Cases
Use this page for:
- identifying old grok-4-1-fast-* request paths
- testing Grok 4.3 replacement behavior
- checking whether non-reasoning callers should set reasoning_effort to none
- documenting historical price and context assumptions
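Identifying old request paths usually starts with a text search over configuration and source files. The following is a minimal sketch that scans a string for the two legacy slugs named on this page; the helper name find_legacy_slugs is hypothetical, and how files are discovered and read is left to the caller.

```python
import re

# Matches either legacy slug named on this page.
LEGACY_SLUG = re.compile(r"\bgrok-4-1-fast-(?:non-)?reasoning\b")


def find_legacy_slugs(text: str) -> list[tuple[int, str]]:
    """Return (line_number, slug) pairs for each legacy slug found in text."""
    hits = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        for match in LEGACY_SLUG.finditer(line):
            hits.append((line_number, match.group(0)))
    return hits
```

Reporting line numbers alongside the slug makes it easier to decide, hit by hit, whether the caller should move to low reasoning effort or to reasoning_effort set to none.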
Do not start new deployments on Grok 4.1 Fast. For new xAI text, tool, and code workloads, use Grok 4.3.
Comparisons
- Grok 4.3 (xAI): Recommended replacement for both reasoning and non-reasoning 4.1 Fast variants.
- Grok 4.20 (xAI): 2M-context preview lane, not the default replacement for old fast callers.
- Claude Sonnet 4.6 (Anthropic): Stronger general production baseline outside the xAI ecosystem.