Gemini Robotics-ER 1.6
Google · Gemini Robotics
Google DeepMind's robotics-tuned Gemini for embodied reasoning, spatial planning, and physical agent tasks.
Overview
Freshness note: Model capabilities, limits, and pricing can change quickly. This profile is a point-in-time snapshot last verified on May 1, 2026.
Gemini Robotics-ER 1.6 is Google DeepMind’s April 14, 2026 robotics-tuned Gemini variant. It is built to act as the high-level reasoning model for a robot, calling tools such as Google Search, vision-language-action (VLA) models, or user-defined functions, while handling spatial reasoning, multi-view understanding, and instrument reading natively. DeepMind positions it as a meaningful step over Gemini Robotics-ER 1.5 on practical industrial tasks: instrument reading accuracy reportedly jumps from 23% to 93% on agentic vision evaluations, and spatial reasoning improves on pointing, counting, and success-detection tasks.
This is a specialized model. For typical chat or coding workflows, the standard Gemini 3.1 Pro line is the right reference. Robotics-ER 1.6 is the entry to consult when the question is “how does Google approach embodied reasoning for physical agents?”
Capabilities
DeepMind’s release materials highlight a specific capability profile:
- High-level embodied reasoning for robots, with native tool calling against VLA models, search, and custom functions.
- Spatial and physical reasoning improvements on pointing, counting, multi-view understanding, and success detection.
- Instrument reading at roughly 93% accuracy on agentic-vision tasks, including analog gauges, pressure meters, and sight glasses — a capability essentially absent in the prior 1.5 release.
- Improved compliance with safety policies on adversarial spatial-reasoning tasks, with DeepMind describing it as the safest robotics model in the family to date.
Robotics-ER 1.6 is already deployed inside Boston Dynamics’ Spot robot for live industrial inspections, which gives the release more practical signal than a typical research preview.
Technical Details
Public anchors at this snapshot:
- Built on the Gemini multimodal foundation with text, image, and video inputs.
- Designed as a planner and reasoning model that issues tool calls to lower-level VLA controllers, not as an end-to-end action policy.
- Available through the Gemini API and Google AI Studio at this snapshot.
- Marked as a preview release; DeepMind may adjust capabilities and access before GA.
Specific token-window numbers in the frontmatter are best-effort placeholders. Robotics-ER 1.6 inherits Gemini’s standard long-context capacity but exposes a different practical envelope when used in real-time robotics loops.
Pricing & Access
Pricing was not separately published as a standalone Robotics-ER tier at the time of this snapshot. Standard Gemini API rates apply when calling the model through the Gemini API or Vertex AI integrations.
Access options:
- Gemini API for developers
- Google AI Studio for prototyping
- Robotics partners through Google DeepMind’s integration paths
- Live deployment alongside VLA models such as Gemini Robotics 1.5
Best Use Cases
Choose Gemini Robotics-ER 1.6 for:
- Industrial inspection workflows where reading analog gauges and instruments matters.
- Robotics planners that delegate motor control to VLA models while keeping high-level reasoning in a frontier model.
- Embodied agents that need spatial reasoning, multi-view understanding, and tool-call orchestration in one surface.
- Research and prototyping work building on Boston Dynamics Spot or similar physical platforms.
This is not the right model for general assistant work, chat, or non-physical agentic coding; the standard Gemini 3.1 Pro and Flash lines are better defaults there.
Comparisons
- Gemini Robotics-ER 1.5 (Google): Direct predecessor; 1.6 substantially improves instrument reading and spatial reasoning while introducing stronger safety behavior.
- Gemini 3.0 Flash (Google): General-purpose Gemini Flash variant; Robotics-ER 1.6 is the robotics-specialized counterpart with explicit embodied-reasoning training.
- Other embodied-AI research models: Typically narrower research demos; Robotics-ER 1.6 differentiates through industrial deployment and multi-view spatial reasoning.