Date: March 13, 2025
Google DeepMind unveils Gemini Robotics, an AI system designed to make robots smarter, more adaptable, and capable of real-world problem-solving.
Google DeepMind is pushing robotics into new territory with the launch of two groundbreaking AI models: Gemini Robotics and Gemini Robotics-ER. These latest models promise to make robots smarter, more versatile, and far better at understanding our world.
Built on Google’s advanced Gemini 2.0 platform, Gemini Robotics is a vision-language-action (VLA) model that integrates sight, language processing, and physical action into a single system. Unlike traditional AI confined to digital outputs like text or images, this model enables robots to interpret natural language commands and execute complex tasks in real-world environments.
Meanwhile, Gemini Robotics-ER (Embodied Reasoning) adds advanced spatial awareness, allowing robots to navigate and reason about their surroundings with greater precision.
“We’ve been able to bring the world-understanding—the general-concept understanding—of Gemini 2.0 to robotics,” said Kanishka Rao, a robotics researcher at Google DeepMind who spearheaded the project. In a press briefing, Rao highlighted the models’ ability to control various robots across hundreds of scenarios, even those not included in their training data.
Demonstrations showcased the technology’s potential. Robots equipped with Gemini Robotics performed tasks like folding origami, packing snacks into a Ziploc bag, and plugging devices into power strips—all in response to spoken instructions. One video featured an Apptronik-developed humanoid robot, Apollo, rearranging letters on a tabletop while conversing with a human operator. The system’s adaptability shone when objects slipped or environments changed, with robots quickly recalibrating to complete their tasks.
The implications are vast. Google DeepMind is partnering with Apptronik, a Texas-based robotics firm, to integrate Gemini 2.0 into next-generation humanoid robots. “We’re excited to explore how these models can push the boundaries of what robots can achieve,” Rao added. The company is also collaborating with select testers (Agile Robots, Agility Robotics, Boston Dynamics, and Enchanted Tools) to refine Gemini Robotics-ER, signaling a cautious but ambitious rollout.
Carolina Parada, head of robotics at Google DeepMind, emphasized the models’ advancements in three key areas: generality, interactivity, and dexterity. “This enables us to build robots that are more capable, more responsive, and more robust to changes in their environment,” she said during the briefing. Parada noted that unlike humans, these robots don’t learn on the fly—yet—but the foundation is being laid for future breakthroughs.
Safety remains a priority amid such rapid progress. Google DeepMind introduced ASIMOV, a new benchmark named after sci-fi author Isaac Asimov, to assess risks in AI-powered robotics. The tool evaluates whether a robot’s actions could lead to unintended consequences, such as grasping an object dangerously close to a human. “We’re building this technology with safety top of mind,” Parada said, acknowledging that commercialization is still years away.
By Arpit Dubey
Arpit is a dreamer, wanderer, and tech nerd who loves to jot down tech musings and updates. Armed with a Bachelor's in Business Administration, a knack for crafting compelling narratives, and a sharp specialization in everything from Predictive Analytics to FinTech (and let’s not forget SaaS, healthcare, and more), Arpit crafts content that’s as strategic as it is compelling. With a Logician mind, he is always chasing sunrises and tech advancements while secretly preparing for the robot uprising.