Google DeepMind has announced Gemini Robotics On-Device, an AI model designed specifically for robotics. The model runs locally on a robot's own hardware, so the robot can operate without a constant internet connection.
Drawing on the multimodal reasoning capabilities of the Gemini 2.0 model, Gemini Robotics On-Device is flexible and can generalize across tasks. It has been optimized for a range of operations, including dexterous tasks such as folding laundry and unzipping bags, all of which run directly on the robot itself.
Because inference happens on the robot, Gemini Robotics On-Device is well suited to latency-sensitive applications and keeps working in environments with poor network connectivity. To help developers make use of the model, Google will also release the Gemini Robotics SDK. With this software development kit, developers can assess the model's performance on specific tasks, test it in DeepMind's MuJoCo physics simulator, and adapt it to new domains with as few as 50 to 100 demonstrations.
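The idea of assessing a model task by task can be pictured with a small evaluation loop. What follows is a toy sketch, not the Gemini Robotics SDK: `run_episode`, `success_rates`, and `toy_policy` are hypothetical names, the "simulation" is a stand-in for a real MuJoCo rollout, and the task names are only borrowed from the examples mentioned above.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical stand-ins: none of these names come from the Gemini Robotics SDK.
Policy = Callable[[str, int], float]  # (task name, step index) -> progress made


@dataclass
class EpisodeResult:
    task: str
    success: bool


def run_episode(task: str, policy: Policy, rng: random.Random,
                max_steps: int = 50) -> EpisodeResult:
    """Toy rollout: the task succeeds once accumulated progress reaches 1.0."""
    progress = 0.0
    for step in range(max_steps):
        # Noise factor in [0.5, 1.0] mimics imperfect execution in a simulator.
        progress += policy(task, step) * rng.uniform(0.5, 1.0)
        if progress >= 1.0:
            return EpisodeResult(task, True)
    return EpisodeResult(task, False)


def success_rates(tasks: List[str], policy: Policy,
                  episodes: int = 20, seed: int = 0) -> Dict[str, float]:
    """Run several episodes per task and report the fraction that succeed."""
    rng = random.Random(seed)
    rates: Dict[str, float] = {}
    for task in tasks:
        results = [run_episode(task, policy, rng) for _ in range(episodes)]
        rates[task] = sum(r.success for r in results) / episodes
    return rates


def toy_policy(task: str, step: int) -> float:
    """Placeholder policy: makes less progress per step on the harder task."""
    return 0.01 if task == "fold_laundry" else 0.1


if __name__ == "__main__":
    for task, rate in success_rates(["fold_laundry", "unzip_bag"], toy_policy).items():
        print(f"{task}: {rate:.0%} success")
```

In a real workflow the toy rollout would be replaced by simulator episodes driven by the model, but the reporting shape (a success rate per task over repeated trials) stays the same.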
In terms of performance, Gemini Robotics On-Device adapts well across multiple tasks. The model performed strongly on seven dexterous manipulation tasks of varying difficulty, handling objects and scenarios it had never encountered before. DeepMind presents these results as evidence of both the model's versatility and its adaptability across different robots.
With Gemini Robotics On-Device, DeepMind has made a significant advance in building capable robot models and taken a step toward embodied intelligence.