Jan Humplik — Scifaro

Gemini Robotics: Bringing AI into the Physical World

Recent advancements in large multimodal models have led to the emergence of remarkable generalist capabilities in digital domains, yet their translation to physical agents such as robots remains a significant challenge. This report…

Robotics · Computer Science · 2025-03-27 · Gemini Robotics Team, Saminda Abeyruwan, Joshua Ainslie, Jean-Baptiste Alayrac, Montserrat Gonzalez Arenas, Travis Armstrong, Ashwin Balakrishna, Robert Baruch, Maria Bauza, Michiel Blokzijl, Steven Bohez, Konstantinos Bousmalis, Anthony Brohan, Thomas Buschmann, Arunkumar Byravan, Serkan Cabi, Ken Caluwaerts, Federico Casarini, Oscar Chang, Jose Enrique Chen, Xi Chen, Hao-Tien Lewis Chiang, Krzysztof Choromanski, David D'Ambrosio, Sudeep Dasari, Todor Davchev, Coline Devin, Norman Di Palo, Tianli Ding, Adil Dostmohamed, Danny Driess, Yilun Du, Debidatta Dwibedi, Michael Elabd, Claudio Fantacci, Cody Fong, Erik Frey, Chuyuan Fu, Marissa Giustina, Keerthana Gopalakrishnan, Laura Graesser, Leonard Hasenclever, Nicolas Heess, Brandon Hernaez, Alexander Herzog, R. Alex Hofer, Jan Humplik, Atil Iscen, Mithun George Jacob, Deepali Jain, Ryan Julian, Dmitry Kalashnikov, M. Emre Karagozler, Stefani Karp, Chase Kew, Jerad Kirkland, Sean Kirmani, Yuheng Kuang, Thomas Lampe, Antoine Laurens, Isabel Leal, Alex X. Lee, Tsang-Wei Edward Lee, Jacky Liang, Yixin Lin, Sharath Maddineni, Anirudha Majumdar, Assaf Hurwitz Michaely, Robert Moreno, Michael Neunert, Francesco Nori, Carolina Parada, Emilio Parisotto, Peter Pastor, Acorn Pooley, Kanishka Rao, Krista Reymann, Dorsa Sadigh, Stefano Saliceti, Pannag Sanketi, Pierre Sermanet, Dhruv Shah, Mohit Sharma, Kathryn Shea, Charles Shu, Vikas Sindhwani, Sumeet Singh, Radu Soricut, Jost Tobias Springenberg, Rachel Sterneck, Razvan Surdulescu, Jie Tan, Jonathan Tompson, Vincent Vanhoucke, Jake Varley, Grace Vesom, Giulia Vezzani, Oriol Vinyals, Ayzaan Wahid, Stefan Welker, Paul Wohlhart, Fei Xia, Ted Xiao, Annie Xie, Jinyu Xie, Peng Xu, Sichun Xu, Ying Xu, Zhuo Xu, Yuxiang Yang, Rui Yao, Sergey Yaroshenko, Wenhao Yu, Wentao Yuan, Jingwei Zhang, Tingnan Zhang, Allan Zhou, Yuxiang Zhou

Proc4Gem: Foundation models for physical agency through procedural generation

In robot learning, it is common to either ignore the environment semantics, focusing on tasks like whole-body control which only require reasoning about robot-environment contacts, or conversely to ignore contact dynamics, focusing on…

Robotics · Computer Science · 2025-03-12 · Yixin Lin, Jan Humplik, Sandy H. Huang, Leonard Hasenclever, Francesco Romano, Stefano Saliceti, Daniel Zheng, Jose Enrique Chen, Catarina Barros, Adrian Collister, Matt Young, Adil Dostmohamed, Ben Moran, Ken Caluwaerts, Marissa Giustina, Joss Moore, Kieran Connell, Francesco Nori, Nicolas Heess, Steven Bohez, Arunkumar Byravan

Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning

We introduce Diffusion Augmented Agents (DAAG), a novel framework that leverages large language models, vision language models, and diffusion models to improve sample efficiency and transfer learning in reinforcement learning for embodied…

Machine Learning · Computer Science · 2024-07-31 · Norman Di Palo, Leonard Hasenclever, Jan Humplik, Arunkumar Byravan

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Large language models (LLMs) have been shown to exhibit a wide range of capabilities, such as writing robot code from language commands -- enabling non-experts to direct robot behaviors, modify them based on feedback, or compose them to…

Robotics · Computer Science · 2024-06-03 · Jacky Liang, Fei Xia, Wenhao Yu, Andy Zeng, Montserrat Gonzalez Arenas, Maria Attarian, Maria Bauza, Matthew Bennice, Alex Bewley, Adil Dostmohamed, Chuyuan Kelly Fu, Nimrod Gileadi, Marissa Giustina, Keerthana Gopalakrishnan, Leonard Hasenclever, Jan Humplik, Jasmine Hsu, Nikhil Joshi, Ben Jyenis, Chase Kew, Sean Kirmani, Tsang-Wei Edward Lee, Kuang-Huei Lee, Assaf Hurwitz Michaely, Joss Moore, Ken Oslund, Dushyant Rao, Allen Ren, Baruch Tabanpour, Quan Vuong, Ayzaan Wahid, Ted Xiao, Ying Xu, Vincent Zhuang, Peng Xu, Erik Frey, Ken Caluwaerts, Tingnan Zhang, Brian Ichter, Jonathan Tompson, Leila Takayama, Vincent Vanhoucke, Izhak Shafran, Maja Mataric, Dorsa Sadigh, Nicolas Heess, Kanishka Rao, Nik Stewart, Jie Tan, Carolina Parada

Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning

We apply multi-agent deep reinforcement learning (RL) to train end-to-end robot soccer policies with fully onboard computation and sensing via egocentric RGB vision. This setting reflects many challenges of real-world robotics, including…

Robotics · Computer Science · 2024-05-07 · Dhruva Tirumala, Markus Wulfmeier, Ben Moran, Sandy Huang, Jan Humplik, Guy Lever, Tuomas Haarnoja, Leonard Hasenclever, Arunkumar Byravan, Nathan Batchelor, Neil Sreendra, Kushal Patel, Marlon Gwira, Francesco Nori, Martin Riedmiller, Nicolas Heess

Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

We investigate whether Deep Reinforcement Learning (Deep RL) is able to synthesize sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be composed into complex behavioral strategies in dynamic…

Robotics · Computer Science · 2024-04-12 · Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Jan Humplik, Markus Wulfmeier, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley, Francesco Nori, Raia Hadsell, Nicolas Heess

Language to Rewards for Robotic Skill Synthesis

Large language models (LLMs) have demonstrated exciting progress in acquiring diverse new capabilities through in-context learning, ranging from logical reasoning to code-writing. Robotics researchers have also explored using LLMs to…

Robotics · Computer Science · 2023-06-21 · Wenhao Yu, Nimrod Gileadi, Chuyuan Fu, Sean Kirmani, Kuang-Huei Lee, Montse Gonzalez Arenas, Hao-Tien Lewis Chiang, Tom Erez, Leonard Hasenclever, Jan Humplik, Brian Ichter, Ted Xiao, Peng Xu, Andy Zeng, Tingnan Zhang, Nicolas Heess, Dorsa Sadigh, Jie Tan, Yuval Tassa, Fei Xia

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration

The ability to effectively reuse prior knowledge is a key requirement when building general and flexible Reinforcement Learning (RL) agents. Skill reuse is one of the most common approaches, but current methods have considerable…

Machine Learning · Computer Science · 2023-01-12 · Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin Riedmiller

NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields

We present a system for applying sim2real approaches to "in the wild" scenes with realistic visuals, and to policies which rely on active perception using RGB cameras. Given a short video of a static scene collected using a generic phone,…

Robotics · Computer Science · 2022-10-12 · Arunkumar Byravan, Jan Humplik, Leonard Hasenclever, Arthur Brussee, Francesco Nori, Tuomas Haarnoja, Ben Moran, Steven Bohez, Fereshteh Sadeghi, Bojan Vujatovic, Nicolas Heess

Forgetting and Imbalance in Robot Lifelong Learning with Off-policy Data

Robots will experience non-stationary environment dynamics throughout their lifetime: the robot dynamics can change due to wear and tear, or its surroundings may change over time. Eventually, the robots should perform well in all of the…

Robotics · Computer Science · 2022-08-19 · Wenxuan Zhou, Steven Bohez, Jan Humplik, Abbas Abdolmaleki, Dushyant Rao, Markus Wulfmeier, Tuomas Haarnoja, Nicolas Heess

Inferring couplings in networks across order-disorder phase transitions

Statistical inference is central to many scientific endeavors, yet how it works remains unresolved. Answering this requires a quantitative understanding of the intrinsic interplay between statistical models, inference methods and data…

Biological Physics · Physics · 2022-06-28 · Vudtiwat Ngampruetikorn, Vedant Sachdeva, Johanna Torrence, Jan Humplik, David J. Schwab, Stephanie E. Palmer

Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors

We investigate the use of prior knowledge of human and animal movement to learn reusable locomotion skills for real legged robots. Our approach builds upon previous work on imitating human or dog Motion Capture (MoCap) data to learn a…

Robotics · Computer Science · 2022-04-01 · Steven Bohez, Saran Tunyasuvunakool, Philemon Brakel, Fereshteh Sadeghi, Leonard Hasenclever, Yuval Tassa, Emilio Parisotto, Jan Humplik, Tuomas Haarnoja, Roland Hafner, Markus Wulfmeier, Michael Neunert, Ben Moran, Noah Siegel, Andrea Huber, Francesco Romano, Nathan Batchelor, Federico Casarini, Josh Merel, Raia Hadsell, Nicolas Heess

Importance Weighted Policy Learning and Adaptation

The ability to exploit prior experience to solve novel problems rapidly is a hallmark of biological learning systems and of great practical importance for artificial ones. In the meta reinforcement learning literature much recent work has…

Machine Learning · Computer Science · 2021-06-07 · Alexandre Galashov, Jakub Sygnowski, Guillaume Desjardins, Jan Humplik, Leonard Hasenclever, Rae Jeong, Yee Whye Teh, Nicolas Heess

Meta reinforcement learning as task inference

Humans achieve efficient learning by relying on prior knowledge about the structure of naturally occurring tasks. There is considerable interest in designing reinforcement learning (RL) algorithms with similar properties. This includes…

Machine Learning · Computer Science · 2019-10-23 · Jan Humplik, Alexandre Galashov, Leonard Hasenclever, Pedro A. Ortega, Yee Whye Teh, Nicolas Heess

Semiparametric energy-based probabilistic models

Probabilistic models can be defined by an energy function, where the probability of each state is proportional to the exponential of the state's negative energy. This paper considers a generalization of energy-based models in which the…

Neurons and Cognition · Quantitative Biology · 2016-05-25 · Jan Humplik, Gašper Tkačik