Deepak Pathak — Scifaro

DexWild: Dexterous Human Interactions for In-the-Wild Robot Policies

Large-scale, diverse robot datasets have emerged as a promising path toward enabling dexterous manipulation policies to generalize to novel environments, but acquiring such datasets presents many challenges. While teleoperation provides…

Robotics · Computer Science 2026-05-19 Tony Tao , Mohan Kumar Srirama , Jason Jingzhou Liu , Kenneth Shaw , Deepak Pathak

Solving Physics Olympiad via Reinforcement Learning on Physics Simulators

We have witnessed remarkable advances in LLM reasoning capabilities with the advent of DeepSeek-R1. However, much of this progress has been fueled by the abundance of internet question-answer (QA) pairs, a major bottleneck going forward,…

Machine Learning · Computer Science 2026-04-14 Mihir Prabhudesai , Aryan Satpathy , Yangmin Li , Zheyang Qin , Nikash Bhardwaj , Amir Zadeh , Chuan Li , Katerina Fragkiadaki , Deepak Pathak

YieldSAT: A Multimodal Benchmark Dataset for High-Resolution Crop Yield Prediction

Crop yield prediction requires substantial data to train scalable models. However, creating yield prediction datasets is constrained by high acquisition costs, heterogeneous data quality, and data privacy regulations. Consequently, existing…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Miro Miranda , Deepak Pathak , Patrick Helber , Benjamin Bischke , Hiba Najjar , Francisco Mena , Cristhian Sanchez , Akshay Pai , Diego Arenas , Matias Valdenegro-Toro , Marcela Charfuelan , Marlon Nuske , Andreas Dengel

ViPRA: Video Prediction for Robot Actions

Can we turn a video prediction model into a robot policy? Videos, including those of humans or teleoperated robots, capture rich physical interactions. However, most of them lack labeled actions, which limits their use in robot learning. We…

Robotics · Computer Science 2026-03-31 Sandeep Routray , Hengkai Pan , Unnat Jain , Shikhar Bahl , Deepak Pathak

Latent Particle World Models: Self-supervised Object-centric Stochastic Dynamics Modeling

We introduce Latent Particle World Model (LPWM), a self-supervised object-centric world model scaled to real-world multi-object datasets and applicable in decision-making. LPWM autonomously discovers keypoints, bounding boxes, and object…

Machine Learning · Computer Science 2026-03-06 Tal Daniel , Carl Qi , Dan Haramati , Amir Zadeh , Chuan Li , Aviv Tamar , Deepak Pathak , David Held

Expanding the Capabilities of Reinforcement Learning via Text Feedback

The success of RL for LLM post-training stems from an unreasonably uninformative source: a single bit of information per rollout as binary reward or preference label. At the other extreme, distillation offers dense supervision but requires…

Machine Learning · Computer Science 2026-02-12 Yuda Song , Lili Chen , Fahim Tajwar , Remi Munos , Deepak Pathak , J. Andrew Bagnell , Aarti Singh , Andrea Zanette

Iterative Refinement Improves Compositional Image Generation

Text-to-image (T2I) models have achieved remarkable progress, yet they continue to struggle with complex prompts that require simultaneously handling multiple objects, relations, and attributes. Existing inference-time strategies, such as…

Computer Vision and Pattern Recognition · Computer Science 2026-01-22 Shantanu Jaiswal , Mihir Prabhudesai , Nikash Bhardwaj , Zheyang Qin , Amir Zadeh , Chuan Li , Katerina Fragkiadaki , Deepak Pathak

Generative Classifiers Avoid Shortcut Solutions

Discriminative approaches to classification often learn shortcuts that hold in-distribution but fail even under minor distribution shift. This failure mode stems from an overreliance on features that are spuriously correlated with the…

Machine Learning · Computer Science 2026-01-01 Alexander C. Li , Ananya Kumar , Deepak Pathak

High Torque Density PCB Axial Flux Permanent Magnet Motor for Micro Robots

Quasi-direct-drive (QDD) actuation is transforming legged and manipulator robots by eliminating high-ratio gearboxes, yet it demands motors that deliver very high torque at low speed within a thin, disc-shaped joint envelope. Axial-flux…

Robotics · Computer Science 2025-12-09 Jianren Wang , Quanting Xie , Jie Han , Yang Zhang , Christopher G. Atkeson , Abhinav Gupta , Deepak Pathak , Yonatan Bisk

IFG: Internet-Scale Guidance for Functional Grasping Generation

Large Vision Models trained on internet-scale data have demonstrated strong capabilities in segmenting and semantically understanding object parts, even in cluttered, crowded scenes. However, while these models can direct a robot toward the…

Robotics · Computer Science 2025-11-13 Ray Muxin Liu , Mingxuan Li , Kenneth Shaw , Deepak Pathak

Evolutionary Policy Optimization

On-policy reinforcement learning (RL) algorithms are widely used for their strong asymptotic performance and training stability, but they struggle to scale with larger batch sizes, as additional parallel environments yield redundant data…

Machine Learning · Computer Science 2025-11-13 Jianren Wang , Yifan Su , Abhinav Gupta , Deepak Pathak

Diffusion Beats Autoregressive in Data-Constrained Settings

Autoregressive (AR) models have long dominated the landscape of large language models, driving progress across a wide range of tasks. Recently, diffusion-based language models have emerged as a promising alternative, though their advantages…

Machine Learning · Computer Science 2025-10-28 Mihir Prabhudesai , Mengning Wu , Amir Zadeh , Katerina Fragkiadaki , Deepak Pathak

LocoFormer: Generalist Locomotion via Long-context Adaptation

Modern locomotion controllers are manually tuned for specific embodiments. We present LocoFormer, a generalist omni-bodied locomotion model that can control previously unseen legged and wheeled robots, even without precise knowledge of…

Robotics · Computer Science 2025-09-30 Min Liu , Deepak Pathak , Ananye Agarwal

Self-Questioning Language Models

Can large language models improve without external data -- by generating their own questions and answers? We hypothesize that a pre-trained language model can improve its reasoning skills given only a single prompt specifying the topic…

Machine Learning · Computer Science 2025-09-11 Lili Chen , Mihir Prabhudesai , Katerina Fragkiadaki , Hao Liu , Deepak Pathak

Deep Reactive Policy: Learning Reactive Manipulator Motion Planning for Dynamic Environments

Generating collision-free motion in dynamic, partially observable environments is a fundamental challenge for robotic manipulators. Classical motion planners can compute globally optimal trajectories but require full environment knowledge…

Robotics · Computer Science 2025-09-09 Jiahui Yang , Jason Jingzhou Liu , Yulong Li , Youssef Khaky , Kenneth Shaw , Deepak Pathak

Can LLMs Lie? Investigation beyond Hallucination

Large language models (LLMs) have demonstrated impressive capabilities across a variety of tasks, but their increasing autonomy in real-world applications raises concerns about their trustworthiness. While hallucinations-unintentional…

Machine Learning · Computer Science 2025-09-04 Haoran Huan , Mihir Prabhudesai , Mengning Wu , Shantanu Jaiswal , Deepak Pathak

Intrinsic Explainability of Multimodal Learning for Crop Yield Prediction

Multimodal learning enables various machine learning tasks to benefit from diverse data sources, effectively mimicking the interplay of different factors in real-world applications, particularly in agriculture. While the heterogeneous…

Artificial Intelligence · Computer Science 2025-08-12 Hiba Najjar , Deepak Pathak , Marlon Nuske , Andreas Dengel

Maximizing Confidence Alone Improves Reasoning

Reinforcement learning (RL) has enabled machine learning models to achieve significant advances in many fields. Most recently, RL has empowered frontier language models to solve challenging math, science, and coding problems. However,…

Machine Learning · Computer Science 2025-06-30 Mihir Prabhudesai , Lili Chen , Alex Ippoliti , Katerina Fragkiadaki , Hao Liu , Deepak Pathak

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with…

Robotics · Computer Science 2025-05-15 Embodiment Collaboration , Abby O'Neill , Abdul Rehman , Abhinav Gupta , Abhiram Maddukuri , Abhishek Gupta , Abhishek Padalkar , Abraham Lee , Acorn Pooley , Agrim Gupta , Ajay Mandlekar , Ajinkya Jain , Albert Tung , Alex Bewley , Alex Herzog , Alex Irpan , Alexander Khazatsky , Anant Rai , Anchit Gupta , Andrew Wang , Andrey Kolobov , Anikait Singh , Animesh Garg , Aniruddha Kembhavi , Annie Xie , Anthony Brohan , Antonin Raffin , Archit Sharma , Arefeh Yavary , Arhan Jain , Ashwin Balakrishna , Ayzaan Wahid , Ben Burgess-Limerick , Beomjoon Kim , Bernhard Schölkopf , Blake Wulfe , Brian Ichter , Cewu Lu , Charles Xu , Charlotte Le , Chelsea Finn , Chen Wang , Chenfeng Xu , Cheng Chi , Chenguang Huang , Christine Chan , Christopher Agia , Chuer Pan , Chuyuan Fu , Coline Devin , Danfei Xu , Daniel Morton , Danny Driess , Daphne Chen , Deepak Pathak , Dhruv Shah , Dieter Büchler , Dinesh Jayaraman , Dmitry Kalashnikov , Dorsa Sadigh , Edward Johns , Ethan Foster , Fangchen Liu , Federico Ceola , Fei Xia , Feiyu Zhao , Felipe Vieira Frujeri , Freek Stulp , Gaoyue Zhou , Gaurav S. Sukhatme , Gautam Salhotra , Ge Yan , Gilbert Feng , Giulio Schiavi , Glen Berseth , Gregory Kahn , Guangwen Yang , Guanzhi Wang , Hao Su , Hao-Shu Fang , Haochen Shi , Henghui Bao , Heni Ben Amor , Henrik I Christensen , Hiroki Furuta , Homanga Bharadhwaj , Homer Walke , Hongjie Fang , Huy Ha , Igor Mordatch , Ilija Radosavovic , Isabel Leal , Jacky Liang , Jad Abou-Chakra , Jaehyung Kim , Jaimyn Drake , Jan Peters , Jan Schneider , Jasmine Hsu , Jay Vakil , Jeannette Bohg , Jeffrey Bingham , Jeffrey Wu , Jensen Gao , Jiaheng Hu , Jiajun Wu , Jialin Wu , Jiankai Sun , Jianlan Luo , Jiayuan Gu , Jie Tan , Jihoon Oh , Jimmy Wu , Jingpei Lu , Jingyun Yang , Jitendra Malik , João Silvério , Joey Hejna , Jonathan Booher , Jonathan Tompson , Jonathan Yang , Jordi Salvador , Joseph J. Lim , Junhyek Han , Kaiyuan Wang , Kanishka Rao , Karl Pertsch , Karol Hausman , Keegan Go , Keerthana Gopalakrishnan , Ken Goldberg , Kendra Byrne , Kenneth Oslund , Kento Kawaharazuka , Kevin Black , Kevin Lin , Kevin Zhang , Kiana Ehsani , Kiran Lekkala , Kirsty Ellis , Krishan Rana , Krishnan Srinivasan , Kuan Fang , Kunal Pratap Singh , Kuo-Hao Zeng , Kyle Hatch , Kyle Hsu , Laurent Itti , Lawrence Yunliang Chen , Lerrel Pinto , Li Fei-Fei , Liam Tan , Linxi "Jim" Fan , Lionel Ott , Lisa Lee , Luca Weihs , Magnum Chen , Marion Lepert , Marius Memmel , Masayoshi Tomizuka , Masha Itkina , Mateo Guaman Castro , Max Spero , Maximilian Du , Michael Ahn , Michael C. Yip , Mingtong Zhang , Mingyu Ding , Minho Heo , Mohan Kumar Srirama , Mohit Sharma , Moo Jin Kim , Muhammad Zubair Irshad , Naoaki Kanazawa , Nicklas Hansen , Nicolas Heess , Nikhil J Joshi , Niko Suenderhauf , Ning Liu , Norman Di Palo , Nur Muhammad Mahi Shafiullah , Oier Mees , Oliver Kroemer , Osbert Bastani , Pannag R Sanketi , Patrick "Tree" Miller , Patrick Yin , Paul Wohlhart , Peng Xu , Peter David Fagan , Peter Mitrano , Pierre Sermanet , Pieter Abbeel , Priya Sundaresan , Qiuyu Chen , Quan Vuong , Rafael Rafailov , Ran Tian , Ria Doshi , Roberto Martín-Martín , Rohan Baijal , Rosario Scalise , Rose Hendrix , Roy Lin , Runjia Qian , Ruohan Zhang , Russell Mendonca , Rutav Shah , Ryan Hoque , Ryan Julian , Samuel Bustamante , Sean Kirmani , Sergey Levine , Shan Lin , Sherry Moore , Shikhar Bahl , Shivin Dass , Shubham Sonawani , Shubham Tulsiani , Shuran Song , Sichun Xu , Siddhant Haldar , Siddharth Karamcheti , Simeon Adebola , Simon Guist , Soroush Nasiriany , Stefan Schaal , Stefan Welker , Stephen Tian , Subramanian Ramamoorthy , Sudeep Dasari , Suneel Belkhale , Sungjae Park , Suraj Nair , Suvir Mirchandani , Takayuki Osa , Tanmay Gupta , Tatsuya Harada , Tatsuya Matsushima , Ted Xiao , Thomas Kollar , Tianhe Yu , Tianli Ding , Todor Davchev , Tony Z. Zhao , Travis Armstrong , Trevor Darrell , Trinity Chung , Vidhi Jain , Vikash Kumar , Vincent Vanhoucke , Vitor Guizilini , Wei Zhan , Wenxuan Zhou , Wolfram Burgard , Xi Chen , Xiangyu Chen , Xiaolong Wang , Xinghao Zhu , Xinyang Geng , Xiyuan Liu , Xu Liangwei , Xuanlin Li , Yansong Pang , Yao Lu , Yecheng Jason Ma , Yejin Kim , Yevgen Chebotar , Yifan Zhou , Yifeng Zhu , Yilin Wu , Ying Xu , Yixuan Wang , Yonatan Bisk , Yongqiang Dou , Yoonyoung Cho , Youngwoon Lee , Yuchen Cui , Yue Cao , Yueh-Hua Wu , Yujin Tang , Yuke Zhu , Yunchu Zhang , Yunfan Jiang , Yunshuang Li , Yunzhu Li , Yusuke Iwasawa , Yutaka Matsuo , Zehan Ma , Zhuo Xu , Zichen Jeff Cui , Zichen Zhang , Zipeng Fu , Zipeng Lin

FACTR: Force-Attending Curriculum Training for Contact-Rich Policy Learning

Many contact-rich tasks humans perform, such as box pickup or rolling dough, rely on force feedback for reliable execution. However, this force information, which is readily available in most robot arms, is not commonly used in…

Robotics · Computer Science 2025-04-28 Jason Jingzhou Liu , Yulong Li , Kenneth Shaw , Tony Tao , Ruslan Salakhutdinov , Deepak Pathak