Related papers: Physical Reasoning Using Dynamics-Aware Models

Forward Prediction for Physical Reasoning

Physical reasoning requires forward prediction: the ability to forecast what will happen next given some initial world state. We study the performance of state-of-the-art forward-prediction models in the complex physical-reasoning tasks of…

Machine Learning · Computer Science 2021-03-31 Rohit Girdhar , Laura Gustafson , Aaron Adcock , Laurens van der Maaten

Describing Physics For Physical Reasoning: Force-based Sequential Manipulation Planning

Physical reasoning is a core aspect of intelligence in animals and humans. A central question is what model should be used as a basis for reasoning. Existing work considered models ranging from intuitive physics and physical simulators to…

Robotics · Computer Science 2020-07-07 Marc Toussaint , Jung-Su Ha , Danny Driess

Solving Physics Puzzles by Reasoning about Paths

We propose a new deep learning model for goal-driven tasks that require intuitive physical reasoning and intervention in the scene to achieve a desired end goal. Its modular structure is motivated by hypothesizing a sequence of intuitive…

Artificial Intelligence · Computer Science 2020-11-17 Augustin Harter , Andrew Melnik , Gaurav Kumar , Dhruv Agarwal , Animesh Garg , Helge Ritter

PHYRE: A New Benchmark for Physical Reasoning

Understanding and reasoning about physics is an important ability of intelligent agents. We develop the PHYRE benchmark for physical reasoning that contains a set of simple classical mechanics puzzles in a 2D physical environment. The…

Machine Learning · Computer Science 2019-08-16 Anton Bakhtin , Laurens van der Maaten , Justin Johnson , Laura Gustafson , Ross Girshick

Reasoning in machine vision: learning to think fast and slow

Reasoning is a hallmark of human intelligence, enabling adaptive decision-making in complex and unfamiliar scenarios. In contrast, machine intelligence remains bound to training data, lacking the ability to dynamically refine solutions at…

Computer Vision and Pattern Recognition · Computer Science 2025-06-30 Shaheer U. Saeed , Yipei Wang , Veeru Kasivisvanathan , Brian R. Davidson , Matthew J. Clarkson , Yipeng Hu , Daniel C. Alexander

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Existing benchmarks fail to capture a crucial aspect of intelligence: physical reasoning, the integrated ability to combine domain knowledge, symbolic reasoning, and understanding of real-world constraints. To address this gap, we introduce…

Artificial Intelligence · Computer Science 2025-05-30 Hui Shen , Taiqiang Wu , Qi Han , Yunta Hsieh , Jizhou Wang , Yuyue Zhang , Yuxin Cheng , Zijian Hao , Yuansheng Ni , Xin Wang , Zhongwei Wan , Kai Zhang , Wendong Xu , Jing Xiong , Ping Luo , Wenhu Chen , Chaofan Tao , Zhuoqing Mao , Ngai Wong

Reward Design for Physical Reasoning in Vision-Language Models

Physical reasoning over visual inputs demands tight integration of visual perception, domain knowledge, and multi-step symbolic inference. Yet even state-of-the-art Vision Language Models (VLMs) fall far short of human performance on…

Artificial Intelligence · Computer Science 2026-04-16 Derek Lilienthal , Manisha Mukherjee , Sameera Horawalavithana

Learning Reward for Physical Skills using Large Language Model

Learning reward functions for physical skills are challenging due to the vast spectrum of skills, the high-dimensionality of state and action space, and nuanced sensory feedback. The complexity of these tasks makes acquiring expert…

Robotics · Computer Science 2023-10-24 Yuwei Zeng , Yiqing Xu

Physics-Informed Model-Based Reinforcement Learning

We apply reinforcement learning (RL) to robotics tasks. One of the drawbacks of traditional RL algorithms has been their poor sample efficiency. One approach to improve the sample efficiency is model-based RL. In our model-based RL…

Machine Learning · Computer Science 2023-05-16 Adithya Ramesh , Balaraman Ravindran

Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards

Preference-based reinforcement learning (PbRL) aligns a robot behavior with human preferences via a reward function learned from binary feedback over agent behaviors. We show that dynamics-aware reward functions improve the sample…

Artificial Intelligence · Computer Science 2024-02-29 Katherine Metcalf , Miguel Sarabia , Natalie Mackraz , Barry-John Theobald

Performative Reinforcement Learning

We introduce the framework of performative reinforcement learning where the policy chosen by the learner affects the underlying reward and transition dynamics of the environment. Following the recent literature on performative…

Machine Learning · Computer Science 2023-06-08 Debmalya Mandal , Stelios Triantafyllou , Goran Radanovic

Hierarchical Relational Inference

Common-sense physical reasoning in the real world requires learning about the interactions of objects and their dynamics. The notion of an abstract object, however, encompasses a wide variety of physical objects that differ greatly in terms…

Machine Learning · Computer Science 2020-12-16 Aleksandar Stanić , Sjoerd van Steenkiste , Jürgen Schmidhuber

Emergence of Locomotion Behaviours in Rich Environments

The reinforcement learning paradigm allows, in principle, for complex behaviours to be learned directly from simple reward signals. In practice, however, it is common to carefully hand-design the reward function to encourage a particular…

Artificial Intelligence · Computer Science 2017-07-12 Nicolas Heess , Dhruva TB , Srinivasan Sriram , Jay Lemmon , Josh Merel , Greg Wayne , Yuval Tassa , Tom Erez , Ziyu Wang , S. M. Ali Eslami , Martin Riedmiller , David Silver

Enhancing Reasoning Skills in the Process of Teaching and Learning Physics via Dynamic Problem Solving Strategies: a Preparation for Future Learning

The large number of published articles in physics journals under the title "Comments on ..." and "Reply to ..." is indicative that the conceptual understanding of physical phenomena is very elusive and hard to grasp even to experts, but it…

General Physics · Physics 2011-11-18 Sergio Rojas

Visual Reasoning by Progressive Module Networks

Humans learn to solve tasks of increasing complexity by building on top of previously acquired knowledge. Typically, there exists a natural progression in the tasks that we learn - most do not require completely independent solutions, but…

Computer Vision and Pattern Recognition · Computer Science 2018-10-01 Seung Wook Kim , Makarand Tapaswi , Sanja Fidler

Learning to Rearrange with Physics-Inspired Risk Awareness

Real-world applications require a robot operating in the physical world with awareness of potential risks besides accomplishing the task. A large part of risky behaviors arises from interacting with objects in ignorance of affordance. To…

Robotics · Computer Science 2022-06-28 Meng Song , Yuhan Liu , Zhengqin Li , Manmohan Chandraker

Object-centric Denoising Diffusion Models for Physical Reasoning

Reasoning about the trajectories of multiple, interacting objects is integral to physical reasoning tasks in machine learning. This involves conditions imposed on the objects at different time steps, for instance initial states or desired…

Machine Learning · Computer Science 2025-07-08 Moritz Lange , Raphael C. Engelhardt , Wolfgang Konen , Andrew Melnik , Laurenz Wiskott

Reasoning Depth and Environment Complexity: A Controlled Study of RLVR Data Allocation across Logical Reasoning Tasks

Reinforcement learning with verifiable rewards (RLVR) has become central to post-training reasoning models, yet a key limitation of existing studies is their narrow view of the reasoning space: difficulty is treated as reasoning depth…

Computation and Language · Computer Science 2026-05-27 Yihua Zhu , Qianying Liu , Fei Cheng , Jiaxin Wang , Akiko Aizawa , Sadao Kurohashi , Hidetoshi Shimodaira

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

The ability to precisely derive mathematical objects is a core requirement for downstream STEM applications, including mathematics, physics, and chemistry, where reasoning must culminate in formally structured expressions. Yet, current LM…

Artificial Intelligence · Computer Science 2026-03-20 Pranjal Aggarwal , Marjan Ghazvininejad , Seungone Kim , Ilia Kulikov , Jack Lanchantin , Xian Li , Tianjian Li , Bo Liu , Graham Neubig , Anaelia Ovalle , Swarnadeep Saha , Sainbayar Sukhbaatar , Sean Welleck , Jason Weston , Chenxi Whitehouse , Adina Williams , Jing Xu , Ping Yu , Weizhe Yuan , Jingyu Zhang , Wenting Zhao

Physical Derivatives: Computing policy gradients by physical forward-propagation

Model-free and model-based reinforcement learning are two ends of a spectrum. Learning a good policy without a dynamic model can be prohibitively expensive. Learning the dynamic model of a system can reduce the cost of learning the policy,…

Robotics · Computer Science 2022-01-19 Arash Mehrjou , Ashkan Soleymani , Stefan Bauer , Bernhard Schölkopf