Related papers: A Spatial-Constraint Model for Manipulating Static…

Deep Visual Constraints: Neural Implicit Models for Manipulation Planning from Visual Input

Manipulation planning is the problem of finding a sequence of robot configurations that involves interactions with objects in the scene, e.g., grasping and placing an object, or more general tool-use. To achieve such interactions,…

Robotics · Computer Science 2022-08-01 Jung-Su Ha , Danny Driess , Marc Toussaint

Accurate Vision-based Manipulation through Contact Reasoning

Planning contact interactions is one of the core challenges of many robotic tasks. Optimizing contact locations while taking dynamics into account is computationally costly and, in environments that are only partially observable, executing…

Robotics · Computer Science 2020-04-20 Alina Kloss , Maria Bauza , Jiajun Wu , Joshua B. Tenenbaum , Alberto Rodriguez , Jeannette Bohg

Learning Task Constraints from Demonstration for Hybrid Force/Position Control

We present a novel method for learning hybrid force/position control from demonstration. We learn a dynamic constraint frame aligned to the direction of desired force using Cartesian Dynamic Movement Primitives. In contrast to approaches…

Robotics · Computer Science 2022-05-05 Adam Conkey , Tucker Hermans

Force Policy: Learning Hybrid Force-Position Control Policy under Interaction Frame for Contact-Rich Manipulation

Contact-rich manipulation demands human-like integration of perception and force feedback: vision should guide task progress, while high-frequency interaction control must stabilize contact under uncertainty. Existing learning-based…

Robotics · Computer Science 2026-05-12 Hongjie Fang , Shirun Tang , Mingyu Mei , Haoxiang Qin , Zihao He , Jingjing Chen , Ying Feng , Chenxi Wang , Wanxi Liu , Zaixing He , Cewu Lu , Shiquan Wang

Active Animations of Reduced Deformable Models with Environment Interactions

We present an efficient spacetime optimization method to automatically generate animations for a general volumetric, elastically deformable body. Our approach can model the interactions between the body and the environment and automatically…

Graphics · Computer Science 2017-09-11 Zherong Pan , Dinesh Manocha

Teaching contact-rich tasks from visual demonstrations by constraint extraction

Contact-rich manipulation involves kinematic constraints on the task motion, typically with discrete transitions between these constraints during the task. Allowing the robot to detect and reason about these contact constraints can support…

Robotics · Computer Science 2023-04-05 Christian Hegeler , Filippo Rozzi , Loris Roveda , Kevin Haninger

Interactive Perception for Deformable Object Manipulation

Interactive perception enables robots to manipulate the environment and objects to bring them into states that benefit the perception process. Deformable objects pose challenges to this due to significant manipulation difficulty and…

Robotics · Computer Science 2024-10-27 Zehang Weng , Peng Zhou , Hang Yin , Alexander Kravberg , Anastasiia Varava , David Navarro-Alarcon , Danica Kragic

4D Visualization of Dynamic Events from Unconstrained Multi-View Videos

We present a data-driven approach for 4D space-time visualization of dynamic events from videos captured by hand-held multiple cameras. Key to our approach is the use of self-supervised neural networks specific to the scene to compose…

Computer Vision and Pattern Recognition · Computer Science 2020-05-28 Aayush Bansal , Minh Vo , Yaser Sheikh , Deva Ramanan , Srinivasa Narasimhan

Synthesizing Physically Plausible Human Motions in 3D Scenes

We present a physics-based character control framework for synthesizing human-scene interactions. Recent advances adopt physics simulation to mitigate artifacts produced by data-driven kinematic approaches. However, existing physics-based…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Liang Pan , Jingbo Wang , Buzhen Huang , Junyu Zhang , Haofan Wang , Xu Tang , Yangang Wang

Situational Fusion of Visual Representation for Visual Navigation

A complex visual navigation task puts an agent in different situations which call for a diverse range of visual perception abilities. For example, to "go to the nearest chair", the agent might need to identify a chair in a living room using…

Computer Vision and Pattern Recognition · Computer Science 2021-08-05 Bokui Shen , Danfei Xu , Yuke Zhu , Leonidas J. Guibas , Li Fei-Fei , Silvio Savarese

Towards Active Vision for Action Localization with Reactive Control and Predictive Learning

Visual event perception tasks such as action localization have primarily focused on supervised learning settings under a static observer, i.e., the camera is static and cannot be controlled by an algorithm. They are often restricted by the…

Computer Vision and Pattern Recognition · Computer Science 2021-11-11 Shubham Trehan , Sathyanarayanan N. Aakur

Spatiotemporal sensistivity and visual attention for efficient rendering of dynamic environments

We present a method to accelerate global illumination computation in dynamic environments by taking advantage of limitations of the human visual system. A model of visual attention is used to locate regions of interest in a scene and to…

Graphics · Computer Science 2007-05-23 Yang Li Hector Yee

Implicit State Estimation via Video Replanning

Video-based representations have gained prominence in planning and decision-making due to their ability to encode rich spatiotemporal dynamics and geometric relationships. These representations enable flexible and generalizable solutions…

Robotics · Computer Science 2026-02-11 Po-Chen Ko , Jiayuan Mao , Yu-Hsiang Fu , Hsien-Jeng Yeh , Chu-Rong Chen , Wei-Chiu Ma , Yilun Du , Shao-Hua Sun

FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation

Learning to manipulate objects efficiently, particularly those involving sustained contact (e.g., pushing, sliding) and articulated parts (e.g., drawers, doors), presents significant challenges. Traditional methods, such as robot-centric…

Robotics · Computer Science 2025-03-18 Shijie Fang , Wenchang Gao , Shivam Goel , Christopher Thierauf , Matthias Scheutz , Jivko Sinapov

Make Interaction Situated: Designing User Acceptable Interaction for Situated Visualization in Public Environments

Situated visualization blends data into the real world to fulfill individuals' contextual information needs. However, interacting with situated visualization in public environments faces challenges posed by user acceptance and contextual…

Human-Computer Interaction · Computer Science 2024-08-08 Qian Zhu , Zhuo Wang , Wei Zeng , Wai Tong , Weiyue Lin , Xiaojuan Ma

Learning Manipulation under Physics Constraints with Visual Perception

Understanding physical phenomena is a key competence that enables humans and animals to act and interact under uncertain perception in previously unseen environments containing novel objects and their configurations. In this work, we…

Robotics · Computer Science 2019-04-23 Wenbin Li , Aleš Leonardis , Jeannette Bohg , Mario Fritz

Static force field representation of environments based on agents nonlinear motions

This paper presents a methodology that aims at the incremental representation of areas inside environments in terms of attractive forces. It is proposed a parametric representation of velocity fields ruling the dynamics of moving agents. It…

Machine Learning · Computer Science 2019-09-10 Damian Campo , Alejandro Betancourt , Lucio Marcenaro , Carlo Regazzoni

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model

Vision-language-action (VLA) models have recently shown strong potential in enabling robots to follow language instructions and execute precise actions. However, most VLAs are built upon vision-language models pretrained solely on 2D data,…

Robotics · Computer Science 2025-10-20 Fuhao Li , Wenxuan Song , Han Zhao , Jingbo Wang , Pengxiang Ding , Donglin Wang , Long Zeng , Haoang Li

Learning Physics-Based Manipulation in Clutter: Combining Image-Based Generalization and Look-Ahead Planning

Physics-based manipulation in clutter involves complex interaction between multiple objects. In this paper, we consider the problem of learning, from interaction in a physics simulator, manipulation skills to solve this multi-step…

Robotics · Computer Science 2019-07-29 Wissam Bejjani , Mehmet R. Dogar , Matteo Leonetti

Playable Environments: Video Manipulation in Space and Time

We present Playable Environments - a new representation for interactive video generation and manipulation in space and time. With a single image at inference time, our novel framework allows the user to move objects in 3D while generating a…

Computer Vision and Pattern Recognition · Computer Science 2022-03-17 Willi Menapace , Stéphane Lathuilière , Aliaksandr Siarohin , Christian Theobalt , Sergey Tulyakov , Vladislav Golyanik , Elisa Ricci