Related papers: FlowControl: Optical Flow Based Visual Servoing

200 papers

Imitation learning has been applied to mimic the operation of a human cameraman in several autonomous cinematography systems. To imitate different filming styles, existing methods train multiple models, where each model handles a particular…

Computer Vision and Pattern Recognition · Computer Science 2019-12-24 Chong Huang , Yuanjie Dang , Peng Chen , Xin Yang , Kwang-Ting Cheng

Optical flow estimation is a crucial subfield of computer vision, serving as a foundation for video tasks. However, the real-world robustness is limited by animated synthetic datasets for training. This introduces domain gaps when applied…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Yingping Liang , Ying Fu , Yutao Hu , Wenqi Shao , Jiaming Liu , Debing Zhang

Few-shot imitation learning relies on only a small amount of task-specific demonstrations to efficiently adapt a policy for a given downstream task. Retrieval-based methods come with a promise of retrieving relevant past experiences to…

Robotics · Computer Science 2024-10-14 Li-Heng Lin , Yuchen Cui , Amber Xie , Tianyu Hua , Dorsa Sadigh

In everyday life collaboration tasks between human operators and robots, the former necessitate simple ways for programming new skills, the latter have to show adaptive capabilities to cope with environmental changes. The joint use of…

Robotics · Computer Science 2023-09-15 Rocco Felici , Matteo Saveriano , Loris Roveda , Antonio Paolillo

This paper investigates training better visual world models for robot manipulation, i.e., models that can predict future visual observations by conditioning on past frames and robot actions. Specifically, we consider world models that…

Robotics · Computer Science 2025-05-16 Jun Guo , Xiaojian Ma , Yikai Wang , Min Yang , Huaping Liu , Qing Li

Manipulation has long been a challenging task for robots, while humans can effortlessly perform complex interactions with objects, such as hanging a cup on the mug rack. A key reason is the lack of a large and uniform dataset for teaching…

Robotics · Computer Science 2025-06-09 Hongyan Zhi , Peihao Chen , Siyuan Zhou , Yubo Dong , Quanxi Wu , Lei Han , Mingkui Tan

We present FloVD, a novel video diffusion model for camera-controllable video generation. FloVD leverages optical flow to represent the motions of the camera and moving objects. This approach offers two key benefits. Since optical flow can…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Wonjoon Jin , Qi Dai , Chong Luo , Seung-Hwan Baek , Sunghyun Cho

Imitation Learning (IL) enables robots to learn complex skills from demonstrations without explicit task modeling, but it typically requires large amounts of demonstrations, creating significant collection costs. Prior work has investigated…

Robotics · Computer Science 2026-03-02 Runze Tang , Penny Sweetser

Enabling robots to execute novel manipulation tasks zero-shot is a central goal in robotics. Most existing methods assume in-distribution tasks or rely on fine-tuning with embodiment-matched data, limiting transfer across platforms. We…

Robotics · Computer Science 2025-10-10 Hongyu Li , Lingfeng Sun , Yafei Hu , Duy Ta , Jennifer Barry , George Konidaris , Jiahui Fu

Robots can acquire complex manipulation skills by learning policies from expert demonstrations, which is often known as vision-based imitation learning. Generating policies based on diffusion and flow matching models has been shown to be…

Robotics · Computer Science 2024-12-17 Qinglun Zhang , Zhen Liu , Haoqiang Fan , Guanghui Liu , Bing Zeng , Shuaicheng Liu

Generative manipulation policies can fail catastrophically under deployment-time distribution shift, yet many failures are near-misses: the robot reaches almost-correct poses and would succeed with a small corrective motion. We propose…

Robotics · Computer Science 2026-03-05 Edgar Welte , Yitian Shi , Rosa Wolf , Maximillian Gilles , Rania Rayyes

Visual servoing, the method of controlling robot motion through feedback from visual sensors, has seen significant advancements with the integration of optical flow-based methods. However, its application remains limited by inherent…

Robotic instruction following tasks require seamless integration of visual perception, task planning, target localization, and motion execution. However, existing task planning methods for instruction following are either data-driven or…

Robotics · Computer Science 2025-03-05 Zijun Lin , Chao Tang , Hanjing Ye , Hong Zhang

3D single object tracking (SOT) is a crucial task in fields of mobile robotics and autonomous driving. Traditional motion-based approaches achieve target tracking by estimating the relative movement of target between two consecutive frames.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-03 Shuo Li , Yubo Cui , Zhiheng Li , Zheng Fang

Imagining multiple consecutive frames given one single snapshot is challenging, since it is difficult to simultaneously predict diverse motions from a single image and faithfully generate novel frames without visual distortions. In this…

Computer Vision and Pattern Recognition · Computer Science 2019-03-05 Lu Sheng , Junting Pan , Jiaming Guo , Jing Shao , Xiaogang Wang , Chen Change Loy

Optical flow captures the motion of pixels in an image sequence over time, providing information about movement, depth, and environmental structure. Flying insects utilize this information to navigate and avoid obstacles, allowing them to…

Robotics · Computer Science 2025-04-22 Yu Hu , Yuang Zhang , Yunlong Song , Yang Deng , Feng Yu , Linzuo Zhang , Weiyao Lin , Danping Zou , Wenxian Yu

We present DOME, a novel method for one-shot imitation learning, where a task can be learned from just a single demonstration and then be deployed immediately, without any further data collection or training. DOME does not require prior…

Robotics · Computer Science 2022-07-29 Eugene Valassakis , Georgios Papagiannis , Norman Di Palo , Edward Johns

Imitation learning of robot policies from few demonstrations is crucial in open-ended applications. We propose a new method, Interaction Warping, for learning SE(3) robotic manipulation policies from a single demonstration. We infer the 3D…

Visual Servoing has been effectively used to move a robot into specific target locations or to track a recorded demonstration. It does not require manual programming, but it is typically limited to settings where one demonstration maps to…

Robotics · Computer Science 2022-05-18 Sergio Izquierdo , Max Argus , Thomas Brox

Humans are remarkably proficient at controlling their limbs and tools from a wide range of viewpoints and angles, even in the presence of optical distortions. In robotics, this ability is referred to as visual servoing: moving a tool or…

Computer Vision and Pattern Recognition · Computer Science 2017-12-21 Fereshteh Sadeghi , Alexander Toshev , Eric Jang , Sergey Levine