Related papers: FlowControl: Optical Flow Based Visual Servoing

One-Shot Imitation Filming of Human Motion Videos

Imitation learning has been applied to mimic the operation of a human cameraman in several autonomous cinematography systems. To imitate different filming styles, existing methods train multiple models, where each model handles a particular…

Computer Vision and Pattern Recognition · Computer Science 2019-12-24 Chong Huang , Yuanjie Dang , Peng Chen , Xin Yang , Kwang-Ting Cheng

Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images

Optical flow estimation is a crucial subfield of computer vision, serving as a foundation for video tasks. However, the real-world robustness is limited by animated synthetic datasets for training. This introduces domain gaps when applied…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Yingping Liang , Ying Fu , Yutao Hu , Wenqi Shao , Jiaming Liu , Debing Zhang

FlowRetrieval: Flow-Guided Data Retrieval for Few-Shot Imitation Learning

Few-shot imitation learning relies on only a small number of task-specific demonstrations to efficiently adapt a policy for a given downstream task. Retrieval-based methods come with a promise of retrieving relevant past experiences to…

Robotics · Computer Science 2024-10-14 Li-Heng Lin , Yuchen Cui , Amber Xie , Tianyu Hua , Dorsa Sadigh

Imitation Learning-based Visual Servoing for Tracking Moving Objects

In everyday life collaboration tasks between human operators and robots, the former necessitate simple ways for programming new skills, the latter have to show adaptive capabilities to cope with environmental changes. The joint use of…

Robotics · Computer Science 2023-09-15 Rocco Felici , Matteo Saveriano , Loris Roveda , Antonio Paolillo

FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation

This paper investigates training better visual world models for robot manipulation, i.e., models that can predict future visual observations by conditioning on past frames and robot actions. Specifically, we consider world models that…

Robotics · Computer Science 2025-05-16 Jun Guo , Xiaojian Ma , Yikai Wang , Min Yang , Huaping Liu , Qing Li

3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model

Manipulation has long been a challenging task for robots, while humans can effortlessly perform complex interactions with objects, such as hanging a cup on the mug rack. A key reason is the lack of a large and uniform dataset for teaching…

Robotics · Computer Science 2025-06-09 Hongyan Zhi , Peihao Chen , Siyuan Zhou , Yubo Dong , Quanxi Wu , Lei Han , Mingkui Tan

FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis

We present FloVD, a novel video diffusion model for camera-controllable video generation. FloVD leverages optical flow to represent the motions of the camera and moving objects. This approach offers two key benefits. Since optical flow can…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Wonjoon Jin , Qi Dai , Chong Luo , Seung-Hwan Baek , Sunghyun Cho

Flow-Enabled Generalization to Human Demonstrations in Few-Shot Imitation Learning

Imitation Learning (IL) enables robots to learn complex skills from demonstrations without explicit task modeling, but it typically requires large amounts of demonstrations, creating significant collection costs. Prior work has investigated…

Robotics · Computer Science 2026-03-02 Runze Tang , Penny Sweetser

NovaFlow: Zero-Shot Manipulation via Actionable Flow from Generated Videos

Enabling robots to execute novel manipulation tasks zero-shot is a central goal in robotics. Most existing methods assume in-distribution tasks or rely on fine-tuning with embodiment-matched data, limiting transfer across platforms. We…

Robotics · Computer Science 2025-10-10 Hongyu Li , Lingfeng Sun , Yafei Hu , Duy Ta , Jennifer Barry , George Konidaris , Jiahui Fu

FlowPolicy: Enabling Fast and Robust 3D Flow-based Policy via Consistency Flow Matching for Robot Manipulation

Robots can acquire complex manipulation skills by learning policies from expert demonstrations, which is often known as vision-based imitation learning. Generating policies based on diffusion and flow matching models has been shown to be…

Robotics · Computer Science 2024-12-17 Qinglun Zhang , Zhen Liu , Haoqiang Fan , Guanghui Liu , Bing Zeng , Shuaicheng Liu

FlowCorrect: Efficient Interactive Correction of Generative Flow Policies for Robotic Manipulation

Generative manipulation policies can fail catastrophically under deployment-time distribution shift, yet many failures are near-misses: the robot reaches almost-correct poses and would succeed with a small corrective motion. We propose…

Robotics · Computer Science 2026-03-05 Edgar Welte , Yitian Shi , Rosa Wolf , Maximillian Gilles , Rania Rayyes

Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks

Visual servoing, the method of controlling robot motion through feedback from visual sensors, has seen significant advancements with the integration of optical flow-based methods. However, its application remains limited by inherent…

Robotics · Computer Science 2024-12-10 Pranjali Pathre , Gunjan Gupta , M. Nomaan Qureshi , Mandyam Brunda , Samarth Brahmbhatt , K. Madhava Krishna

FlowPlan: Zero-Shot Task Planning with LLM Flow Engineering for Robotic Instruction Following

Robotic instruction following tasks require seamless integration of visual perception, task planning, target localization, and motion execution. However, existing task planning methods for instruction following are either data-driven or…

Robotics · Computer Science 2025-03-05 Zijun Lin , Chao Tang , Hanjing Ye , Hong Zhang

FlowTrack: Point-level Flow Network for 3D Single Object Tracking

3D single object tracking (SOT) is a crucial task in fields of mobile robotics and autonomous driving. Traditional motion-based approaches achieve target tracking by estimating the relative movement of target between two consecutive frames.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-03 Shuo Li , Yubo Cui , Zhiheng Li , Zheng Fang

Unsupervised Bi-directional Flow-based Video Generation from one Snapshot

Imagining multiple consecutive frames given one single snapshot is challenging, since it is difficult to simultaneously predict diverse motions from a single image and faithfully generate novel frames without visual distortions. In this…

Computer Vision and Pattern Recognition · Computer Science 2019-03-05 Lu Sheng , Junting Pan , Jiaming Guo , Jing Shao , Xiaogang Wang , Chen Change Loy

Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera

Optical flow captures the motion of pixels in an image sequence over time, providing information about movement, depth, and environmental structure. Flying insects utilize this information to navigate and avoid obstacles, allowing them to…

Robotics · Computer Science 2025-04-22 Yu Hu , Yuang Zhang , Yunlong Song , Yang Deng , Feng Yu , Linzuo Zhang , Weiyao Lin , Danping Zou , Wenxian Yu

Demonstrate Once, Imitate Immediately (DOME): Learning Visual Servoing for One-Shot Imitation Learning

We present DOME, a novel method for one-shot imitation learning, where a task can be learned from just a single demonstration and then be deployed immediately, without any further data collection or training. DOME does not require prior…

Robotics · Computer Science 2022-07-29 Eugene Valassakis , Georgios Papagiannis , Norman Di Palo , Edward Johns

One-shot Imitation Learning via Interaction Warping

Imitation learning of robot policies from few demonstrations is crucial in open-ended applications. We propose a new method, Interaction Warping, for learning SE(3) robotic manipulation policies from a single demonstration. We infer the 3D…

Robotics · Computer Science 2023-11-07 Ondrej Biza , Skye Thompson , Kishore Reddy Pagidi , Abhinav Kumar , Elise van der Pol , Robin Walters , Thomas Kipf , Jan-Willem van de Meent , Lawson L. S. Wong , Robert Platt

Conditional Visual Servoing for Multi-Step Tasks

Visual Servoing has been effectively used to move a robot into specific target locations or to track a recorded demonstration. It does not require manual programming, but it is typically limited to settings where one demonstration maps to…

Robotics · Computer Science 2022-05-18 Sergio Izquierdo , Max Argus , Thomas Brox

Sim2Real View Invariant Visual Servoing by Recurrent Control

Humans are remarkably proficient at controlling their limbs and tools from a wide range of viewpoints and angles, even in the presence of optical distortions. In robotics, this ability is referred to as visual servoing: moving a tool or…

Computer Vision and Pattern Recognition · Computer Science 2017-12-21 Fereshteh Sadeghi , Alexander Toshev , Eric Jang , Sergey Levine