Related papers: Developing Motion Code Embedding for Action Recogn…

Estimating Motion Codes from Demonstration Videos

A motion taxonomy can encode manipulations as a binary-encoded representation, which we refer to as motion codes. These motion codes innately represent a manipulation action in an embedded space that describes the motion's mechanical…

Robotics · Computer Science 2021-06-02 Maxat Alibayev , David Paulius , Yu Sun

A Motion Taxonomy for Manipulation Embedding

To represent motions from a mechanical point of view, this paper explores motion embedding using the motion taxonomy. With this taxonomy, manipulations can be described and represented as binary strings called motion codes. Motion codes…

Robotics · Computer Science 2020-07-15 David Paulius , Nicholas Eales , Yu Sun

Finding Action Tubes

We address the problem of action detection in videos. Driven by the latest progress in object detection from 2D images, we build action models using rich feature hierarchies derived from shape and kinematic cues. We incorporate appearance…

Computer Vision and Pattern Recognition · Computer Science 2014-11-25 Georgia Gkioxari , Jitendra Malik

Motion Inversion for Video Customization

In this work, we present a novel approach for motion customization in video generation, addressing the widespread gap in the exploration of motion representation within video generative models. Recognizing the unique challenges posed by the…

Computer Vision and Pattern Recognition · Computer Science 2024-10-18 Luozhou Wang , Ziyang Mai , Guibao Shen , Yixun Liang , Xin Tao , Pengfei Wan , Di Zhang , Yijun Li , Yingcong Chen

The TIME Machine: On The Power of Motion for Efficient Perception

Video representation learning has seen tremendous progress in recent years. This has been driven by many factors, including the scale of training and the success of visual models trained contrastively with language. While these factors have…

Computer Vision and Pattern Recognition · Computer Science 2026-05-25 Mantas Skackauskas , Xinyue Hao , Laura Sevilla-Lara

Macroblock Classification Method for Video Applications Involving Motions

In this paper, a macroblock classification method is proposed for various video processing applications involving motions. Based on the analysis of the Motion Vector field in the compressed video, we propose to classify Macroblocks of each…

Multimedia · Computer Science 2016-11-17 Weiyao Lin , Ming-Ting Sun , Hongxiang Li , Zhenzhong Chen , Wei Li , Bing Zhou

Diving Deep into the Motion Representation of Video-Text Models

Videos are more informative than images because they capture the dynamics of the scene. By representing motion in videos, we can capture dynamic activities. In this work, we introduce GPT-4 generated motion descriptions that capture…

Computer Vision and Pattern Recognition · Computer Science 2024-06-10 Chinmaya Devaraj , Cornelia Fermuller , Yiannis Aloimonos

Classifying and Visualizing Motion Capture Sequences using Deep Neural Networks

The gesture recognition using motion capture data and depth sensors has recently drawn more attention in vision recognition. Currently most systems only classify dataset with a couple of dozens different actions. Moreover, feature…

Computer Vision and Pattern Recognition · Computer Science 2014-09-02 Kyunghyun Cho , Xi Chen

Hierarchical Motion Understanding via Motion Programs

Current approaches to video analysis of human motion focus on raw pixels or keypoints as the basic units of reasoning. We posit that adding higher-level motion primitives, which can capture natural coarser units of motion such as backswing…

Computer Vision and Pattern Recognition · Computer Science 2021-04-23 Sumith Kulal , Jiayuan Mao , Alex Aiken , Jiajun Wu

Learning Long-term Motion Embeddings for Efficient Kinematics Generation

Understanding and predicting motion is a fundamental component of visual intelligence. Although modern video models exhibit strong comprehension of scene dynamics, exploring multiple possible futures through full video synthesis remains…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Nick Stracke , Kolja Bauer , Stefan Andreas Baumann , Miguel Angel Bautista , Josh Susskind , Björn Ommer

Deep motion estimation for parallel inter-frame prediction in video compression

Standard video codecs rely on optical flow to guide inter-frame prediction: pixels from reference frames are moved via motion vectors to predict target video frames. We propose to learn binary motion codes that are encoded based on an input…

Image and Video Processing · Electrical Eng. & Systems 2019-12-12 André Nortje , Herman A. Engelbrecht , Herman Kamper

Improving HEVC Encoding of Rendered Video Data Using True Motion Information

This paper shows that motion vectors representing the true motion of an object in a scene can be exploited to improve the encoding process of computer generated video sequences. Therefore, a set of sequences is presented for which the true…

Image and Video Processing · Electrical Eng. & Systems 2023-09-14 Christian Herglotz , David Müller , Andreas Weinlich , Frank Bauer , Michael Ortner , Marc Stamminger , André Kaup

A new way of video compression via forward-referencing using deep learning

To exploit high temporal correlations in video frames of the same scene, the current frame is predicted from the already-encoded reference frames using block-based motion estimation and compensation techniques. While this approach can…

Computer Vision and Pattern Recognition · Computer Science 2022-08-16 S. M. A. K. Rajin , M. Murshed , M. Paul , S. W. Teng , J. Ma

Manipulation Motion Taxonomy and Coding for Robots

This paper introduces a taxonomy of manipulations as seen especially in cooking for 1) grouping manipulations from the robotics point of view, 2) consolidating aliases and removing ambiguity for motion types, and 3) provide a path to…

Robotics · Computer Science 2020-08-03 David Paulius , Yongqiang Huang , Jason Meloncon , Yu Sun

Learning to Recognize 3D Human Action from A New Skeleton-based Representation Using Deep Convolutional Neural Networks

Recognizing human actions in untrimmed videos is an important challenging task. An effective 3D motion representation and a powerful learning model are two key factors influencing recognition performance. In this paper we introduce a new…

Computer Vision and Pattern Recognition · Computer Science 2018-12-31 Huy-Hieu Pham , Louahdi Khoudour , Alain Crouzil , Pablo Zegers , Sergio A. Velastin

Leveraging Motion Priors in Videos for Improving Human Segmentation

Despite many advances in deep-learning based semantic segmentation, performance drop due to distribution mismatch is often encountered in the real world. Recently, a few domain adaptation and active learning approaches have been proposed to…

Computer Vision and Pattern Recognition · Computer Science 2018-07-31 Yu-Ting Chen , Wen-Yen Chang , Hai-Lun Lu , Tingfan Wu , Min Sun

Understanding image motion with group representations

Motion is an important signal for agents in dynamic environments, but learning to represent motion from unlabeled video is a difficult and underconstrained problem. We propose a model of motion based on elementary group properties of…

Computer Vision and Pattern Recognition · Computer Science 2018-02-27 Andrew Jaegle , Stephen Phillips , Daphne Ippolito , Kostas Daniilidis

Masked Motion Encoding for Self-Supervised Video Representation Learning

How to learn discriminative video representation from unlabeled videos is challenging but crucial for video analysis. The latest attempts seek to learn a representation model by predicting the appearance contents in the masked regions.…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Xinyu Sun , Peihao Chen , Liangwei Chen , Changhao Li , Thomas H. Li , Mingkui Tan , Chuang Gan

Motion Transformer for Unsupervised Image Animation

Image animation aims to animate a source image by using motion learned from a driving video. Current state-of-the-art methods typically use convolutional neural networks (CNNs) to predict motion information, such as motion keypoints and…

Computer Vision and Pattern Recognition · Computer Science 2022-09-29 Jiale Tao , Biao Wang , Tiezheng Ge , Yuning Jiang , Wen Li , Lixin Duan

Learning for Video Compression

One key challenge to learning-based video compression is that motion predictive coding, a very effective tool for video compression, can hardly be trained into a neural network. In this paper we propose the concept of PixelMotionCNN (PMCNN)…

Multimedia · Computer Science 2019-01-15 Zhibo Chen , Tianyu He , Xin Jin , Feng Wu