Related papers: Accelerating temporal action proposal generation v…

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

Deep convolutional networks have achieved great success for visual recognition in still images. However, for action recognition in videos, the advantage over traditional methods is not so evident. This paper aims to discover the principles…

Computer Vision and Pattern Recognition · Computer Science 2016-08-03 Limin Wang , Yuanjun Xiong , Zhe Wang , Yu Qiao , Dahua Lin , Xiaoou Tang , Luc Van Gool

Context-aware Proposal Network for Temporal Action Detection

This technical report presents our first place winning solution for temporal action detection task in CVPR-2022 AcitivityNet Challenge. The task aims to localize temporal boundaries of action instances with specific classes in long…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Xiang Wang , Huaxin Zhang , Shiwei Zhang , Changxin Gao , Yuanjie Shao , Nong Sang

Proposal Relation Network for Temporal Action Detection

This technical report presents our solution for temporal action detection task in AcitivityNet Challenge 2021. The purpose of this task is to locate and identify actions of interest in long untrimmed videos. The crucial challenge of the…

Computer Vision and Pattern Recognition · Computer Science 2021-06-23 Xiang Wang , Zhiwu Qing , Ziyuan Huang , Yutong Feng , Shiwei Zhang , Jianwen Jiang , Mingqian Tang , Changxin Gao , Nong Sang

BMN: Boundary-Matching Network for Temporal Action Proposal Generation

Temporal action proposal generation is an challenging and promising task which aims to locate temporal regions in real-world videos where action or event may occur. Current bottom-up proposal generation methods can generate proposals with…

Computer Vision and Pattern Recognition · Computer Science 2019-07-24 Tianwei Lin , Xiao Liu , Xin Li , Errui Ding , Shilei Wen

TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals

Temporal Action Proposal (TAP) generation is an important problem, as fast and accurate extraction of semantically important (e.g. human actions) segments from untrimmed videos is an important step for large-scale video analysis. We propose…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Jiyang Gao , Zhenheng Yang , Chen Sun , Kan Chen , Ram Nevatia

TDN: Temporal Difference Networks for Efficient Action Recognition

Temporal modeling still remains challenging for action recognition in videos. To mitigate this issue, this paper presents a new video architecture, termed as Temporal Difference Network (TDN), with a focus on capturing multi-scale temporal…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Limin Wang , Zhan Tong , Bin Ji , Gangshan Wu

Rethinking the Faster R-CNN Architecture for Temporal Action Localization

We propose TAL-Net, an improved approach to temporal action localization in video that is inspired by the Faster R-CNN object detection framework. TAL-Net addresses three key shortcomings of existing approaches: (1) we improve receptive…

Computer Vision and Pattern Recognition · Computer Science 2018-04-23 Yu-Wei Chao , Sudheendra Vijayanarasimhan , Bryan Seybold , David A. Ross , Jia Deng , Rahul Sukthankar

Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Accurate temporal action proposals play an important role in detecting actions from untrimmed videos. The existing approaches have difficulties in capturing global contextual information and simultaneously localizing actions with different…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Jialin Gao , Zhixiang Shi , Jiani Li , Guanshuo Wang , Yufeng Yuan , Shiming Ge , Xi Zhou

MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation

Temporally locating and classifying action segments in long untrimmed videos is of particular interest to many applications like surveillance and robotics. While traditional approaches follow a two-step pipeline, by generating frame-wise…

Computer Vision and Pattern Recognition · Computer Science 2019-04-03 Yazan Abu Farha , Juergen Gall

BSN: Boundary Sensitive Network for Temporal Action Proposal Generation

Temporal action proposal generation is an important yet challenging problem, since temporal proposals with rich action content are indispensable for analysing real-world videos with long duration and high proportion irrelevant content. This…

Computer Vision and Pattern Recognition · Computer Science 2018-09-27 Tianwei Lin , Xu Zhao , Haisheng Su , Chongjing Wang , Ming Yang

Boundary Content Graph Neural Network for Temporal Action Proposal Generation

Temporal action proposal generation plays an important role in video action understanding, which requires localizing high-quality action content precisely. However, generating temporal proposals with both precise boundaries and high-quality…

Computer Vision and Pattern Recognition · Computer Science 2020-08-05 Yueran Bai , Yingying Wang , Yunhai Tong , Yang Yang , Qiyue Liu , Junhui Liu

Temporal Action Proposal Generation with Transformers

Transformer networks are effective at modeling long-range contextual information and have recently demonstrated exemplary performance in the natural language processing domain. Conventionally, the temporal action proposal generation (TAPG)…

Computer Vision and Pattern Recognition · Computer Science 2021-05-26 Lining Wang , Haosen Yang , Wenhao Wu , Hongxun Yao , Hujie Huang

Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator

Temporal action localization in videos presents significant challenges in the field of computer vision. While the boundary-sensitive method has been widely adopted, its limitations include incomplete use of intermediate and global…

Computer Vision and Pattern Recognition · Computer Science 2023-03-07 Qing Song , Yang Zhou , Mengjie Hu , Chun Liu

Temporal Segment Networks for Action Recognition in Videos

Deep convolutional networks have achieved great success for image recognition. However, for action recognition in videos, their advantage over traditional methods is not so evident. We present a general and flexible video-level framework…

Computer Vision and Pattern Recognition · Computer Science 2017-05-09 Limin Wang , Yuanjun Xiong , Zhe Wang , Yu Qiao , Dahua Lin , Xiaoou Tang , Luc Van Gool

Multi-granularity Generator for Temporal Action Proposal

Temporal action proposal generation is an important task, aiming to localize the video segments containing human actions in an untrimmed video. In this paper, we propose a multi-granularity generator (MGG) to perform the temporal action…

Computer Vision and Pattern Recognition · Computer Science 2019-04-15 Yuan Liu , Lin Ma , Yifeng Zhang , Wei Liu , Shih-Fu Chang

Temporal Recurrent Networks for Online Action Detection

Most work on temporal action detection is formulated as an offline problem, in which the start and end times of actions are determined after the entire video is fully observed. However, important real-time applications including…

Computer Vision and Pattern Recognition · Computer Science 2019-03-26 Mingze Xu , Mingfei Gao , Yi-Ting Chen , Larry S. Davis , David J. Crandall

TEINet: Towards an Efficient Architecture for Video Recognition

Efficiency is an important issue in designing video architectures for action recognition. 3D CNNs have witnessed remarkable progress in action recognition from videos. However, compared with their 2D counterparts, 3D convolutions often…

Computer Vision and Pattern Recognition · Computer Science 2019-11-22 Zhaoyang Liu , Donghao Luo , Yabiao Wang , Limin Wang , Ying Tai , Chengjie Wang , Jilin Li , Feiyue Huang , Tong Lu

Temporal Context Network for Activity Localization in Videos

We present a Temporal Context Network (TCN) for precise temporal localization of human activities. Similar to the Faster-RCNN architecture, proposals are placed at equal intervals in a video which span multiple temporal scales. We propose a…

Computer Vision and Pattern Recognition · Computer Science 2017-08-09 Xiyang Dai , Bharat Singh , Guyue Zhang , Larry S. Davis , Yan Qiu Chen

A Real-time Action Representation with Temporal Encoding and Deep Compression

Deep neural networks have achieved remarkable success for video-based action recognition. However, most of existing approaches cannot be deployed in practice due to the high computational cost. To address this challenge, we propose a new…

Computer Vision and Pattern Recognition · Computer Science 2020-06-18 Kun Liu , Wu Liu , Huadong Ma , Mingkui Tan , Chuang Gan

Temporal Context Aggregation Network for Temporal Action Proposal Refinement

Temporal action proposal generation aims to estimate temporal intervals of actions in untrimmed videos, which is a challenging yet important task in the video understanding field. The proposals generated by current methods still suffer from…

Computer Vision and Pattern Recognition · Computer Science 2021-03-25 Zhiwu Qing , Haisheng Su , Weihao Gan , Dongliang Wang , Wei Wu , Xiang Wang , Yu Qiao , Junjie Yan , Changxin Gao , Nong Sang