Related papers: Structured Attention Composition for Temporal Acti…

200 papers

We address the problem of temporal action localization in videos. We pose action localization as a structured prediction over arbitrary-length temporal windows, where each window is scored as the sum of frame-wise classification scores.…

Computer Vision and Pattern Recognition · Computer Science 2017-04-18 Zehuan Yuan, Jonathan C. Stroud, Tong Lu, Jia Deng

Temporal action localization plays an important role in video analysis, which aims to localize and classify actions in untrimmed videos. The previous methods often predict actions on a feature space of a single-temporal scale. However, the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Zan Gao, Xinglei Cui, Tao Zhuo, Zhiyong Cheng, An-An Liu, Meng Wang, Shenyong Chen

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science 2020-01-23 Ashraful Islam, Richard J. Radke

Temporal action localization is an important and challenging task that aims to locate temporal regions in real-world untrimmed videos where actions occur and recognize their classes. It is widely acknowledged that video context is a…

Computer Vision and Pattern Recognition · Computer Science 2021-03-10 Xin Qin, Hanbin Zhao, Guangchen Lin, Hao Zeng, Songcen Xu, Xi Li

Weakly-supervised action localization aims to recognize and localize action instances in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning (MIL), where the predictions of unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Guiqin Wang, Peng Zhao, Cong Zhao, Shusen Yang, Jie Cheng, Luziwei Leng, Jianxing Liao, Qinghai Guo

Weakly supervised temporal action localization is a challenging vision task due to the absence of ground-truth temporal locations of actions in the training videos. With only video-level supervision during training, most existing methods…

Computer Vision and Pattern Recognition · Computer Science 2021-03-26 Ashraful Islam, Chengjiang Long, Richard Radke

Recognizing human actions is fundamentally a spatio-temporal reasoning problem, and should be, at least to some extent, invariant to the appearance of the human and the objects involved. Motivated by this hypothesis, in this work, we take…

Computer Vision and Pattern Recognition · Computer Science 2021-11-04 Gorjan Radevski, Marie-Francine Moens, Tinne Tuytelaars

Nowadays, the interaction between humans and robots is constantly expanding, requiring more and more human motion recognition applications to operate in real time. However, most works on temporal action detection and recognition perform…

Computer Vision and Pattern Recognition · Computer Science 2021-06-29 Vasiliki I. Vasileiou, Nikolaos Kardaris, Petros Maragos

We introduce Activity Graph Transformer, an end-to-end learnable model for temporal action localization, that receives a video as input and directly predicts a set of action instances that appear in the video. Detecting and localizing…

Computer Vision and Pattern Recognition · Computer Science 2021-01-29 Megha Nawhal, Greg Mori

Most of the current action localization methods follow an anchor-based pipeline: depicting action instances by pre-defined anchors, learning to select the anchors closest to the ground truth, and predicting the confidence of anchors with…

Computer Vision and Pattern Recognition · Computer Science 2020-08-25 Le Yang, Houwen Peng, Dingwen Zhang, Jianlong Fu, Junwei Han

Temporal action localization in untrimmed videos is an important but difficult task. Difficulties are encountered in the application of existing methods when modeling temporal structures of videos. In the present study, we developed a novel…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Yuan Zhou, Hongru Li, Sun-Yuan Kung

Detecting actions in untrimmed videos is an important yet challenging task. In this paper, we present the structured segment network (SSN), a novel framework which models the temporal structure of each action instance via a structured…

Computer Vision and Pattern Recognition · Computer Science 2017-09-19 Yue Zhao, Yuanjun Xiong, Limin Wang, Zhirong Wu, Xiaoou Tang, Dahua Lin

Online temporal action localization from an untrimmed video stream is a challenging problem in computer vision. It is challenging because of i) in an untrimmed video stream, more than one action instance may appear, including background…

Computer Vision and Pattern Recognition · Computer Science 2020-03-18 Da-Hye Yoon, Nam-Gyu Cho, Seong-Whan Lee

Locating actions in long untrimmed videos has been a challenging problem in video content analysis. The performances of existing action localization approaches remain unsatisfactory in precisely determining the beginning and the end of an…

Computer Vision and Pattern Recognition · Computer Science 2018-06-27 Haonan Qiu, Yingbin Zheng, Hao Ye, Yao Lu, Feng Wang, Liang He

State-of-the-art temporal action detectors inefficiently search the entire video for specific actions. Despite the encouraging progress these methods achieve, it is crucial to design automated approaches that only explore parts of the video…

Computer Vision and Pattern Recognition · Computer Science 2018-07-30 Humam Alwassel, Fabian Caba Heilbron, Bernard Ghanem

The main challenge of Temporal Action Localization is to retrieve subtle human actions from various co-occurring ingredients, e.g., context and background, in an untrimmed video. While prior approaches have achieved substantial progress…

Computer Vision and Pattern Recognition · Computer Science 2022-06-24 Kun Xia, Le Wang, Sanping Zhou, Nanning Zheng, Wei Tang

This paper proposes a novel multi-modal transformer network for detecting actions in untrimmed videos. To enrich the action features, our transformer network utilizes a new multi-modal attention mechanism that computes the correlations…

Computer Vision and Pattern Recognition · Computer Science 2023-06-01 Matthew Korban, Scott T. Acton, Peter Youngs

Inspired by the observation that humans are able to process videos efficiently by only paying attention where and when it is needed, we propose an interpretable and easy plug-in spatial-temporal attention mechanism for video action…

Computer Vision and Pattern Recognition · Computer Science 2019-06-04 Lili Meng, Bo Zhao, Bo Chang, Gao Huang, Wei Sun, Frederich Tung, Leonid Sigal

In this paper, we consider the problem of temporal action localization under low-shot (zero-shot & few-shot) scenario, with the goal of detecting and classifying the action instances from arbitrary categories within some untrimmed videos,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-22 Chen Ju, Zeqian Li, Peisen Zhao, Ya Zhang, Xiaopeng Zhang, Qi Tian, Yanfeng Wang, Weidi Xie

Temporal action proposals are a common module in action detection pipelines today. Most current methods for training action proposal modules rely on fully supervised approaches that require large amounts of annotated temporal action…

Computer Vision and Pattern Recognition · Computer Science 2019-10-04 Jingwei Ji, Kaidi Cao, Juan Carlos Niebles