Related papers: SELF-VS: Self-supervised Encoding Learning For Vid…

Progressive Video Summarization via Multimodal Self-supervised Learning

Modern video summarization methods are based on deep neural networks that require a large amount of annotated data for training. However, existing datasets for video summarization are small-scale, easily leading to over-fitting of the deep…

Computer Vision and Pattern Recognition · Computer Science · 2022-10-20 · Li Haopeng, Ke Qiuhong, Gong Mingming, Tom Drummond

Masked Autoencoder for Unsupervised Video Summarization

Summarizing a video requires a diverse understanding of the video, ranging from recognizing scenes to evaluating how much each frame is essential enough to be selected as a summary. Self-supervised learning (SSL) is acknowledged for its…

Computer Vision and Pattern Recognition · Computer Science · 2023-06-05 · Minho Shim, Taeoh Kim, Jinhyung Kim, Dongyoon Wee

Language-Guided Self-Supervised Video Summarization Using Text Semantic Matching Considering the Diversity of the Video

Current video summarization methods rely heavily on supervised computer vision techniques, which demands time-consuming and subjective manual annotations. To overcome these limitations, we investigated self-supervised video summarization.…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-21 · Tomoya Sugihara, Shuntaro Masuda, Ling Xiao, Toshihiko Yamasaki

Video Summarization by Learning from Unpaired Data

We consider the problem of video summarization. Given an input raw video, the goal is to select a small subset of key frames from the input video to create a shorter summary video that best describes the content of the original video. Most…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-10 · Mrigank Rochan, Yang Wang

Video Summarization with Attention-Based Encoder-Decoder Networks

This paper addresses the problem of supervised video summarization by formulating it as a sequence-to-sequence learning problem, where the input is a sequence of original video frames, the output is a keyshot sequence. Our key idea is to…

Computer Vision and Pattern Recognition · Computer Science · 2018-04-17 · Zhong Ji, Kailin Xiong, Yanwei Pang, Xuelong Li

Self-Supervised Learning for Videos: A Survey

The remarkable success of deep learning in various domains relies on the availability of large-scale annotated datasets. However, obtaining annotations is expensive and requires great effort, which is especially challenging for videos.…

Computer Vision and Pattern Recognition · Computer Science · 2023-07-20 · Madeline C. Schiappa, Yogesh S. Rawat, Mubarak Shah

SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning

Continued advances in self-supervised learning have led to significant progress in video representation learning, offering a scalable alternative to supervised approaches by removing the need for manual annotations. Despite strong…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-09 · Fida Mohammad Thoker, Letian Jiang, Chen Zhao, Piyush Bagad, Hazel Doughty, Bernard Ghanem, Cees G. M. Snoek

Enhancing Video Summarization with Context Awareness

Video summarization is a crucial research area that aims to efficiently browse and retrieve relevant information from the vast amount of video content available today. With the exponential growth of multimedia data, the ability to extract…

Computer Vision and Pattern Recognition · Computer Science · 2024-04-09 · Hai-Dang Huynh-Lam, Ngoc-Phuong Ho-Thi, Minh-Triet Tran, Trung-Nghia Le

Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos

Sequential video understanding, as an emerging video understanding task, has driven lots of researchers' attention because of its goal-oriented nature. This paper studies weakly supervised sequential video understanding where the accurate…

Computer Vision and Pattern Recognition · Computer Science · 2023-03-29 · Sixun Dong, Huazhang Hu, Dongze Lian, Weixin Luo, Yicheng Qian, Shenghua Gao

Better and Faster: Knowledge Transfer from Multiple Self-supervised Learning Tasks via Graph Distillation for Video Classification

Video representation learning is a vital problem for classification task. Recently, a promising unsupervised paradigm termed self-supervised learning has emerged, which explores inherent supervisory signals implied in massive data for…

Computer Vision and Pattern Recognition · Computer Science · 2018-04-27 · Chenrui Zhang, Yuxin Peng

Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward

Video summarization aims to facilitate large-scale video browsing by producing short, concise summaries that are diverse and representative of original videos. In this paper, we formulate video summarization as a sequential decision-making…

Computer Vision and Pattern Recognition · Computer Science · 2018-02-15 · Kaiyang Zhou, Yu Qiao, Tao Xiang

Video Summarization Using Fully Convolutional Sequence Networks

This paper addresses the problem of video summarization. Given an input video, the goal is to select a subset of the frames to create a summary video that optimally captures the important information of the input video. With the large…

Computer Vision and Pattern Recognition · Computer Science · 2018-09-03 · Mrigank Rochan, Linwei Ye, Yang Wang

Ultrasound Video Summarization using Deep Reinforcement Learning

Video is an essential imaging modality for diagnostics, e.g. in ultrasound imaging, for endoscopy, or movement assessment. However, video hasn't received a lot of attention in the medical image analysis community. In the clinical practice,…

Computer Vision and Pattern Recognition · Computer Science · 2020-05-20 · Tianrui Liu, Qingjie Meng, Athanasios Vlontzos, Jeremy Tan, Daniel Rueckert, Bernhard Kainz

Uncertainty-Aware and Decoder-Aligned Learning for Video Summarization

Video summarization aims to produce a compact representation of a long video by selecting a subset of temporally important segments that best reflect human preferences. This task is inherently difficult due to strong annotation subjectivity…

Computer Vision and Pattern Recognition · Computer Science · 2026-05-12 · Omer Tariq, Syed Muhammad Raza, Jeongbae Son

SummaryNet: A Multi-Stage Deep Learning Model for Automatic Video Summarisation

Video summarisation can be posed as the task of extracting important parts of a video in order to create an informative summary of what occurred in the video. In this paper we introduce SummaryNet as a supervised learning framework for…

Computer Vision and Pattern Recognition · Computer Science · 2020-02-24 · Ziyad Jappie, David Torpey, Turgay Celik

Learning Summary-Worthy Visual Representation for Abstractive Summarization in Video

Multimodal abstractive summarization for videos (MAS) requires generating a concise textual summary to describe the highlights of a video according to multimodal resources, in our case, the video content and its transcript. Inspired by the…

Computation and Language · Computer Science · 2023-05-09 · Zenan Xu, Xiaojun Meng, Yasheng Wang, Qinliang Su, Zexuan Qiu, Xin Jiang, Qun Liu

A Stacking Ensemble Approach for Supervised Video Summarization

Video summarization methods are usually classified into shot-level or frame-level methods, which are individually used in a general way. This paper investigates the underlying complementarity between the frame-level and shot-level methods,…

Computer Vision and Pattern Recognition · Computer Science · 2022-07-05 · Yubo An, Shenghui Zhao, Guoqiang Zhang

Personalized Video Summarization by Multimodal Video Understanding

Video summarization techniques have been proven to improve the overall user experience when it comes to accessing and comprehending video content. If the user's preference is known, video summarization can identify significant information…

Computer Vision and Pattern Recognition · Computer Science · 2024-11-07 · Brian Chen, Xiangyuan Zhao, Yingnan Zhu

Query-adaptive Video Summarization via Quality-aware Relevance Estimation

Although the problem of automatic video summarization has recently received a lot of attention, the problem of creating a video summary that also highlights elements relevant to a search query has been less studied. We address this problem…

Computer Vision and Pattern Recognition · Computer Science · 2017-09-29 · Arun Balajee Vasudevan, Michael Gygli, Anna Volokitin, Luc Van Gool

Video Summarization in a Multi-View Camera Network

While most existing video summarization approaches aim to extract an informative summary of a single video, we propose a novel framework for summarizing multi-view videos by exploiting both intra- and inter-view content correlations in a…

Computer Vision and Pattern Recognition · Computer Science · 2016-08-02 · Rameswar Panda, Abir Das, Amit K. Roy-Chowdhury