Related papers: Reconstructive Sequence-Graph Network for Video Su…

Video Summarization with Long Short-term Memory

We propose a novel supervised learning technique for summarizing videos by automatically selecting keyframes or key subshots. Casting the problem as a structured prediction problem on sequential data, our main idea is to use Long Short-Term…

Computer Vision and Pattern Recognition · Computer Science 2016-08-01 Ke Zhang , Wei-Lun Chao , Fei Sha , Kristen Grauman

Hierarchical Recurrent Neural Network for Video Summarization

Exploiting the temporal dependency among video frames or subshots is very important for the task of video summarization. Practically, RNN is good at temporal dependency modeling, and has achieved overwhelming performance in many video-based…

Computer Vision and Pattern Recognition · Computer Science 2019-04-30 Bin Zhao , Xuelong Li , Xiaoqiang Lu

Language-Guided Graph Representation Learning for Video Summarization

With the rapid growth of video content on social media, video summarization has become a crucial task in multimedia processing. However, existing methods face challenges in capturing global dependencies in video content and accommodating…

Computer Vision and Pattern Recognition · Computer Science 2025-11-17 Wenrui Li , Wei Han , Hengyu Man , Wangmeng Zuo , Xiaopeng Fan , Yonghong Tian

SumGraph: Video Summarization via Recursive Graph Modeling

The goal of video summarization is to select keyframes that are visually diverse and can represent a whole story of an input video. State-of-the-art approaches for video summarization have mostly regarded the task as a frame-wise keyframe…

Computer Vision and Pattern Recognition · Computer Science 2020-07-20 Jungin Park , Jiyoung Lee , Ig-Jae Kim , Kwanghoon Sohn

From Thumbnails to Summaries - A single Deep Neural Network to Rule Them All

Video summaries come in many forms, from traditional single-image thumbnails, animated thumbnails, storyboards, to trailer-like video summaries. Content creators use the summaries to display the most attractive portion of their videos; the…

Multimedia · Computer Science 2018-08-03 Hongxiang Gu , Viswanathan Swaminathan

CNN-Based Prediction of Frame-Level Shot Importance for Video Summarization

In the Internet, ubiquitous presence of redundant, unedited, raw videos has made video summarization an important problem. Traditional methods of video summarization employ a heuristic set of hand-crafted features, which in many cases fail…

Computer Vision and Pattern Recognition · Computer Science 2017-08-24 Mohaiminul Al Nahian , A. S. M. Iftekhar , Mohammad Tariqul Islam , S. M. Mahbubur Rahman , Dimitrios Hatzinakos

SummaryNet: A Multi-Stage Deep Learning Model for Automatic Video Summarisation

Video summarisation can be posed as the task of extracting important parts of a video in order to create an informative summary of what occurred in the video. In this paper we introduce SummaryNet as a supervised learning framework for…

Computer Vision and Pattern Recognition · Computer Science 2020-02-24 Ziyad Jappie , David Torpey , Turgay Celik

MHSCNet: A Multimodal Hierarchical Shot-aware Convolutional Network for Video Summarization

Video summarization intends to produce a concise video summary by effectively capturing and combining the most informative parts of the whole content. Existing approaches for video summarization regard the task as a frame-wise keyframe…

Computer Vision and Pattern Recognition · Computer Science 2023-05-31 Wujiang Xu , Runzhong Wang , Xiaobo Guo , Shaoshuai Li , Qiongxu Ma , Yunan Zhao , Sheng Guo , Zhenfeng Zhu , Junchi Yan

Language-guided Recursive Spatiotemporal Graph Modeling for Video Summarization

Video summarization aims to select keyframes that are visually diverse and can represent the whole story of a given video. Previous approaches have focused on global interlinkability between frames in a video by temporal modeling. However,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-09 Jungin Park , Jiyoung Lee , Kwanghoon Sohn

Video Summarization Using Fully Convolutional Sequence Networks

This paper addresses the problem of video summarization. Given an input video, the goal is to select a subset of the frames to create a summary video that optimally captures the important information of the input video. With the large…

Computer Vision and Pattern Recognition · Computer Science 2018-09-03 Mrigank Rochan , Linwei Ye , Yang Wang

Reinforcement Learning for Unsupervised Video Summarization with Reward Generator Training

This paper presents a novel approach for unsupervised video summarization using reinforcement learning (RL), addressing limitations like unstable adversarial training and reliance on heuristic-based reward functions. The method operates on…

Multimedia · Computer Science 2025-12-24 Mehryar Abbasi , Hadi Hadizadeh , Parvaneh Saeedi

FullTransNet: Full Transformer with Local-Global Attention for Video Summarization

Video summarization aims to generate a compact, informative, and representative synopsis of raw videos, which is crucial for browsing, analyzing, and understanding video content. Dominant approaches in video summarization primarily rely on…

Computer Vision and Pattern Recognition · Computer Science 2025-08-08 Libin Lan , Lu Jiang , Tianshu Yu , Xiaojuan Liu , Zhongshi He

Video Summarization with Attention-Based Encoder-Decoder Networks

This paper addresses the problem of supervised video summarization by formulating it as a sequence-to-sequence learning problem, where the input is a sequence of original video frames, the output is a keyshot sequence. Our key idea is to…

Computer Vision and Pattern Recognition · Computer Science 2018-04-17 Zhong Ji , Kailin Xiong , Yanwei Pang , Xuelong Li

AudioVisual Video Summarization

Audio and vision are two main modalities in video data. Multimodal learning, especially for audiovisual learning, has drawn considerable attention recently, which can boost the performance of various computer vision tasks. However, in video…

Computer Vision and Pattern Recognition · Computer Science 2021-05-18 Bin Zhao , Maoguo Gong , Xuelong Li

Reservoir-Based Graph Convolutional Networks

Message passing is a core mechanism in Graph Neural Networks (GNNs), enabling the iterative update of node embeddings by aggregating information from neighboring nodes. Graph Convolutional Networks (GCNs) exemplify this approach by adapting…

Machine Learning · Computer Science 2026-03-26 Mayssa Soussia , Gita Ayu Salsabila , Mohamed Ali Mahjoub , Islem Rekik

Video Is Graph: Structured Graph Module for Video Action Recognition

In the field of action recognition, video clips are always treated as ordered frames for subsequent processing. To achieve spatio-temporal perception, existing approaches propose to embed adjacent temporal interaction in the convolutional…

Computer Vision and Pattern Recognition · Computer Science 2022-02-01 Rongchang Li , Xiao-Jun Wu , Tianyang Xu

Sliding Window Recurrent Network for Efficient Video Super-Resolution

Video super-resolution (VSR) is the task of restoring high-resolution frames from a sequence of low-resolution inputs. Different from single image super-resolution, VSR can utilize frames' temporal information to reconstruct results with…

Image and Video Processing · Electrical Eng. & Systems 2022-08-25 Wenyi Lian , Wenjing Lian

Semantic-Guided Unsupervised Video Summarization

Video summarization is a crucial technique for social understanding, enabling efficient browsing of massive multimedia content and extraction of key information from social platforms. Most existing unsupervised summarization methods rely on…

Artificial Intelligence · Computer Science 2026-01-22 Haizhou Liu , Haodong Jin , Yiming Wang , Hui Yu

A Critical Review of Recurrent Neural Networks for Sequence Learning

Countless learning tasks require dealing with sequential data. Image captioning, speech synthesis, and music generation all require that a model produce outputs that are sequences. In other domains, such as time series prediction, video…

Machine Learning · Computer Science 2015-10-20 Zachary C. Lipton , John Berkowitz , Charles Elkan

Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data

Conventional sequential learning methods such as Recurrent Neural Networks (RNNs) focus on interactions between consecutive inputs, i.e. first-order Markovian dependency. However, most of sequential data, as seen with videos, have complex…

Machine Learning · Computer Science 2020-01-22 Kyoung-Woon On , Eun-Sol Kim , Yu-Jung Heo , Byoung-Tak Zhang