English

Multi-Modal Summary Generation using Multi-Objective Optimization

Information Retrieval 2020-05-20 v1

Abstract

Significant development of communication technology over the past few years has motivated research in multi-modal summarization techniques. A majority of the previous works on multi-modal summarization focus on text and images. In this paper, we propose a novel extractive multi-objective optimization based model to produce a multi-modal summary containing text, images, and videos. Important objectives such as intra-modality salience, cross-modal redundancy and cross-modal similarity are optimized simultaneously in a multi-objective optimization framework to produce effective multi-modal output. The proposed model has been evaluated separately for different modalities, and has been found to perform better than state-of-the-art approaches.

Keywords

Cite

@article{arxiv.2005.09252,
  title  = {Multi-Modal Summary Generation using Multi-Objective Optimization},
  author = {Anubhav Jangra and Sriparna Saha and Adam Jatowt and Mohammad Hasanuzzaman},
  journal= {arXiv preprint arXiv:2005.09252},
  year   = {2020}
}

Comments

5 pages, 2 figures

R2 v1 2026-06-23T15:39:05.054Z