Preference-based Interactive Multi-Document Summarisation

Yang Gao; Christian M. Meyer; Iryna Gurevych

Preference-based Interactive Multi-Document Summarisation

Computation and Language 2019-06-10 v1

Authors: Yang Gao , Christian M. Meyer , Iryna Gurevych

Abstract

Interactive NLP is a promising paradigm to close the gap between automatic NLP systems and the human upper bound. Preference-based interactive learning has been successfully applied, but the existing methods require several thousand interaction rounds even in simulations with perfect user feedback. In this paper, we study preference-based interactive summarisation. To reduce the number of interaction rounds, we propose the Active Preference-based ReInforcement Learning (APRIL) framework. APRIL uses Active Learning to query the user, Preference Learning to learn a summary ranking function from the preferences, and neural Reinforcement Learning to efficiently search for the (near-)optimal summary. Our results show that users can easily provide reliable preferences over summaries and that APRIL outperforms the state-of-the-art preference-based interactive method in both simulation and real-user experiments.

Keywords

reinforcement learning from human feedback recommendation system reinforcement learning

Cite

@article{arxiv.1906.02923,
  title  = {Preference-based Interactive Multi-Document Summarisation},
  author = {Yang Gao and Christian M. Meyer and Iryna Gurevych},
  journal= {arXiv preprint arXiv:1906.02923},
  year   = {2019}
}

Comments

Submitted to the special issue on "Learning from User Interactions", Information Retrieval Journal

Preference-based Interactive Multi-Document Summarisation

Abstract

Keywords

Cite

Comments

Related papers