English

Identifying Automatically Generated Headlines using Transformers

Computation and Language 2021-04-27 v3 Computers and Society Machine Learning

Abstract

False information spread via the internet and social media influences public opinion and user activity, while generative models enable fake content to be generated faster and more cheaply than had previously been possible. In the not so distant future, identifying fake content generated by deep learning models will play a key role in protecting users from misinformation. To this end, a dataset containing human and computer-generated headlines was created and a user study indicated that humans were only able to identify the fake headlines in 47.8% of the cases. However, the most accurate automatic approach, transformers, achieved an overall accuracy of 85.7%, indicating that content generated from language models can be filtered out accurately.

Keywords

Cite

@article{arxiv.2009.13375,
  title  = {Identifying Automatically Generated Headlines using Transformers},
  author = {Antonis Maronikolakis and Hinrich Schutze and Mark Stevenson},
  journal= {arXiv preprint arXiv:2009.13375},
  year   = {2021}
}

Comments

NLP4IF 2021 Proceedings, NAACL 2021

R2 v1 2026-06-23T18:50:59.453Z