Unsupervised authorship attribution

Computation and Language 2015-03-27 v1

Abstract

We describe a technique for attributing parts of a written text to a set of unknown authors. Nothing is assumed to be known a priori about the writing styles of potential authors. We use multiple independent clusterings of an input text to identify parts that are similar and dissimilar to one another. We describe the algorithms needed to combine the multiple clusterings into a meaningful output. We show results of applying the technique to texts that contain multiple writing styles.
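
The abstract does not say how the multiple clusterings are combined, so the following is only an illustrative sketch, not the authors' algorithm. It assumes a common consensus approach (co-association, also called evidence accumulation): count how often each pair of text segments lands in the same cluster, then group segments that co-occur in more than half of the clusterings. The function names, the 0.5 threshold, and the sample labelings are all hypothetical.

```python
from itertools import combinations


def co_association(labelings):
    """Return, for each pair of segments, the fraction of clusterings
    in which both segments were assigned to the same cluster."""
    n = len(labelings[0])
    scores = {}
    for i, j in combinations(range(n), 2):
        together = sum(1 for labels in labelings if labels[i] == labels[j])
        scores[(i, j)] = together / len(labelings)
    return scores


def consensus_groups(labelings, threshold=0.5):
    """Merge segments whose co-association exceeds the threshold,
    using a small union-find structure (assumed combination rule)."""
    n = len(labelings[0])
    parent = list(range(n))

    def find(x):
        # Follow parent links to the root, compressing the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (i, j), score in co_association(labelings).items():
        if score > threshold:
            parent[find(j)] = find(i)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Three hypothetical clusterings of five text segments.
# Label values are arbitrary within each clustering; only co-membership matters.
labelings = [
    [0, 0, 1, 1, 1],
    [1, 1, 0, 0, 1],
    [0, 0, 0, 1, 1],
]
groups = consensus_groups(labelings)
print(groups)  # [[0, 1], [2, 3, 4]]
```

In this toy input, segments 0 and 1 share a cluster in all three clusterings, while segments 2, 3, and 4 are linked through pairs that agree in two of three, so the sketch reports two candidate writing styles.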

Cite

@article{arxiv.1503.07613,
  title  = {Unsupervised authorship attribution},
  author = {David Fifield and Torbjørn Follan and Emil Lunde},
  journal= {arXiv preprint arXiv:1503.07613},
  year   = {2015}
}