Related papers: Document Author Classification Using Parsed Langua…

Text Classification For Authorship Attribution Analysis

Authorship attribution mainly deals with undecided authorship of literary texts. Authorship attribution is useful for resolving uncertain authorship, recognizing the authorship of unknown texts, spotting plagiarism, and so on. Statistical…

Digital Libraries · Computer Science 2013-10-21 M. Sudheep Elayidom, Chinchu Jose, Anitta Puthussery, Neenu K Sasi

Authorship recognition via fluctuation analysis of network topology and word intermittency

Statistical methods have been widely employed in many practical natural language processing applications. More specifically, complex networks concepts and methods from dynamical systems theory have been successfully applied to recognize…

Computation and Language · Computer Science 2015-03-04 Diego R. Amancio

A Machine Learning Framework for Authorship Identification From Texts

Authorship identification is a process in which the author of a text is identified. Most known literary texts can easily be attributed to a certain author because they are, for example, signed. Yet sometimes we find unfinished pieces of…

Computation and Language · Computer Science 2019-12-24 Rahul Radhakrishnan Iyer, Carolyn Penstein Rose

Natural Language Parsing as Statistical Pattern Recognition

Traditional natural language parsers are based on rewrite rule systems developed in an arduous, time-consuming manner by grammarians. A majority of the grammarian's efforts are devoted to the disambiguation process, first hypothesizing…

cmp-lg · Computer Science 2016-08-31 David M. Magerman

Neural Deepfake Detection with Factual Structure of Text

Deepfake detection, the task of automatically discriminating machine-generated text, is increasingly critical with recent advances in natural language generative models. Existing approaches to deepfake detection typically represent…

Computation and Language · Computer Science 2020-10-16 Wanjun Zhong, Duyu Tang, Zenan Xu, Ruize Wang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

Explainable Authorship Verification in Social Media via Attention-based Similarity Learning

Authorship verification is the task of analyzing the linguistic patterns of two or more texts to determine whether they were written by the same author or not. The analysis is traditionally performed by experts who consider linguistic…

Computation and Language · Computer Science 2019-11-21 Benedikt Boenninghoff, Steffen Hessler, Dorothea Kolossa, Robert M. Nickel

Authorship Attribution Using Word Network Features

In this paper, we explore a set of novel features for authorship attribution of documents. These features are derived from a word network representation of natural language text. As has been noted in previous studies, natural language tends…

Computation and Language · Computer Science 2013-11-14 Shibamouli Lahiri, Rada Mihalcea

Classifying informative and imaginative prose using complex networks

Statistical methods have been widely employed in recent years to grasp many language properties. The application of such techniques has allowed an improvement of several linguistic applications, which encompass machine translation,…

Computation and Language · Computer Science 2016-02-22 Henrique F. de Arruda, Luciano da F. Costa, Diego R. Amancio

On the role of words in the network structure of texts: application to authorship attribution

Well-established automatic analyses of texts mainly consider frequencies of linguistic units, e.g. letters, words and bigrams, while methods based on co-occurrence networks consider the structure of texts regardless of the nodes label (i.e.…

Computation and Language · Computer Science 2018-02-27 Camilo Akimushkin, Diego R. Amancio, Osvaldo N. Oliveira

Few-Shot Detection of Machine-Generated Text using Style Representations

The advent of instruction-tuned language models that convincingly mimic human writing poses a significant risk of abuse. However, such abuse may be counteracted with the ability to detect whether a piece of text was composed by a language…

Computation and Language · Computer Science 2024-05-09 Rafael Rivera Soto, Kailin Koch, Aleem Khan, Barry Chen, Marcus Bishop, Nicholas Andrews

Learning Stylometric Representations for Authorship Analysis

Authorship analysis (AA) is the study of unveiling the hidden properties of authors from a body of exponentially exploding textual data. It extracts an author's identity and sociolinguistic characteristics based on the reflected writing…

Computation and Language · Computer Science 2016-06-06 Steven H. H. Ding, Benjamin C. M. Fung, Farkhund Iqbal, William K. Cheung

Authorship verification tries to answer the question if two documents with unknown authors were written by the same author or not. A range of successful technical approaches has been proposed for this task, many of which are based on…

Computation and Language · Computer Science 2019-08-22 Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

Authorship Verification - An Approach based on Random Forest

Authorship attribution, being an important problem in many areas including information retrieval, computational linguistics, law and journalism etc., has been identified as a subject of increasing research interest in recent years.…

Computation and Language · Computer Science 2016-08-01 Promita Maitra, Souvick Ghosh, Dipankar Das

Finding Structure in Text, Genome and Other Symbolic Sequences

The statistical methods derived and described in this thesis provide new ways to elucidate the structural properties of text and other symbolic sequences. Generically, these methods allow detection of a difference in the frequency of a…

Computation and Language · Computer Science 2012-07-10 Ted Dunning

Unsupervised and Distributional Detection of Machine-Generated Text

The power of natural language generation models has provoked a flurry of interest in automatic methods to detect if a piece of text is human or machine-authored. The problem so far has been framed in a standard supervised way and consists…

Computation and Language · Computer Science 2021-11-05 Matthias Gallé, Jos Rozen, Germán Kruszewski, Hady Elsahar

The Sensitivity of Word Embeddings-based Author Detection Models to Semantic-preserving Adversarial Perturbations

Authorship analysis is an important subject in the field of natural language processing. It allows the detection of the most likely writer of articles, news, books, or messages. This technique has multiple uses in tasks related to…

Computation and Language · Computer Science 2021-02-25 Jeremiah Duncan, Fabian Fallas, Chris Gropp, Emily Herron, Maria Mahbub, Paula Olaya, Eduardo Ponce, Tabitha K. Samuel, Daniel Schultz, Sudarshan Srinivasan, Maofeng Tang, Viktor Zenkov, Quan Zhou, Edmon Begoli

An Information-Theoretic Approach for Detecting Edits in AI-Generated Text

We propose a method to determine whether a given article was written entirely by a generative language model or perhaps contains edits by a different author, possibly a human. Our process involves multiple tests for the origin of individual…

Information Theory · Computer Science 2024-08-27 Idan Kashtan, Alon Kipnis

Authorship Attribution Based on Life-Like Network Automata

The authorship attribution is a problem of considerable practical and technical interest. Several methods have been designed to infer the authorship of disputed documents in multiple contexts. While traditional statistical methods based…

Computation and Language · Computer Science 2018-03-28 Jeaneth Machicao, Edilson A. Corrêa, Gisele H. B. Miranda, Diego R. Amancio, Odemir M. Bruno

A stylometric analysis of speaker attribution from speech transcripts

Forensic scientists often need to identify an unknown speaker or writer in cases such as ransom calls, covert recordings, alleged suicide notes, or anonymous online communications, among many others. Speaker recognition in the speech domain…

Computation and Language · Computer Science 2025-12-19 Cristina Aggazzotti, Elizabeth Allyn Smith

Distinguishing Fictional Voices: a Study of Authorship Verification Models for Quotation Attribution

Recent approaches to automatically detect the speaker of an utterance of direct speech often disregard general information about characters in favor of local information found in the context, such as surrounding mentions of entities. In…

Computation and Language · Computer Science 2024-01-31 Gaspard Michel, Elena V. Epure, Romain Hennequin, Christophe Cerisara