Related papers: Algorithmic Detection of Computer Generated Text

Unsupervised and Distributional Detection of Machine-Generated Text

The power of natural language generation models has provoked a flurry of interest in automatic methods to detect if a piece of text is human or machine-authored. The problem so far has been framed in a standard supervised way and consists…

Computation and Language · Computer Science 2021-11-05 Matthias Gallé, Jos Rozen, Germán Kruszewski, Hady Elsahar

Machine Learning in Automated Text Categorization

The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize…

Information Retrieval · Computer Science 2021-09-21 Fabrizio Sebastiani

Comparing the writing style of real and artificial papers

Recent years have witnessed the increase of competition in science. While promoting the quality of research in many cases, an intense competition among scientists can also trigger unethical scientific behaviors. To increase the total number…

Computation and Language · Computer Science 2016-02-22 Diego R. Amancio

Automatic Detection of Machine Generated Text: A Critical Survey

Text generative models (TGMs) excel in producing text that matches the style of human language reasonably well. Such TGMs can be misused by adversaries, e.g., by automatically generating fake news and fake product reviews that can look…

Computation and Language · Computer Science 2020-11-04 Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan

Automatic Detection of Generated Text is Easiest when Humans are Fooled

Recent advancements in neural language modelling make it possible to rapidly generate vast amounts of human-sounding text. The capabilities of humans and automatic discriminators to detect machine-generated text have been a large source of…

Computation and Language · Computer Science 2020-05-11 Daphne Ippolito, Daniel Duckworth, Chris Callison-Burch, Douglas Eck

Using Machine Learning to Distinguish Human-written from Machine-generated Creative Fiction

Following the universal availability of generative AI systems with the release of ChatGPT, automatic detection of deceptive text created by Large Language Models has focused on domains such as academic plagiarism and "fake news". However,…

Computation and Language · Computer Science 2024-12-23 Andrea Cristina McGlinchey, Peter J Barclay

Detection of Fake Generated Scientific Abstracts

The widespread adoption of Large Language Models and publicly available ChatGPT has marked a significant turning point in the integration of Artificial Intelligence into people's everyday lives. The academic community has taken notice of…

Computation and Language · Computer Science 2024-05-01 Panagiotis C. Theocharopoulos, Panagiotis Anagnostou, Anastasia Tsoukala, Spiros V. Georgakopoulos, Sotiris K. Tasoulis, Vassilis P. Plagianakos

Computerized document classification already orders the news articles that Apple's "News" app or Google's "personalized search" feature groups together to match a reader's interests. The invisible and therefore illegible decisions that go…

Computation and Language · Computer Science 2018-12-17 Ashley Lee, Jo Guldi, Andras Zsom

Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing

The growing use of large language models (LLMs) for text generation has led to widespread concerns about AI-generated content detection. However, an overlooked challenge is AI-polished text, where human-written content undergoes subtle…

Computation and Language · Computer Science 2025-05-06 Shoumik Saha, Soheil Feizi

ChatGPT or academic scientist? Distinguishing authorship with over 99% accuracy using off-the-shelf machine learning tools

ChatGPT has enabled access to AI-generated writing for the masses, and within just a few months, this product has disrupted the knowledge economy, initiating a culture shift in the way people work, learn, and write. The need to discriminate…

Machine Learning · Computer Science 2023-03-30 Heather Desaire, Aleesa E. Chua, Madeline Isom, Romana Jarosova, David Hua

Testing of Detection Tools for AI-Generated Text

Recent advances in generative pre-trained transformer large language models have emphasised the potential risks of unfair use of artificial intelligence (AI) generated content in an academic environment and intensified efforts in searching…

Computation and Language · Computer Science 2023-12-27 Debora Weber-Wulff, Alla Anohina-Naumeca, Sonja Bjelobaba, Tomáš Foltýnek, Jean Guerrero-Dib, Olumide Popoola, Petr Šigut, Lorna Waddington

A pipeline and comparative study of 12 machine learning models for text classification

Text-based communication is highly favoured as a communication method, especially in business environments. As a result, it is often abused by sending malicious messages, e.g., spam emails, to deceive users into relaying personal…

Information Retrieval · Computer Science 2022-04-14 Annalisa Occhipinti, Louis Rogers, Claudio Angione

Mixture of Detectors: A Compact View of Machine-Generated Text Detection

Large Language Models (LLMs) are gearing up to surpass human creativity. The veracity of the statement needs careful consideration. In recent developments, critical questions arise regarding the authenticity of human work and the…

Computation and Language · Computer Science 2025-09-29 Sai Teja Lekkala, Yadagiri Annepaka, Arun Kumar Challa, Samatha Reddy Machireddy, Partha Pakray, Chukhu Chunka

Mitigating Human and Computer Opinion Fraud via Contrastive Learning

We introduce the novel approach towards fake text reviews detection in collaborative filtering recommender systems. The existing algorithms concentrate on detecting the fake reviews, generated by language models and ignore the texts,…

Artificial Intelligence · Computer Science 2023-01-10 Yuliya Tukmacheva, Ivan Oseledets, Evgeny Frolov

Exploring the Limitations of Detecting Machine-Generated Text

Recent improvements in the quality of the generations by large language models have spurred research into identifying machine-generated text. Such work often presents high-performing detectors. However, humans and machines can produce text…

Computation and Language · Computer Science 2024-12-13 Jad Doughman, Osama Mohammed Afzal, Hawau Olamide Toyin, Shady Shehata, Preslav Nakov, Zeerak Talat

Robust Detection of LLM-Generated Text: A Comparative Analysis

The ability of large language models to generate complex texts allows them to be widely integrated into many aspects of life, and their output can quickly fill all network resources. As the impact of LLMs grows, it becomes increasingly…

Computation and Language · Computer Science 2024-11-12 Yongye Su, Yuqing Wu

Automated Content Grading Using Machine Learning

Grading of examination papers is a hectic, time-labor intensive task and is often subjected to inefficiency and bias in checking. This research project is a primitive experiment in the automation of grading of theoretical answers written in…

Machine Learning · Computer Science 2020-04-21 Rahul Kr Chauhan, Ravinder Saharan, Siddhartha Singh, Priti Sharma

I Know You Did Not Write That! A Sampling Based Watermarking Method for Identifying Machine Generated Text

Potential harms of Large Language Models such as mass misinformation and plagiarism can be partially mitigated if there exists a reliable way to detect machine generated text. In this paper, we propose a new watermarking method to detect…

Computation and Language · Computer Science 2023-12-12 Kaan Efe Keleş, Ömer Kaan Gürbüz, Mucahid Kutlu

Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

Large language models (LLMs) have advanced to a point that even humans have difficulty discerning whether a text was generated by another human, or by a computer. However, knowing whether a text was produced by human or artificial…

Computation and Language · Computer Science 2025-04-15 Kathleen C. Fraser, Hillary Dawkins, Svetlana Kiritchenko

Interpretable Text Classification Applied to the Detection of LLM-generated Creative Writing

We consider the problem of distinguishing human-written creative fiction (excerpts from novels) from similar text generated by an LLM. Our results show that, while human observers perform poorly (near chance levels) on this binary…

Computation and Language · Computer Science 2026-01-13 Minerva Suvanto, Andrea McGlinchey, Mattias Wahde, Peter J Barclay