Related papers: AI-Generated Text Detection and Classification Bas…

Detecting AI Generated Text Based on NLP and Machine Learning Approaches

Recent advances in natural language processing (NLP) may enable artificial intelligence (AI) models to generate writing that is identical to human written form in the future. This might have profound ethical, legal, and social…

Machine Learning · Computer Science 2024-04-17 Nuzhat Prova

AI Generated Text Detection

The rapid development of large language models has led to an increase in AI-generated text, with students increasingly using LLM-generated content as their own work, which violates academic integrity. This paper presents an evaluation of AI…

Computation and Language · Computer Science 2026-01-08 Adilkhan Alikhanov , Aidar Amangeldi , Diar Demeubay , Dilnaz Akhmetzhan , Nurbek Moldakhmetov , Omar Polat , Galymzhan Zharas

Research on Violent Text Detection System Based on BERT-fasttext Model

In the digital age of today, the internet has become an indispensable platform for people's lives, work, and information exchange. However, the problem of violent text proliferation in the network environment has arisen, which has brought…

Computation and Language · Computer Science 2024-12-24 Yongsheng Yang , Xiaoying Wang

Enhancing Grammatical Error Detection using BERT with Cleaned Lang-8 Dataset

This paper presents an improved LLM based model for Grammatical Error Detection (GED), which is a very challenging and equally important problem for many applications. The traditional approach to GED involved hand-designed features, but…

Computation and Language · Computer Science 2024-11-26 Rahul Nihalani , Kushal Shah

Large Language Model (LLM) AI text generation detection based on transformer deep learning algorithm

In this paper, a tool for detecting LLM AI text generation is developed based on the Transformer model, aiming to improve the accuracy of AI text generation detection and provide reference for subsequent research. Firstly the text is…

Computation and Language · Computer Science 2024-05-14 Yuhong Mo , Hao Qin , Yushan Dong , Ziyi Zhu , Zhenglin Li

Assessing Classical Machine Learning and Transformer-based Approaches for Detecting AI-Generated Research Text

The rapid adoption of large language models (LLMs) such as ChatGPT has blurred the line between human and AI-generated texts, raising urgent questions about academic integrity, intellectual property, and the spread of misinformation. Thus,…

Computation and Language · Computer Science 2025-09-26 Sharanya Parimanoharan , Ruwan D. Nawarathna

Fine-Grained Detection of AI-Generated Text Using Sentence-Level Segmentation

Generation of Artificial Intelligence (AI) texts in important works has become a common practice that can be used to misuse and abuse AI at various levels. Traditional AI detectors often rely on document-level classification, which…

Computation and Language · Computer Science 2025-09-24 Lekkala Sai Teja , Annepaka Yadagiri , Partha Pakray , Chukhu Chunka , Mangadoddi Srikar Vardhan

AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models

Large Language Models (LLMs) possess an extraordinary capability to produce text that is not only coherent and contextually relevant but also strikingly similar to human writing. They adapt to various styles and genres, producing content…

Computation and Language · Computer Science 2025-07-08 Chinnappa Guggilla , Budhaditya Roy , Trupti Ramdas Chavan , Abdul Rahman , Edward Bowen

Automated classification for open-ended questions with BERT

Manual coding of text data from open-ended questions into different categories is time consuming and expensive. Automated coding uses statistical/machine learning to train on a small subset of manually coded text answers. Recently,…

Applications · Statistics 2023-10-25 Hyukjun Gweon , Matthias Schonlau

Exploring the Capacity of a Large-scale Masked Language Model to Recognize Grammatical Errors

In this paper, we explore the capacity of a language model-based method for grammatical error detection in detail. We first show that 5 to 10% of training data are enough for a BERT-based error detection method to achieve performance…

Computation and Language · Computer Science 2021-08-30 Ryo Nagata , Manabu Kimura , Kazuaki Hanawa

Progress Notes Classification and Keyword Extraction using Attention-based Deep Learning Models with BERT

Various deep learning algorithms have been developed to analyze different types of clinical data including clinical text classification and extracting information from 'free text' and so on. However, automate the keyword extraction from the…

Computation and Language · Computer Science 2019-10-25 Matthew Tang , Priyanka Gandhi , Md Ahsanul Kabir , Christopher Zou , Jordyn Blakey , Xiao Luo

Assessing Text Classification Methods for Cyberbullying Detection on Social Media Platforms

Cyberbullying significantly contributes to mental health issues in communities by negatively impacting the psychology of victims. It is a prevalent problem on social media platforms, necessitating effective, real-time detection and…

Computation and Language · Computer Science 2024-12-31 Adamu Gaston Philipo , Doreen Sebastian Sarwatt , Jianguo Ding , Mahmoud Daneshmand , Huansheng Ning

How to Fine-Tune BERT for Text Classification?

Language model pre-training has proven to be useful in learning universal language representations. As a state-of-the-art language model pre-training model, BERT (Bidirectional Encoder Representations from Transformers) has achieved amazing…

Computation and Language · Computer Science 2020-02-06 Chi Sun , Xipeng Qiu , Yige Xu , Xuanjing Huang

BERTSel: Answer Selection with Pre-trained Models

Recently, pre-trained models have been the dominant paradigm in natural language processing. They achieved remarkable state-of-the-art performance across a wide range of related tasks, such as textual entailment, natural language inference,…

Computation and Language · Computer Science 2019-05-21 Dongfang Li , Yifei Yu , Qingcai Chen , Xinyu Li

Robust AI-Generated Text Detection by Restricted Embeddings

Growing amount and quality of AI-generated texts makes detecting such content more difficult. In most real-world scenarios, the domain (style and topic) of generated data and the generator model are not known in advance. In this work, we…

Computation and Language · Computer Science 2025-03-17 Kristian Kuznetsov , Eduard Tulchinskii , Laida Kushnareva , German Magai , Serguei Barannikov , Sergey Nikolenko , Irina Piontkovskaya

Ethic-BERT: An Enhanced Deep Learning Model for Ethical and Non-Ethical Content Classification

Developing AI systems capable of nuanced ethical reasoning is critical as they increasingly influence human decisions, yet existing models often rely on superficial correlations rather than principled moral understanding. This paper…

Computers and Society · Computer Science 2025-10-16 Mahamodul Hasan Mahadi , Md. Nasif Safwan , Souhardo Rahman , Shahnaj Parvin , Aminun Nahar , Kamruddin Nur

FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT

Machine based text comprehension has always been a significant research field in natural language processing. Once a full understanding of the text context and semantics is achieved, a deep learning model can be trained to solve a large…

Computation and Language · Computer Science 2020-09-03 Omar Mossad , Amgad Ahmed , Anandharaju Raju , Hari Karthikeyan , Zayed Ahmed

Findings of the Counter Turing Test: AI-Generated Text Detection

The growing capability of large language models to produce fluent, contextually coherent text has created mounting pressure on the systems and institutions responsible for ensuring the authenticity of digital content. Advanced generative…

Computation and Language · Computer Science 2026-05-26 Rajarshi Roy , Gurpreet Singh , Ashhar Aziz , Shashwat Bajpai , Nasrin Imanpour , Shwetangshu Biswas , Kapil Wanaskar , Parth Patwa , Subhankar Ghosh , Shreyas Dixit , Nilesh Ranjan Pal , Vipula Rawte , Ritvik Garimella , Amitava Das , Amit Sheth , Vasu Sharma , Aishwarya Naresh Reganti , Vinija Jain , Aman Chadha

Semantic Answer Type Prediction using BERT: IAI at the ISWC SMART Task 2020

This paper summarizes our participation in the SMART Task of the ISWC 2020 Challenge. A particular question we are interested in answering is how well neural methods, and specifically transformer models, such as BERT, perform on the answer…

Computation and Language · Computer Science 2021-09-15 Vinay Setty , Krisztian Balog

Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model

Disparate biases associated with datasets and trained classifiers in hateful and abusive content identification tasks have raised many concerns recently. Although the problem of biased datasets on abusive language detection has been…

Social and Information Networks · Computer Science 2021-01-27 Marzieh Mozafari , Reza Farahbakhsh , Noel Crespi