Related papers: Toxicity Classification in Ukrainian

Detoxifying Language Models with a Toxic Corpus

Existing studies have investigated the tendency of autoregressive language models to generate contexts that exhibit undesired biases and toxicity. Various debiasing approaches have been proposed, which are primarily categorized into…

Computation and Language · Computer Science 2022-05-03 Yoon A Park, Frank Rudzicz

Methods for Detoxification of Texts for the Russian Language

We introduce the first study of automatic detoxification of Russian texts to combat offensive language. Such a kind of textual style transfer can be used, for instance, for processing toxic content in social media. While much work has been…

Computation and Language · Computer Science 2021-05-20 Daryna Dementieva, Daniil Moskovskiy, Varvara Logacheva, David Dale, Olga Kozlova, Nikita Semenov, Alexander Panchenko

Towards Building a Robust Toxicity Predictor

Recent NLP literature pays little attention to the robustness of toxicity language predictors, while these systems are most likely to be used in adversarial contexts. This paper presents a novel adversarial attack, ToxicTrap,…

Computation and Language · Computer Science 2024-04-16 Dmitriy Bespalov, Sourav Bhabesh, Yi Xiang, Liutong Zhou, Yanjun Qi

RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models

Pretrained neural language models (LMs) are prone to generating racist, sexist, or otherwise toxic language which hinders their safe deployment. We investigate the extent to which pretrained LMs can be prompted to generate toxic language,…

Computation and Language · Computer Science 2020-09-29 Samuel Gehman, Suchin Gururangan, Maarten Sap, Yejin Choi, Noah A. Smith

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

The reliability of multilingual Large Language Model (LLM) evaluation is currently compromised by the inconsistent quality of translated benchmarks. Existing resources often suffer from semantic drift and context loss, which can lead to…

Computation and Language · Computer Science 2026-02-26 Hanna Yukhymenko, Anton Alexandrov, Martin Vechev

Do Prompts Guarantee Safety? Mitigating Toxicity from LLM Generations through Subspace Intervention

Large Language Models (LLMs) are powerful text generators, yet they can produce toxic or harmful content even when given seemingly harmless prompts. This presents a serious safety challenge and can cause real-world harm. Toxicity is often…

Computation and Language · Computer Science 2026-02-09 Himanshu Singh, Ziwei Xu, A. V. Subramanyam, Mohan Kankanhalli

ToxiSpanSE: An Explainable Toxicity Detection in Code Review Comments

Background: The existence of toxic conversations in open-source platforms can degrade relationships among software developers and may negatively impact software product quality. To help mitigate this, some initial work has been done to…

Software Engineering · Computer Science 2023-07-10 Jaydeb Sarker, Sayma Sultana, Steven R. Wilson, Amiangshu Bosu

A Multi-Modal Multilingual Benchmark for Document Image Classification

Document image classification is different from plain-text document classification and consists of classifying a document by understanding the content and structure of documents such as forms, emails, and other such documents. We show that…

Computation and Language · Computer Science 2023-10-26 Yoshinari Fujinuma, Siddharth Varia, Nishant Sankaran, Srikar Appalaraju, Bonan Min, Yogarshi Vyas

Detecting Bias in Transfer Learning Approaches for Text Classification

Classification is an essential and fundamental task in machine learning, playing a cardinal role in the field of natural language processing (NLP) and computer vision (CV). In a supervised learning setting, labels are always needed for the…

Computation and Language · Computer Science 2021-02-04 Irene Li

Text Clustering as Classification with LLMs

Text clustering serves as a fundamental technique for organizing and interpreting unstructured textual data, particularly in contexts where manual annotation is prohibitively costly. With the rapid advancement of Large Language Models…

Computation and Language · Computer Science 2025-10-08 Chen Huang, Guoxiu He

Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation

Data contamination has garnered increased attention in the era of large language models (LLMs) due to the reliance on extensive internet-derived training corpora. The issue of training corpus overlap with evaluation benchmarks--referred to…

Computation and Language · Computer Science 2024-06-24 Chunyuan Deng, Yilun Zhao, Yuzhao Heng, Yitong Li, Jiannan Cao, Xiangru Tang, Arman Cohan

Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques

Ordinal Classification (OC) is a widely encountered challenge in Natural Language Processing (NLP), with applications in various domains such as sentiment analysis, rating prediction, and more. Previous approaches to tackle OC have…

Computation and Language · Computer Science 2024-05-21 Siva Rajesh Kasa, Aniket Goel, Karan Gupta, Sumegh Roychowdhury, Anish Bhanushali, Nikhil Pattisapu, Prasanna Srinivasa Murthy

Text classification dataset and analysis for Uzbek language

Text classification is an important task in Natural Language Processing (NLP), where the goal is to categorize text data into predefined classes. In this study, we analyse the dataset creation steps and evaluation techniques of multi-label…

Computation and Language · Computer Science 2023-03-01 Elmurod Kuriyozov, Ulugbek Salaev, Sanatbek Matlatipov, Gayrat Matlatipov

Building Multilingual Datasets for Predicting Mental Health Severity through LLMs: Prospects and Challenges

Large Language Models (LLMs) are increasingly being integrated into various medical fields, including mental health support systems. However, there is a gap in research regarding the effectiveness of LLMs in non-English mental health…

Computation and Language · Computer Science 2026-02-10 Konstantinos Skianis, John Pavlopoulos, A. Seza Doğruöz

Automated multilingual detection of Pro-Kremlin propaganda in newspapers and Telegram posts

The full-scale conflict between the Russian Federation and Ukraine generated an unprecedented amount of news articles and social media data reflecting opposing ideologies and narratives. These polarized campaigns have led to mutual…

Computation and Language · Computer Science 2023-01-26 Veronika Solopova, Oana-Iuliana Popescu, Christoph Benzmüller, Tim Landgraf

Analyzing Toxicity in Open Source Software Communications Using Psycholinguistics and Moral Foundations Theory

Studies have shown that toxic behavior can cause contributors to leave, and hinder newcomers' (especially from underrepresented communities) participation in Open Source Software (OSS) projects. Thus, detection of toxic language plays a…

Software Engineering · Computer Science 2025-01-28 Ramtin Ehsani, Rezvaneh Rezapour, Preetha Chatterjee

LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs

Evaluating cross-lingual knowledge transfer in large language models is challenging, as correct answers in a target language may arise either from genuine transfer or from prior exposure during pre-training. We present LiveCLKTBench, an…

Computation and Language · Computer Science 2026-04-21 Pei-Fu Guo, Yun-Da Tsai, Chun-Chia Hsu, Kai-Xin Chen, Ya-An Tsai, Kai-Wei Chang, Nanyun Peng, Mi-Yen Yeh, Shou-De Lin

An AI-Powered Research Assistant in the Lab: A Practical Guide for Text Analysis Through Iterative Collaboration with LLMs

Analyzing texts such as open-ended responses, headlines, or social media posts is a time- and labor-intensive process highly susceptible to bias. LLMs are promising tools for text analysis, using either a predefined (top-down) or a…

Computation and Language · Computer Science 2025-05-19 Gino Carmona-Díaz, William Jiménez-Leal, María Alejandra Grisales, Chandra Sripada, Santiago Amaya, Michael Inzlicht, Juan Pablo Bermúdez

Designing Evaluations of Machine Learning Models for Subjective Inference: The Case of Sentence Toxicity

Machine Learning (ML) is increasingly applied in real-life scenarios, raising concerns about bias in automatic decision making. We focus on bias as a notion of opinion exclusion, that stems from the direct application of traditional ML…

Machine Learning · Computer Science 2019-11-07 Agathe Balayn, Alessandro Bozzon

Wisdom of the LLM Crowd: A Large Scale Benchmark of Multi-Label U.S. Election-Related Harmful Social Media Content

The spread of election misinformation and harmful political content conveys misleading narratives and poses a serious threat to democratic integrity. Detecting harmful content at early stages is essential for understanding and potentially…

Human-Computer Interaction · Computer Science 2026-02-24 Qile Wang, Prerana Khatiwada, Carolina Coimbra Vieira, Benjamin E. Bagozzi, Kenneth E. Barner, Matthew Louis Mauriello