Related papers: Toxicity Classification in Ukrainian

Something Just Like TRuST : Toxicity Recognition of Span and Target

Toxic language includes content that is offensive, abusive, or that promotes harm. Progress in preventing toxic output from large language models (LLMs) is hampered by inconsistent definitions of toxicity. We introduce TRuST, a large-scale…

Computation and Language · Computer Science 2026-01-07 Berk Atil , Namrata Sureddy , Rebecca J. Passonneau

Char-RNN for Word Stress Detection in East Slavic Languages

We explore how well a sequence labeling approach, namely, recurrent neural network, is suited for the task of resource-poor and POS tagging free word stress detection in the Russian, Ukranian, Belarusian languages. We present new datasets,…

Computation and Language · Computer Science 2023-10-04 Ekaterina Chernyak , Maria Ponomareva , Kirill Milintsevich

Detecting Toxic Language: Ontology and BERT-based Approaches for Bulgarian Text

Toxic content detection in online communication remains a significant challenge, with current solutions often inadvertently blocking valuable information, including medical terms and text related to minority groups. This paper presents a…

Computation and Language · Computer Science 2026-04-03 Melania Berbatova , Tsvetoslav Vasev

Interpretable Multi Labeled Bengali Toxic Comments Classification using Deep Learning

This paper presents a deep learning-based pipeline for categorizing Bengali toxic comments, in which at first a binary classification model is used to determine whether a comment is toxic or not, and then a multi-label classifier is…

Computation and Language · Computer Science 2023-04-21 Tanveer Ahmed Belal , G. M. Shahariar , Md. Hasanul Kabir

Toxicity Inspector: A Framework to Evaluate Ground Truth in Toxicity Detection Through Feedback

Toxic language is difficult to define, as it is not monolithic and has many variations in perceptions of toxicity. This challenge of detecting toxic language is increased by the highly contextual and subjectivity of its interpretation,…

Computation and Language · Computer Science 2023-05-19 Huriyyah Althunayan , Rahaf Bahlas , Manar Alharbi , Lena Alsuwailem , Abeer Aldayel , Rehab ALahmadi

Evaluating Text Style Transfer: A Nine-Language Benchmark for Text Detoxification

Despite notable advances in large language models (LLMs), reliable evaluation of text generation tasks such as text style transfer (TST) remains an open challenge. Existing research has shown that automatic metrics often correlate poorly…

Computation and Language · Computer Science 2026-03-05 Vitaly Protasov , Nikolay Babakov , Daryna Dementieva , Alexander Panchenko

Exploring Multimodal Challenges in Toxic Chinese Detection: Taxonomy, Benchmark, and Findings

Detecting toxic content using language models is important but challenging. While large language models (LLMs) have demonstrated strong performance in understanding Chinese, recent studies show that simple character substitutions in toxic…

Computation and Language · Computer Science 2025-06-02 Shujian Yang , Shiyao Cui , Chuanrui Hu , Haicheng Wang , Tianwei Zhang , Minlie Huang , Jialiang Lu , Han Qiu

Same Same, But Different: Conditional Multi-Task Learning for Demographic-Specific Toxicity Detection

Algorithmic bias often arises as a result of differential subgroup validity, in which predictive relationships vary across groups. For example, in toxic language detection, comments targeting different demographic groups can vary markedly…

Machine Learning · Computer Science 2023-03-08 Soumyajit Gupta , Sooyong Lee , Maria De-Arteaga , Matthew Lease

Moral Semantics Survive Machine Translation: Cross-Lingual Evidence from Moral Foundations Corpora

Moral language is subtle and culturally variable, making it difficult to translate faithfully across languages. Idiomatic expressions, slang, and cultural references introduce hard-to-avoid translation artifacts. Yet automated moral values…

Computation and Language · Computer Science 2026-05-22 Maciej Skorski

Soft-Label Integration for Robust Toxicity Classification

Toxicity classification in textual content remains a significant problem. Data with labels from a single annotator fall short of capturing the diversity of human perspectives. Therefore, there is a growing need to incorporate crowdsourced…

Artificial Intelligence · Computer Science 2024-11-11 Zelei Cheng , Xian Wu , Jiahao Yu , Shuo Han , Xin-Qiang Cai , Xinyu Xing

Universal Cross-Lingual Text Classification

Text classification, an integral task in natural language processing, involves the automatic categorization of text into predefined classes. Creating supervised labeled datasets for low-resource languages poses a considerable challenge.…

Computation and Language · Computer Science 2024-06-18 Riya Savant , Anushka Shelke , Sakshi Todmal , Sanskruti Kanphade , Ananya Joshi , Raviraj Joshi

Concept-Based Interpretability for Toxicity Detection

The rise of social networks has not only facilitated communication but also allowed the spread of harmful content. Although significant advances have been made in detecting toxic language in textual data, the exploration of concept-based…

Computation and Language · Computer Science 2025-12-16 Samarth Garg , Divya Singh , Deeksha Varshney , Mamta

Benchmark on Peer Review Toxic Detection: A Challenging Task with a New Dataset

Peer review is crucial for advancing and improving science through constructive criticism. However, toxic feedback can discourage authors and hinder scientific progress. This work explores an important but underexplored area: detecting…

Computation and Language · Computer Science 2025-02-05 Man Luo , Bradley Peterson , Rafael Gan , Hari Ramalingame , Navya Gangrade , Ariadne Dimarogona , Imon Banerjee , Phillip Howard

Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmarks

The widespread dissemination of toxic online posts is increasingly damaging to society. However, research on detecting toxic language in Chinese has lagged significantly. Existing datasets lack fine-grained annotation of toxic types and…

Computation and Language · Computer Science 2023-05-09 Junyu Lu , Bo Xu , Xiaokun Zhang , Changrong Min , Liang Yang , Hongfei Lin

Transductive Learning with String Kernels for Cross-Domain Text Classification

For many text classification tasks, there is a major problem posed by the lack of labeled data in a target domain. Although classifiers for a target domain can be trained on labeled text data from a related source domain, the accuracy of…

Computation and Language · Computer Science 2018-11-06 Radu Tudor Ionescu , Andrei M. Butnaru

Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data

Machine learning-based classifiers have been used for text classification, such as sentiment analysis, news classification, and toxic comment classification. However, supervised machine learning models often require large amounts of labeled…

Computation and Language · Computer Science 2025-05-06 Yejian Zhang , Shingo Takada

Rethinking Suicidal Ideation Detection: A Trustworthy Annotation Framework and Cross-Lingual Model Evaluation

Suicidal ideation detection is critical for real-time suicide prevention, yet its progress faces two under-explored challenges: limited language coverage and unreliable annotation practices. Most available datasets are in English, but even…

Computation and Language · Computer Science 2025-07-22 Amina Dzafic , Merve Kavut , Ulya Bayram

Text Detoxification: Data Efficiency, Semantic Preservation and Model Generalization

The widespread dissemination of toxic content on social media poses a serious threat to both online environments and public discourse, highlighting the urgent need for detoxification methods that effectively remove toxicity while preserving…

Machine Learning · Computer Science 2025-07-08 Jing Yu , Yibo Zhao , Jiapeng Zhu , Wenming Shao , Bo Pang , Zhao Zhang , Xiang Li

Toxicity Detection can be Sensitive to the Conversational Context

User posts whose perceived toxicity depends on the conversational context are rare in current toxicity detection datasets. Hence, toxicity detectors trained on existing datasets will also tend to disregard context, making the detection of…

Computation and Language · Computer Science 2021-11-22 Alexandros Xenos , John Pavlopoulos , Ion Androutsopoulos , Lucas Dixon , Jeffrey Sorensen , Leo Laugier

Unified Game Moderation: Soft-Prompting and LLM-Assisted Label Transfer for Resource-Efficient Toxicity Detection

Toxicity detection in gaming communities faces significant scaling challenges when expanding across multiple games and languages, particularly in real-time environments where computational efficiency is crucial. We present two key findings…

Computation and Language · Computer Science 2025-06-10 Zachary Yang , Domenico Tullo , Reihaneh Rabbany