Related papers: Toxicity Classification in Ukrainian

Measuring Misogyny in Natural Language Generation: Preliminary Results from a Case Study on two Reddit Communities

Generic `toxicity' classifiers continue to be used for evaluating the potential for harm in natural language generation, despite mounting evidence of their shortcomings. We consider the challenge of measuring misogyny in natural language…

Computation and Language · Computer Science 2023-12-07 Aaron J. Snoswell , Lucinda Nelson , Hao Xue , Flora D. Salim , Nicolas Suzor , Jean Burgess

Investigating Data Contamination in Modern Benchmarks for Large Language Models

Recent observations have underscored a disparity between the inflated benchmark scores and the actual performance of LLMs, raising concerns about potential contamination of evaluation benchmarks. This issue is especially critical for…

Computation and Language · Computer Science 2024-04-05 Chunyuan Deng , Yilun Zhao , Xiangru Tang , Mark Gerstein , Arman Cohan

MultiEURLEX -- A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer

We introduce MULTI-EURLEX, a new multilingual dataset for topic classification of legal documents. The dataset comprises 65k European Union (EU) laws, officially translated in 23 languages, annotated with multiple labels from the EUROVOC…

Computation and Language · Computer Science 2021-09-08 Ilias Chalkidis , Manos Fergadiotis , Ion Androutsopoulos

Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data

As language models (LMs) deliver increasing performance on a range of NLP tasks, probing classifiers have become an indispensable technique in the effort to better understand their inner workings. A typical setup involves (1) defining an…

Computation and Language · Computer Science 2024-08-01 Charles Jin , Martin Rinard

Automated Word Stress Detection in Russian

In this study we address the problem of automated word stress detection in Russian using character level models and no part-speech-taggers. We use a simple bidirectional RNN with LSTM nodes and achieve the accuracy of 90% or higher. We…

Computation and Language · Computer Science 2019-07-15 Maria Ponomareva , Kirill Milintsevich , Ekaterina Chernyak , Anatoly Starostin

Defining and Detecting Toxicity on Social Media: Context and Knowledge are Key

Online platforms have become an increasingly prominent means of communication. Despite the obvious benefits to the expanded distribution of content, the last decade has resulted in disturbing toxic communication, such as cyberbullying and…

Social and Information Networks · Computer Science 2023-09-04 Amit Sheth , Valerie L. Shalin , Ugur Kursuncu

Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

Not all topics are equally "flammable" in terms of toxicity: a calm discussion of turtles or fishing less often fuels inappropriate toxic dialogues than a discussion of politics or sexual minorities. We define a set of sensitive topics that…

Computation and Language · Computer Science 2021-03-10 Nikolay Babakov , Varvara Logacheva , Olga Kozlova , Nikita Semenov , Alexander Panchenko

An Empirical Investigation of Learning from Biased Toxicity Labels

Collecting annotations from human raters often results in a trade-off between the quantity of labels one wishes to gather and the quality of these labels. As such, it is often only possible to gather a small amount of high-quality labels.…

Machine Learning · Computer Science 2021-10-05 Neel Nanda , Jonathan Uesato , Sven Gowal

The Tokenizer Tax Across 25 European Languages: Domain Invariance, Cross-Lingual Few-Shot Effects, and the Ukrainian Penalty

Tokenizer fertility the number of tokens per word imposes a hidden cost on non-English NLP. We measure fertility for ten foundation models across 25 European languages on parallel text, producing the first controlled tokenizer tax map for…

Computation and Language · Computer Science 2026-05-26 Volodymyr Ovcharov

Automated Utterance Labeling of Conversations Using Natural Language Processing

Conversational data is essential in psychology because it can help researchers understand individuals cognitive processes, emotions, and behaviors. Utterance labelling is a common strategy for analyzing this type of data. The development of…

Computation and Language · Computer Science 2022-08-16 Maria Laricheva , Chiyu Zhang , Yan Liu , Guanyu Chen , Terence Tracey , Richard Young , Giuseppe Carenini

Beyond Next Token Probabilities: Learnable, Fast Detection of Hallucinations and Data Contamination on LLM Output Distributions

The automated detection of hallucinations and training data contamination is pivotal to the safe deployment of Large Language Models (LLMs). These tasks are particularly challenging in settings where no access to model internals is…

Machine Learning · Computer Science 2025-10-01 Guy Bar-Shalom , Fabrizio Frasca , Derek Lim , Yoav Gelberg , Yftah Ziser , Ran El-Yaniv , Gal Chechik , Haggai Maron

Semantic Shifts of Psychological Concepts in Scientific and Popular Media Discourse: A Distributional Semantics Analysis of Russian-Language Corpora

This article examines semantic shifts in psychological concepts across scientific and popular media discourse using methods of distributional semantics applied to Russian-language corpora. Two corpora were compiled: a scientific corpus of…

Computation and Language · Computer Science 2026-04-02 Orlova Anastasia

Combining Self-Training and Self-Supervised Learning for Unsupervised Disfluency Detection

Most existing approaches to disfluency detection heavily rely on human-annotated corpora, which is expensive to obtain in practice. There have been several proposals to alleviate this issue with, for instance, self-supervised learning…

Computation and Language · Computer Science 2020-10-30 Shaolei Wang , Zhongyuan Wang , Wanxiang Che , Ting Liu

IYKYK: Using language models to decode extremist cryptolects

Extremist groups develop complex in-group language, also referred to as cryptolects, to exclude or mislead outsiders. We investigate the ability of current language technologies to detect and interpret the cryptolects of two online…

Computation and Language · Computer Science 2025-06-09 Christine de Kock , Arij Riabi , Zeerak Talat , Michael Sejr Schlichtkrull , Pranava Madhyastha , Ed Hovy

Text Detoxification in isiXhosa and Yor\`ub\'a: A Cross-Lingual Machine Learning Approach for Low-Resource African Languages

Toxic language is one of the major barrier to safe online participation, yet robust mitigation tools are scarce for African languages. This study addresses this critical gap by investigating automatic text detoxification (toxic to neutral…

Computation and Language · Computer Science 2026-01-12 Abayomi O. Agbeyangi

DetoxLLM: A Framework for Detoxification with Explanations

Prior works on detoxification are scattered in the sense that they do not cover all aspects of detoxification needed in a real-world scenario. Notably, prior works restrict the task of developing detoxification models to only a seen subset…

Machine Learning · Computer Science 2024-10-07 Md Tawkat Islam Khondaker , Muhammad Abdul-Mageed , Laks V. S. Lakshmanan

Toxicity Detection for Free

Current LLMs are generally aligned to follow safety requirements and tend to refuse toxic prompts. However, LLMs can fail to refuse toxic prompts or be overcautious and refuse benign examples. In addition, state-of-the-art toxicity…

Computation and Language · Computer Science 2024-11-11 Zhanhao Hu , Julien Piet , Geng Zhao , Jiantao Jiao , David Wagner

From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages

In this paper, we propose a model-agnostic cost-effective approach to developing bilingual base large language models (LLMs) to support English and any target language. The method includes vocabulary expansion, initialization of new…

Computation and Language · Computer Science 2024-10-25 Artur Kiulian , Anton Polishko , Mykola Khandoga , Yevhen Kostiuk , Guillermo Gabrielli , Łukasz Gagała , Fadi Zaraket , Qusai Abu Obaida , Hrishikesh Garud , Wendy Wing Yee Mak , Dmytro Chaplynskyi , Selma Belhadj Amor , Grigol Peradze

Test-Time Detoxification without Training or Learning Anything

Large language models can produce toxic or inappropriate text even for benign inputs, creating risks when deployed at scale. Detoxification is therefore important for safety and user trust, particularly when we want to reduce harmful…

Computation and Language · Computer Science 2026-02-04 Baturay Saglam , Dionysis Kalogerias

Learning Multilingual Embeddings for Cross-Lingual Information Retrieval in the Presence of Topically Aligned Corpora

Cross-lingual information retrieval is a challenging task in the absence of aligned parallel corpora. In this paper, we address this problem by considering topically aligned corpora designed for evaluating an IR setup. To emphasize, we…

Information Retrieval · Computer Science 2018-04-13 Mitodru Niyogi , Kripabandhu Ghosh , Arnab Bhattacharya