Related papers: KTCR: Improving Implicit Hate Detection with Knowl…

Selective Demonstration Retrieval for Improved Implicit Hate Speech Detection

Hate speech detection is a crucial area of research in natural language processing, essential for ensuring online community safety. However, detecting implicit hate speech, where harmful intent is conveyed in subtle or indirect ways,…

Computation and Language · Computer Science 2025-04-17 Yumin Kim, Hwanhee Lee

Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

This paper evaluates data augmentation and feature enhancement techniques for hate speech detection, comparing traditional classifiers, e.g., Delta Term Frequency-Inverse Document Frequency (Delta TF-IDF), with transformer-based models…

Computation and Language · Computer Science 2026-03-06 Brian Jing Hong Nge, Stefan Su, Thanh Thi Nguyen, Campbell Wilson, Alexandra Phelan, Naomi Pfitzner

A Target-Aware Analysis of Data Augmentation for Hate Speech Detection

Hate speech is one of the main threats posed by the widespread use of social networks, despite efforts to limit it. Although attention has been devoted to this issue, the lack of datasets and case studies centered around scarcely…

Computation and Language · Computer Science 2024-10-11 Camilla Casula, Sara Tonelli

Transfer Learning for Hate Speech Detection in Social Media

Today, the internet is an integral part of our daily lives, enabling people to be more connected than ever before. However, this greater connectivity and access to information increase exposure to harmful content such as cyber-bullying and…

Social and Information Networks · Computer Science 2023-10-31 Lanqin Yuan, Tianyu Wang, Gabriela Ferraro, Hanna Suominen, Marian-Andrei Rizoiu

A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media

Generated hateful and toxic content by a portion of users in social media is a rising phenomenon that motivated researchers to dedicate substantial efforts to the challenging direction of hateful content identification. We not only need an…

Social and Information Networks · Computer Science 2019-10-29 Marzieh Mozafari, Reza Farahbakhsh, Noel Crespi

Improving Cross-Domain Hate Speech Generalizability with Emotion Knowledge

Reliable automatic hate speech (HS) detection systems must adapt to the in-flow of diverse new data to curtail hate speech. However, hate speech detection systems commonly lack generalizability in identifying hate speech dissimilar to data…

Computation and Language · Computer Science 2023-12-19 Shi Yin Hong, Susan Gauch

Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches

The proliferation of hate speech on social media platforms has necessitated the development of effective detection and moderation tools. This study evaluates the efficacy of various machine learning models in identifying hate speech and…

Computation and Language · Computer Science 2026-02-25 Saurabh Mishra, Shivani Thakur, Radhika Mamidi

Towards Generalizable Generic Harmful Speech Datasets for Implicit Hate Speech Detection

Implicit hate speech has recently emerged as a critical challenge for social media platforms. While much of the research has traditionally focused on harmful speech in general, the need for generalizable techniques to detect veiled and…

Computation and Language · Computer Science 2025-06-23 Saad Almohaimeed, Saleh Almohaimeed, Damla Turgut, Ladislau Bölöni

Transferring Knowledge via Neighborhood-Aware Optimal Transport for Low-Resource Hate Speech Detection

The concerning rise of hateful content on online platforms has increased the attention towards automatic hate speech detection, commonly formulated as a supervised classification task. State-of-the-art deep learning-based approaches usually…

Computation and Language · Computer Science 2022-10-19 Tulika Bose, Irina Illina, Dominique Fohr

Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation

A growing body of work has focused on text classification methods for detecting the increasing amount of hate speech posted online. This progress has been limited to only a select number of highly-resourced languages causing detection…

Computation and Language · Computer Science 2023-10-05 Aman Khullar, Daniel Nkemelu, Cuong V. Nguyen, Michael L. Best

HatePrototypes: Interpretable and Transferable Representations for Implicit and Explicit Hate Speech Detection

Optimization of offensive content moderation models for different types of hateful messages is typically achieved through continued pre-training or fine-tuning on new hate speech benchmarks. However, existing benchmarks mainly address…

Computation and Language · Computer Science 2026-04-07 Irina Proskurina, Marc-Antoine Carpentier, Julien Velcin

Unsupervised Domain Adaptation for Hate Speech Detection Using a Data Augmentation Approach

Online harassment in the form of hate speech has been on the rise in recent years. Addressing the issue requires a combination of content moderation by people, aided by automatic detection methods. As content moderation is itself harmful to…

Computation and Language · Computer Science 2021-08-03 Sheikh Muhammad Sarwar, Vanessa Murdock

Data Expansion using Back Translation and Paraphrasing for Hate Speech Detection

With proliferation of user generated contents in social media platforms, establishing mechanisms to automatically identify toxic and abusive content becomes a prime concern for regulators, researchers, and society. Keeping the balance…

Computation and Language · Computer Science 2021-06-10 Djamila Romaissa Beddiar, Md Saroar Jahan, Mourad Oussalah

RV-HATE: Reinforced Multi-Module Voting for Implicit Hate Speech Detection

Hate speech remains prevalent in human society and continues to evolve in its forms and expressions. Modern advancements in internet and online anonymity accelerate its rapid spread and complicate its detection. However, hate speech…

Computation and Language · Computer Science 2026-04-24 Yejin Lee, Hyeseon Ahn, Yo-Sub Han

Advancing Hate Speech Detection with Transformers: Insights from the MetaHate

Hate speech is a widespread and harmful form of online discourse, encompassing slurs and defamatory posts that can have serious social, psychological, and sometimes physical impacts on targeted individuals and communities. As social media…

Machine Learning · Computer Science 2025-08-08 Santosh Chapagain, Shah Muhammad Hamdi, Soukaina Filali Boubrahimi

Learning Deep Representations with Probabilistic Knowledge Transfer

Knowledge Transfer (KT) techniques tackle the problem of transferring the knowledge from a large and complex neural network into a smaller and faster one. However, existing KT methods are tailored towards classification tasks and they…

Machine Learning · Computer Science 2019-03-21 Nikolaos Passalis, Anastasios Tefas

Combating high variance in Data-Scarce Implicit Hate Speech Classification

Hate speech classification has been a long-standing problem in natural language processing. However, even though there are numerous hate speech detection methods, they usually overlook a lot of hateful statements due to them being implicit…

Computation and Language · Computer Science 2022-08-30 Debaditya Pal, Kaustubh Chaudhari, Harsh Sharma

Cross-lingual hate speech detection based on multilingual domain-specific word embeddings

Automatic hate speech detection in online social networks is an important open problem in Natural Language Processing (NLP). Hate speech is a multidimensional issue, strongly dependent on language and cultural factors. Despite its…

Computation and Language · Computer Science 2021-05-03 Aymé Arango, Jorge Pérez, Barbara Poblete

DeepHate: Hate Speech Detection via Multi-Faceted Text Representations

Online hate speech is an important issue that breaks the cohesiveness of online social communities and even raises public safety concerns in our societies. Motivated by this rising issue, researchers have developed many traditional machine…

Computation and Language · Computer Science 2021-03-23 Rui Cao, Roy Ka-Wei Lee, Tuan-Anh Hoang

Generative AI for Hate Speech Detection: Evaluation and Findings

Automatic hate speech detection using deep neural models is hampered by the scarcity of labeled datasets, leading to poor generalization. To mitigate this problem, generative AI has been utilized to generate large amounts of synthetic hate…

Computation and Language · Computer Science 2023-11-17 Sagi Pendzel, Tomer Wullach, Amir Adler, Einat Minkov