Related papers: Decoding Hate: Exploring Language Models' Reaction…

Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection

Large language models (LLMs) excel in many diverse applications beyond language generation, e.g., translation, summarization, and sentiment analysis. One intriguing application is in text classification. This becomes pertinent in the realm…

Computation and Language · Computer Science 2024-03-14 Tharindu Kumarage , Amrita Bhattacharjee , Joshua Garland

An Investigation of Large Language Models for Real-World Hate Speech Detection

Hate speech has emerged as a major problem plaguing our social spaces today. While there have been significant efforts to address this problem, existing methods are still significantly limited in effectively detecting hate speech online. A…

Computers and Society · Computer Science 2024-01-09 Keyan Guo , Alexander Hu , Jaden Mu , Ziheng Shi , Ziming Zhao , Nishant Vishwamitra , Hongxin Hu

Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales

Although social media platforms are a prominent arena for users to engage in interpersonal discussions and express opinions, the facade and anonymity offered by social media may allow users to spew hate speech and offensive content. Given…

Computation and Language · Computer Science 2024-05-09 Ayushi Nirmal , Amrita Bhattacharjee , Paras Sheth , Huan Liu

HateRephrase: Zero- and Few-Shot Reduction of Hate Intensity in Online Posts using Large Language Models

Hate speech has become pervasive in today's digital age. Although there has been considerable research to detect hate speech or generate counter speech to combat hateful views, these approaches still cannot completely eliminate the…

Computation and Language · Computer Science 2023-10-24 Vibhor Agarwal , Yu Chen , Nishanth Sastry

Evaluation of Hate Speech Detection Using Large Language Models and Geographical Contextualization

The proliferation of hate speech on social media is one of the serious issues that is bringing huge impacts to society: an escalation of violence, discrimination, and social fragmentation. The problem of detecting hate speech is…

Computation and Language · Computer Science 2025-02-28 Anwar Hossain Zahid , Monoshi Kumar Roy , Swarna Das

Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models?

Hate speech detection across contemporary social media presents unique challenges due to linguistic diversity and the informal nature of online discourse. These challenges are further amplified in settings involving code-mixing,…

Computation and Language · Computer Science 2025-06-17 Daman Deep Singh , Ramanuj Bhattacharjee , Abhijnan Chakraborty

LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection

In the evolving landscape of online communication, hate speech detection remains a formidable challenge, further compounded by the diversity of digital platforms. This study investigates the effectiveness and adaptability of pre-trained and…

Computation and Language · Computer Science 2025-05-01 Ahmad Nasir , Aadish Sharma , Kokil Jaidka , Saifuddin Ahmed

HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models

Hate speech encompasses verbal, written, or behavioral communication that targets derogatory or discriminatory language against individuals or groups based on sensitive characteristics. Automated hate speech detection plays a crucial role…

Computation and Language · Computer Science 2024-05-06 Tanmay Sen , Ansuman Das , Mrinmay Sen

HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns

Large Language Models (LLMs) have raised increasing concerns about their misuse in generating hate speech. Among all the efforts to address this issue, hate speech detectors play a crucial role. However, the effectiveness of different…

Cryptography and Security · Computer Science 2025-01-29 Xinyue Shen , Yixin Wu , Yiting Qu , Michael Backes , Savvas Zannettou , Yang Zhang

Detecting and Correcting Hate Speech in Multimodal Memes with Large Visual Language Model

Recently, large language models (LLMs) have taken the spotlight in natural language processing. Further, integrating LLMs with vision enables the users to explore more emergent abilities in multimodality. Visual language models (VLMs), such…

Computation and Language · Computer Science 2023-11-14 Minh-Hao Van , Xintao Wu

Exploring the Adversarial Capabilities of Large Language Models

The proliferation of large language models (LLMs) has sparked widespread and general interest due to their strong language generation capabilities, offering great potential for both industry and research. While previous research delved into…

Artificial Intelligence · Computer Science 2024-07-09 Lukas Struppek , Minh Hieu Le , Dominik Hintersdorf , Kristian Kersting

Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection

Hate speech in social media is a growing phenomenon, and detecting such toxic content has recently gained significant traction in the research community. Existing studies have explored fine-tuning language models (LMs) to perform hate…

Computation and Language · Computer Science 2023-03-07 Md Rabiul Awal , Roy Ka-Wei Lee , Eshaan Tanwar , Tanmay Garg , Tanmoy Chakraborty

Advancing Hate Speech Detection with Transformers: Insights from the MetaHate

Hate speech is a widespread and harmful form of online discourse, encompassing slurs and defamatory posts that can have serious social, psychological, and sometimes physical impacts on targeted individuals and communities. As social media…

Machine Learning · Computer Science 2025-08-08 Santosh Chapagain , Shah Muhammad Hamdi , Soukaina Filali Boubrahimi

Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech

The assessment of legal problems requires the consideration of a specific legal system and its levels of abstraction, from constitutional law to statutory law to case law. The extent to which Large Language Models (LLMs) internalize such…

Computation and Language · Computer Science 2025-06-04 Florian Ludwig , Torsten Zesch , Frederike Zufall

Supporting Human Raters with the Detection of Harmful Content using Large Language Models

In this paper, we explore the feasibility of leveraging large language models (LLMs) to automate or otherwise assist human raters with identifying harmful content including hate speech, harassment, violent extremism, and election…

Cryptography and Security · Computer Science 2024-06-19 Kurt Thomas , Patrick Gage Kelley , David Tao , Sarah Meiklejohn , Owen Vallis , Shunwen Tan , Blaž Bratanič , Felipe Tiengo Ferreira , Vijay Kumar Eranti , Elie Bursztein

Explain the Flag: Contextualizing Hate Speech Beyond Censorship

Hate, derogatory, and offensive speech remains a persistent challenge in online platforms and public discourse. While automated detection systems are widely used, most focus on censorship or removal, raising concerns for transparency and…

Computation and Language · Computer Science 2026-04-17 Jason Liartis , Eirini Kaldeli , Lambrini Gyftokosta , Eleftherios Chelioudakis , Orfeas Menis Mastromichalakis

Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate

Automated counter-narratives (CN) offer a promising strategy for mitigating online hate speech, yet concerns about their affective tone, accessibility, and ethical risks remain. We propose a framework for evaluating Large Language Model…

Computation and Language · Computer Science 2025-06-05 Mikel K. Ngueajio , Flor Miriam Plaza-del-Arco , Yi-Ling Chung , Danda B. Rawat , Amanda Cercas Curry

Don't Go To Extremes: Revealing the Excessive Sensitivity and Calibration Limitations of LLMs in Implicit Hate Speech Detection

The fairness and trustworthiness of Large Language Models (LLMs) are receiving increasing attention. Implicit hate speech, which employs indirect language to convey hateful intentions, occupies a significant portion of practice. However,…

Computation and Language · Computer Science 2024-07-24 Min Zhang , Jianfeng He , Taoran Ji , Chang-Tien Lu

Investigating Annotator Bias in Large Language Models for Hate Speech Detection

Data annotation, the practice of assigning descriptive labels to raw data, is pivotal in optimizing the performance of machine learning models. However, it is a resource-intensive process susceptible to biases introduced by annotators. The…

Computation and Language · Computer Science 2024-11-19 Amit Das , Zheng Zhang , Najib Hasan , Souvika Sarkar , Fatemeh Jamshidi , Tathagata Bhattacharya , Mostafa Rahgouy , Nilanjana Raychawdhary , Dongji Feng , Vinija Jain , Aman Chadha , Mary Sandage , Lauramarie Pope , Gerry Dozier , Cheryl Seals

Interpretable Multi-Modal Hate Speech Detection

With growing role of social media in shaping public opinions and beliefs across the world, there has been an increased attention to identify and counter the problem of hate speech on social media. Hate speech on online spaces has serious…

Computation and Language · Computer Science 2021-03-03 Prashanth Vijayaraghavan , Hugo Larochelle , Deb Roy