Related papers: Detecting Compromised Implicit Association Test Re…

Reproducibility of Implicit Association Test (IAT) -- Case study of meta-analysis of racial bias research claims

The Implicit Association Test, IAT, is widely used to measure hidden (subconscious) human biases, implicit bias, of many topics: race, gender, age, ethnicity, religion stereotypes. There is a need to understand the reliability of these…

Applications · Statistics 2023-12-27 S. Stanley Young , Warren B. Kindzierski

Implicit Bias in LLMs: A Survey

Due to the implement of guardrails by developers, Large language models (LLMs) have demonstrated exceptional performance in explicit bias tests. However, bias in LLMs may occur not only explicitly, but also implicitly, much like humans who…

Computation and Language · Computer Science 2025-03-05 Xinru Lin , Luyang Li

Using Online Implicit Association Tests in Opinion Polling

Opinion polls have now become a very important component of society because they are now a defacto component of our daily news cycle and because their results influence governments and business in ways which are not always obvious to us.…

Computers and Society · Computer Science 2020-07-09 Alan Smeaton , Hyowon Lee , Niamh Morris , David Hanley

Measuring Implicit Bias in Explicitly Unbiased Large Language Models

Large language models (LLMs) can pass explicit social bias tests but still harbor implicit biases, similar to humans who endorse egalitarian beliefs yet exhibit subtle biases. Measuring such implicit biases can be a challenge: as LLMs…

Computers and Society · Computer Science 2024-05-24 Xuechunzi Bai , Angelina Wang , Ilia Sucholutsky , Thomas L. Griffiths

Seeing Stereotypes

Reliance on stereotypes is a persistent feature of human decision-making and has been extensively documented in educational settings, where it can shape students' confidence, performance, and long-term human capital accumulation. While…

General Economics · Economics 2025-03-05 Elisa Baldazzi , Pietro Biroli , Marina Della Giusta , Florent Dubois

A word association network methodology for evaluating implicit biases in LLMs compared to humans

As Large language models (LLMs) become increasingly integrated into our lives, their inherent social biases remain a pressing concern. Detecting and evaluating these biases can be challenging because they are often implicit rather than…

Computation and Language · Computer Science 2025-10-29 Katherine Abramski , Giulio Rossetti , Massimo Stella

The African Woman is Rhythmic and Soulful: An Investigation of Implicit Biases in LLM Open-ended Text Generation

This paper investigates the subtle and often concealed biases present in Large Language Models (LLMs), focusing on implicit biases that may remain despite passing explicit bias tests. Implicit biases are significant because they influence…

Computation and Language · Computer Science 2024-10-01 Serene Lim , María Pérez-Ortiz

Implicit Bias-Like Patterns in Reasoning Models

Implicit biases refer to automatic mental processes that shape perceptions, judgments, and behaviors. Previous research on "implicit bias" in LLMs focused primarily on outputs rather than the processes underlying the outputs. We present the…

Computers and Society · Computer Science 2026-04-07 Messi H. J. Lee , Calvin K. Lai

Bayesian Inference of Psychometric Variables From Brain and Behavior in Implicit Association Tests

Objective. We establish a principled method for inferring mental health related psychometric variables from neural and behavioral data using the Implicit Association Test (IAT) as the data generation engine, aiming to overcome the limited…

Machine Learning · Computer Science 2026-03-18 Christian A. Kothe , Sean Mullen , Michael V. Bronstein , Grant Hanada , Marcelo Cicconet , Aaron N. McInnes , Tim Mullen , Marc Aafjes , Scott R. Sponheim , Alik S. Widge

Learning Implicit Sentiment in Aspect-based Sentiment Analysis with Supervised Contrastive Pre-Training

Aspect-based sentiment analysis aims to identify the sentiment polarity of a specific aspect in product reviews. We notice that about 30% of reviews do not contain obvious opinion words, but still convey clear human-aware sentiment…

Computation and Language · Computer Science 2021-11-04 Zhengyan Li , Yicheng Zou , Chong Zhang , Qi Zhang , Zhongyu Wei

Detection and Mitigation of Bias in Ted Talk Ratings

Unbiased data collection is essential to guaranteeing fairness in artificial intelligence models. Implicit bias, a form of behavioral conditioning that leads us to attribute predetermined characteristics to members of certain groups and…

Artificial Intelligence · Computer Science 2020-03-03 Rupam Acharyya , Shouman Das , Ankani Chattoraj , Oishani Sengupta , Md Iftekar Tanveer

An Open-Source Web App for Creating and Scoring Qualtrics-based Implicit Association Test

The Implicit Association Test (IAT) is a common behavioral paradigm to assess implicit attitudes in various research contexts. In recent years, researchers have sought to collect IAT data remotely using online applications. Compared to…

Quantitative Methods · Quantitative Biology 2021-11-04 Yong Cui , Jason D. Robinson , Seokhun Kim , George Kypriotakis , Charles E. Green , Sanjay S. Shete , Paul M. Cinciripini

Can We Derive Explicit and Implicit Bias from Corpus?

Language is a popular resource to mine speakers' attitude bias, supposing that speakers' statements represent their bias on concepts. However, psychology studies show that people's explicit bias in statements can be different from their…

Social and Information Networks · Computer Science 2019-06-03 Bo Wang , Baixiang Xue , Anthony G. Greenwald

Inference-Time Reasoning Selectively Reduces Implicit Social Bias in Large Language Models

Drawing on constructs from psychology, prior work has identified a distinction between explicit and implicit bias in large language models (LLMs). While many LLMs undergo post-training alignment and safety procedures to avoid expressions of…

Computers and Society · Computer Science 2026-02-05 Molly Apsel , Michael N. Jones

MIST: Towards Multi-dimensional Implicit BiaS Evaluation of LLMs for Theory of Mind

Theory of Mind (ToM) in Large Language Models (LLMs) refers to the model's ability to infer the mental states of others, with failures in this ability often manifesting as systemic implicit biases. Assessing this challenge is difficult, as…

Computation and Language · Computer Science 2026-01-19 Yanlin Li , Hao Liu , Huimin Liu , Kun Wang , Yinwei Wei , Yupeng Hu

Implicit Incentive Provision with Misspecified Learning

We study misspecified Bayesian learning in principal-agent relationships, where an agent is assessed by an evaluator and rewarded by the market. The agent's outcome depends on their innate ability, costly effort -- whose effectiveness is…

Theoretical Economics · Economics 2025-12-02 Federico Echenique , Anqi Li

Building Better Deception Probes Using Targeted Instruction Pairs

Linear probes are a promising approach for monitoring AI systems for deceptive behaviour. Previous work has shown that a linear classifier trained on a contrastive instruction pair and a simple dataset can achieve good performance. However,…

Artificial Intelligence · Computer Science 2026-02-03 Vikram Natarajan , Devina Jain , Shivam Arora , Satvik Golechha , Joseph Bloom

Implicitly Constrained Semi-Supervised Linear Discriminant Analysis

Semi-supervised learning is an important and active topic of research in pattern recognition. For classification using linear discriminant analysis specifically, several semi-supervised variants have been proposed. Using any one of these…

Machine Learning · Statistics 2014-11-18 Jesse H. Krijthe , Marco Loog

Covert Bias: The Severity of Social Views' Unalignment in Language Models Towards Implicit and Explicit Opinion

While various approaches have recently been studied for bias identification, little is known about how implicit language that does not explicitly convey a viewpoint affects bias amplification in large language models. To examine the…

Computation and Language · Computer Science 2024-08-19 Abeer Aldayel , Areej Alokaili , Rehab Alahmadi

Improvability Through Semi-Supervised Learning: A Survey of Theoretical Results

Semi-supervised learning is a setting in which one has labeled and unlabeled data available. In this survey we explore different types of theoretical results when one uses unlabeled data in classification and regression tasks. Most methods…

Machine Learning · Computer Science 2020-07-31 Alexander Mey , Marco Loog