Related papers: Data Representativeness in Accessibility Datasets:…

Understanding the Representation and Representativeness of Age in AI Data Sets

A diverse representation of different demographic groups in AI training data sets is important in ensuring that the models will work for a large range of users. To this end, recent efforts in AI fairness and inclusion have advocated for…

Computers and Society · Computer Science 2021-05-07 Joon Sung Park , Michael S. Bernstein , Robin N. Brewer , Ece Kamar , Meredith Ringel Morris

Sharing Practices for Datasets Related to Accessibility and Aging

Datasets sourced from people with disabilities and older adults play an important role in innovation, benchmarking, and mitigating bias for both assistive and inclusive AI-infused applications. However, they are scarce. We conduct a…

Human-Computer Interaction · Computer Science 2021-08-25 Rie Kamikubo , Utkarsh Dwivedi , Hernisa Kacorri

Contributing to Accessibility Datasets: Reflections on Sharing Study Data by Blind People

To ensure that AI-infused systems work for disabled people, we need to bring accessibility datasets sourced from this community in the development lifecycle. However, there are many ethical and privacy concerns limiting greater data…

Computers and Society · Computer Science 2023-03-10 Rie Kamikubo , Kyungjun Lee , Hernisa Kacorri

An Investigation of Experiences Engaging the Margins in Data-Centric Innovation

Data-centric technologies provide exciting opportunities, but recent research has shown how lack of representation in datasets, often as a result of systemic inequities and socioeconomic disparities, can produce inequitable outcomes that…

Human-Computer Interaction · Computer Science 2025-01-15 Gabriella Thompson , Ebtesam Al Haque , Paulette Blanc , Meme Styles , Denae Ford , Angela D. R. Smith , Brittany Johnson

Representation Bias in Data: A Survey on Identification and Resolution Techniques

Data-driven algorithms are only as good as the data they work with, while data sets, especially social data, often fail to represent minorities adequately. Representation Bias in data can happen due to various reasons ranging from…

Databases · Computer Science 2023-03-21 Nima Shahbazi , Yin Lin , Abolfazl Asudeh , H. V. Jagadish

Beyond Internal Data: Constructing Complete Datasets for Fairness Testing

As AI becomes prevalent in high-risk domains and decision-making, it is essential to test for potential harms and biases. This urgency is reflected by the global emergence of AI regulations that emphasise fairness and adequate testing, with…

Machine Learning · Computer Science 2025-07-25 Varsha Ramineni , Hossein A. Rahmani , Emine Yilmaz , David Barber

Dataset Representativeness and Downstream Task Fairness

Our society collects data on people for a wide range of applications, from building a census for policy evaluation to running meaningful clinical trials. To collect data, we typically sample individuals with the goal of accurately…

Machine Learning · Computer Science 2024-07-02 Victor Borza , Andrew Estornell , Chien-Ju Ho , Bradley Malin , Yevgeniy Vorobeychik

Bias in Data-driven AI Systems -- An Introductory Survey

AI-based systems are widely employed nowadays to make decisions that have far-reaching impacts on individuals and society. Their decisions might affect everyone, everywhere and anytime, entailing concerns about potential human rights…

Computers and Society · Computer Science 2020-01-28 Eirini Ntoutsi , Pavlos Fafalios , Ujwal Gadiraju , Vasileios Iosifidis , Wolfgang Nejdl , Maria-Esther Vidal , Salvatore Ruggieri , Franco Turini , Symeon Papadopoulos , Emmanouil Krasanakis , Ioannis Kompatsiaris , Katharina Kinder-Kurlanda , Claudia Wagner , Fariba Karimi , Miriam Fernandez , Harith Alani , Bettina Berendt , Tina Kruegel , Christian Heinze , Klaus Broelemann , Gjergji Kasneci , Thanassis Tiropanis , Steffen Staab

Data Representativity for Machine Learning and AI Systems

Data representativity is crucial when drawing inference from data through machine learning models. Scholars have increased focus on unraveling the bias and fairness in models, also in relation to inherent biases in the input data. However,…

Machine Learning · Statistics 2023-02-06 Line H. Clemmensen , Rune D. Kjærsgaard

Improving Fairness of AI Systems with Lossless De-biasing

In today's society, AI systems are increasingly used to make critical decisions such as credit scoring and patient triage. However, great convenience brought by AI systems comes with troubling prevalence of bias against underrepresented…

Machine Learning · Computer Science 2021-05-11 Yan Zhou , Murat Kantarcioglu , Chris Clifton

Framework for Accessible and Inclusive Teaching Materials for Statistics and Data Science Courses

Despite rapid growth in the data science workforce, people of color, women, those with disabilities, and others remain underrepresented in, underserved by, and sometimes excluded from the field. This pattern prevents equal opportunity for…

Other Statistics · Statistics 2024-07-23 Mine Dogucu , Alicia A. Johnson , Miles Ott

Beyond Internal Data: Bounding and Estimating Fairness from Incomplete Data

Ensuring fairness in AI systems is critical, especially in high-stakes domains such as lending, hiring, and healthcare. This urgency is reflected in emerging global regulations that mandate fairness assessments and independent bias audits.…

Machine Learning · Computer Science 2025-08-19 Varsha Ramineni , Hossein A. Rahmani , Emine Yilmaz , David Barber

Towards affective computing that works for everyone

Missing diversity, equity, and inclusion elements in affective computing datasets directly affect the accuracy and fairness of emotion recognition algorithms across different groups. A literature review reveals how affective computing…

Human-Computer Interaction · Computer Science 2023-09-20 Tessa Verhoef , Eduard Fosch-Villaronga

Generative AI Carries Non-Democratic Biases and Stereotypes: Representation of Women, Black Individuals, Age Groups, and People with Disability in AI-Generated Images across Occupations

In this study, I investigate how generative artificial intelligence (AI) systems reproduce and reinforce societal biases, with a specific focus on the representation of women, Black individuals, age groups, and people with visible…

Artificial Intelligence · Computer Science 2026-04-29 Ayoob Sadeghiani

Considerations for Ethical Speech Recognition Datasets

Speech AI Technologies are largely trained on publicly available datasets or by the massive web-crawling of speech. In both cases, data acquisition focuses on minimizing collection effort, without necessarily taking the data subjects'…

Computers and Society · Computer Science 2023-05-04 Orestis Papakyriakopoulos , Alice Xiang

Disability data futures: Achievable imaginaries for AI and disability data justice

Data are the medium through which individuals' identities and experiences are filtered in contemporary states and systems, and AI is increasingly the layer mediating between people, data, and decisions. The history of data and AI is often…

Human-Computer Interaction · Computer Science 2024-11-07 Denis Newman-Griffis , Bonnielin Swenor , Rupa Valdez , Gillian Mason

AccessShare: Co-designing Data Access and Sharing with Blind People

Blind people are often called to contribute image data to datasets for AI innovation with the hope for future accessibility and inclusion. Yet, the visual inspection of the contributed images is inaccessible. To this day, we lack mechanisms…

Human-Computer Interaction · Computer Science 2024-07-30 Rie Kamikubo , Farnaz Zamiri Zeraati , Kyungjun Lee , Hernisa Kacorri

Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition

Demographic biases in source datasets have been shown as one of the causes of unfairness and discrimination in the predictions of Machine Learning models. One of the most prominent types of demographic bias are statistical imbalances in the…

Computer Vision and Pattern Recognition · Computer Science 2024-06-26 Iris Dominguez-Catena , Daniel Paternain , Mikel Galar

Increasing Data Equity Through Accessibility

This position statement is a response to the Office of Science and Technology Policy's Request for Information on "Equitable Data Engagement and Accountability." This response considers data equity specifically for people with disabilities.…

Human-Computer Interaction · Computer Science 2022-10-06 Frank Elavsky , Jennifer Mankoff , Arvind Satyanarayan

"What We Can't Measure, We Can't Understand": Challenges to Demographic Data Procurement in the Pursuit of Fairness

As calls for fair and unbiased algorithmic systems increase, so too does the number of individuals working on algorithmic fairness in industry. However, these practitioners often do not have access to the demographic data they feel they…

Computers and Society · Computer Science 2021-01-26 McKane Andrus , Elena Spitzer , Jeffrey Brown , Alice Xiang