Related papers: Aligning AI With Shared Human Values

Is ETHICS about ethics? Evaluating the ETHICS benchmark

ETHICS is probably the most-cited dataset for testing the ethical capabilities of language models. Drawing on moral theory, psychology, and prompt evaluation, we interrogate the validity of the ETHICS benchmark. Adding to prior work, our…

Computers and Society · Computer Science 2024-11-27 Leif Hancox-Li , Borhane Blili-Hamelin

An Evaluation of GPT-4 on the ETHICS Dataset

This report summarizes a short study of the performance of GPT-4 on the ETHICS dataset. The ETHICS dataset consists of five sub-datasets covering different fields of ethics: Justice, Deontology, Virtue Ethics, Utilitarianism, and…

Computation and Language · Computer Science 2023-09-20 Sergey Rodionov , Zarathustra Amadeus Goertzel , Ben Goertzel

LLM Ethics Benchmark: A Three-Dimensional Assessment System for Evaluating Moral Reasoning in Large Language Models

This study establishes a novel framework for systematically evaluating the moral reasoning capabilities of large language models (LLMs) as they increasingly integrate into critical societal domains. Current assessment methodologies lack the…

Computers and Society · Computer Science 2025-05-05 Junfeng Jiao , Saleh Afroogh , Abhejay Murali , Kevin Chen , David Atkinson , Amit Dhurandhar

Unpacking the Ethical Value Alignment in Big Models

Big models have greatly advanced AI's ability to understand, generate, and manipulate information and content, enabling numerous applications. However, as these models become increasingly integrated into everyday life, their inherent…

Computers and Society · Computer Science 2023-10-27 Xiaoyuan Yi , Jing Yao , Xiting Wang , Xing Xie

Metaethical Perspectives on 'Benchmarking' AI Ethics

Benchmarks are seen as the cornerstone for measuring technical progress in Artificial Intelligence (AI) research and have been developed for a variety of tasks ranging from question answering to facial recognition. An increasingly prominent…

Computers and Society · Computer Science 2022-04-12 Travis LaCroix , Alexandra Sasha Luccioni

EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval

Artificial intelligence (AI) technologies should adhere to human norms to better serve our society and avoid disseminating harmful or misleading information, particularly in Conversational Information Retrieval (CIR). Previous work,…

Computation and Language · Computer Science 2023-10-03 Yiyao Yu , Junjie Wang , Yuxiang Zhang , Lin Zhang , Yujiu Yang , Tetsuya Sakai

Expected Moral Shortfall for Ethical Competence in Decision-making Models

Moral cognition is a crucial yet underexplored aspect of decision-making in AI models. Regardless of the application domain, it should be a consideration that allows for ethically aligned decision-making. This paper presents a multifaceted…

Computers and Society · Computer Science 2026-02-17 Aisha Aijaz , Raghava Mutharaju , Manohar Kumar

The Virtuous Machine - Old Ethics for New Technology?

Modern AI and robotic systems are characterized by a high and ever-increasing level of autonomy. At the same time, their applications in fields such as autonomous driving, service robotics and digital personal assistants move closer to…

Artificial Intelligence · Computer Science 2018-06-28 Nicolas Berberich , Klaus Diepold

Towards Theory-based Moral AI: Moral AI with Aggregating Models Based on Normative Ethical Theory

Moral AI has been studied in the fields of philosophy and artificial intelligence. Although most existing studies are only theoretical, recent developments in AI have made it increasingly necessary to implement AI with morality. On the…

Artificial Intelligence · Computer Science 2023-06-21 Masashi Takeshita , Rzepka Rafal , Kenji Araki

Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes

As AI systems become an increasing part of people's everyday lives, it becomes ever more important that they understand people's ethical norms. Motivated by descriptive ethics, a field of study that focuses on people's descriptive judgments…

Computation and Language · Computer Science 2021-03-25 Nicholas Lourie , Ronan Le Bras , Yejin Choi

The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems

Conversational agents have come increasingly closer to human competence in open-domain dialogue settings; however, such models can reflect insensitive, hurtful, or entirely incoherent viewpoints that erode a user's trust in the moral…

Computation and Language · Computer Science 2022-04-08 Caleb Ziems , Jane A. Yu , Yi-Chia Wang , Alon Halevy , Diyi Yang

Alignment Is Not Enough: A Relational Framework for Moral Standing in Human-AI Interaction

The question of whether artificial entities deserve moral consideration has become one of the defining ethical challenges of AI research. Existing frameworks for moral patiency rely on verified ontological properties, such as sentience,…

Computers and Society · Computer Science 2026-03-03 Faezeh B. Pasandi , Hannah B. Pasandi

Criticizing Ethics According to Artificial Intelligence

This article presents a critique of ethics in the context of artificial intelligence (AI). It argues for the need to question established patterns of thought and traditional authorities, including core concepts such as autonomy, morality,…

Computers and Society · Computer Science 2024-08-09 Irina Spiegel

MoralBench: Moral Evaluation of LLMs

In the rapidly evolving field of artificial intelligence, large language models (LLMs) have emerged as powerful tools for a myriad of applications, from natural language processing to decision-making support systems. However, as these…

Computation and Language · Computer Science 2025-07-08 Jianchao Ji , Yutong Chen , Mingyu Jin , Wujiang Xu , Wenyue Hua , Yongfeng Zhang

The Morality of Probability: How Implicit Moral Biases in LLMs May Shape the Future of Human-AI Symbiosis

Artificial intelligence (AI) is advancing at a pace that raises urgent questions about how to align machine decision-making with human moral values. This working paper investigates how leading AI systems prioritize moral outcomes and what…

Artificial Intelligence · Computer Science 2025-09-15 Eoin O'Doherty , Nicole Weinrauch , Andrew Talone , Uri Klempner , Xiaoyuan Yi , Xing Xie , Yi Zeng

The AI Ethical Resonance Hypothesis: The Possibility of Discovering Moral Meta-Patterns in AI Systems

This paper presents a theoretical framework for the AI ethical resonance hypothesis, which proposes that advanced AI systems with purposefully designed cognitive structures ("ethical resonators") may emerge with the ability to identify…

Computers and Society · Computer Science 2025-07-21 Tomasz Zgliczyński-Cuber

Ethic-BERT: An Enhanced Deep Learning Model for Ethical and Non-Ethical Content Classification

Developing AI systems capable of nuanced ethical reasoning is critical as they increasingly influence human decisions, yet existing models often rely on superficial correlations rather than principled moral understanding. This paper…

Computers and Society · Computer Science 2025-10-16 Mahamodul Hasan Mahadi , Md. Nasif Safwan , Souhardo Rahman , Shahnaj Parvin , Aminun Nahar , Kamruddin Nur

Using Machine Learning to Guide Cognitive Modeling: A Case Study in Moral Reasoning

Large-scale behavioral datasets enable researchers to use complex machine learning algorithms to better predict human behavior, yet this increased predictive power does not always lead to a better understanding of the behavior in question.…

Computers and Society · Computer Science 2019-05-14 Mayank Agrawal , Joshua C. Peterson , Thomas L. Griffiths

Beyond Bias and Compliance: Towards Individual Agency and Plurality of Ethics in AI

AI ethics is an emerging field with multiple, competing narratives about how to best solve the problem of building human values into machines. Two major approaches are focused on bias and compliance, respectively. But neither of these ideas…

Artificial Intelligence · Computer Science 2023-02-24 Thomas Krendl Gilbert , Megan Welle Brozek , Andrew Brozek

In conversation with Artificial Intelligence: aligning language models with human values

Large-scale language technologies are increasingly used in various forms of communication with humans across different contexts. One particular use case for these technologies is conversational agents, which output natural language text in…

Computers and Society · Computer Science 2022-12-22 Atoosa Kasirzadeh , Iason Gabriel