English
Related papers

Related papers: Aligning AI With Shared Human Values

200 papers

ETHICS is probably the most-cited dataset for testing the ethical capabilities of language models. Drawing on moral theory, psychology, and prompt evaluation, we interrogate the validity of the ETHICS benchmark. Adding to prior work, our…

Computers and Society · Computer Science 2024-11-27 Leif Hancox-Li , Borhane Blili-Hamelin

This report summarizes a short study of the performance of GPT-4 on the ETHICS dataset. The ETHICS dataset consists of five sub-datasets covering different fields of ethics: Justice, Deontology, Virtue Ethics, Utilitarianism, and…

Computation and Language · Computer Science 2023-09-20 Sergey Rodionov , Zarathustra Amadeus Goertzel , Ben Goertzel

This study establishes a novel framework for systematically evaluating the moral reasoning capabilities of large language models (LLMs) as they increasingly integrate into critical societal domains. Current assessment methodologies lack the…

Computers and Society · Computer Science 2025-05-05 Junfeng Jiao , Saleh Afroogh , Abhejay Murali , Kevin Chen , David Atkinson , Amit Dhurandhar

Big models have greatly advanced AI's ability to understand, generate, and manipulate information and content, enabling numerous applications. However, as these models become increasingly integrated into everyday life, their inherent…

Computers and Society · Computer Science 2023-10-27 Xiaoyuan Yi , Jing Yao , Xiting Wang , Xing Xie

Benchmarks are seen as the cornerstone for measuring technical progress in Artificial Intelligence (AI) research and have been developed for a variety of tasks ranging from question answering to facial recognition. An increasingly prominent…

Computers and Society · Computer Science 2022-04-12 Travis LaCroix , Alexandra Sasha Luccioni

Artificial intelligence (AI) technologies should adhere to human norms to better serve our society and avoid disseminating harmful or misleading information, particularly in Conversational Information Retrieval (CIR). Previous work,…

Computation and Language · Computer Science 2023-10-03 Yiyao Yu , Junjie Wang , Yuxiang Zhang , Lin Zhang , Yujiu Yang , Tetsuya Sakai

Moral cognition is a crucial yet underexplored aspect of decision-making in AI models. Regardless of the application domain, it should be a consideration that allows for ethically aligned decision-making. This paper presents a multifaceted…

Computers and Society · Computer Science 2026-02-17 Aisha Aijaz , Raghava Mutharaju , Manohar Kumar

Modern AI and robotic systems are characterized by a high and ever-increasing level of autonomy. At the same time, their applications in fields such as autonomous driving, service robotics and digital personal assistants move closer to…

Artificial Intelligence · Computer Science 2018-06-28 Nicolas Berberich , Klaus Diepold

Moral AI has been studied in the fields of philosophy and artificial intelligence. Although most existing studies are only theoretical, recent developments in AI have made it increasingly necessary to implement AI with morality. On the…

Artificial Intelligence · Computer Science 2023-06-21 Masashi Takeshita , Rzepka Rafal , Kenji Araki

As AI systems become an increasing part of people's everyday lives, it becomes ever more important that they understand people's ethical norms. Motivated by descriptive ethics, a field of study that focuses on people's descriptive judgments…

Computation and Language · Computer Science 2021-03-25 Nicholas Lourie , Ronan Le Bras , Yejin Choi

Conversational agents have come increasingly closer to human competence in open-domain dialogue settings; however, such models can reflect insensitive, hurtful, or entirely incoherent viewpoints that erode a user's trust in the moral…

Computation and Language · Computer Science 2022-04-08 Caleb Ziems , Jane A. Yu , Yi-Chia Wang , Alon Halevy , Diyi Yang

The question of whether artificial entities deserve moral consideration has become one of the defining ethical challenges of AI research. Existing frameworks for moral patiency rely on verified ontological properties, such as sentience,…

Computers and Society · Computer Science 2026-03-03 Faezeh B. Pasandi , Hannah B. Pasandi

This article presents a critique of ethics in the context of artificial intelligence (AI). It argues for the need to question established patterns of thought and traditional authorities, including core concepts such as autonomy, morality,…

Computers and Society · Computer Science 2024-08-09 Irina Spiegel

In the rapidly evolving field of artificial intelligence, large language models (LLMs) have emerged as powerful tools for a myriad of applications, from natural language processing to decision-making support systems. However, as these…

Computation and Language · Computer Science 2025-07-08 Jianchao Ji , Yutong Chen , Mingyu Jin , Wujiang Xu , Wenyue Hua , Yongfeng Zhang

Artificial intelligence (AI) is advancing at a pace that raises urgent questions about how to align machine decision-making with human moral values. This working paper investigates how leading AI systems prioritize moral outcomes and what…

Artificial Intelligence · Computer Science 2025-09-15 Eoin O'Doherty , Nicole Weinrauch , Andrew Talone , Uri Klempner , Xiaoyuan Yi , Xing Xie , Yi Zeng

This paper presents a theoretical framework for the AI ethical resonance hypothesis, which proposes that advanced AI systems with purposefully designed cognitive structures ("ethical resonators") may emerge with the ability to identify…

Computers and Society · Computer Science 2025-07-21 Tomasz Zgliczyński-Cuber

Developing AI systems capable of nuanced ethical reasoning is critical as they increasingly influence human decisions, yet existing models often rely on superficial correlations rather than principled moral understanding. This paper…

Computers and Society · Computer Science 2025-10-16 Mahamodul Hasan Mahadi , Md. Nasif Safwan , Souhardo Rahman , Shahnaj Parvin , Aminun Nahar , Kamruddin Nur

Large-scale behavioral datasets enable researchers to use complex machine learning algorithms to better predict human behavior, yet this increased predictive power does not always lead to a better understanding of the behavior in question.…

Computers and Society · Computer Science 2019-05-14 Mayank Agrawal , Joshua C. Peterson , Thomas L. Griffiths

AI ethics is an emerging field with multiple, competing narratives about how to best solve the problem of building human values into machines. Two major approaches are focused on bias and compliance, respectively. But neither of these ideas…

Artificial Intelligence · Computer Science 2023-02-24 Thomas Krendl Gilbert , Megan Welle Brozek , Andrew Brozek

Large-scale language technologies are increasingly used in various forms of communication with humans across different contexts. One particular use case for these technologies is conversational agents, which output natural language text in…

Computers and Society · Computer Science 2022-12-22 Atoosa Kasirzadeh , Iason Gabriel
‹ Prev 1 2 3 10 Next ›