Related papers: Interactive AI Alignment: Specification, Process, …

AI Alignment Dialogues: An Interactive Approach to AI Alignment in Support Agents

AI alignment is about ensuring AI systems only pursue goals and activities that are beneficial to humans. Most of the current approach to AI alignment is to learn what humans value from their behavioural data. This paper proposes a…

Artificial Intelligence · Computer Science 2023-10-06 Pei-Yu Chen , Myrthe L. Tielman , Dirk K. J. Heylen , Catholijn M. Jonker , M. Birna van Riemsdijk

Building Intelligent User Interfaces for Human-AI Alignment

Aligning AI systems with human values fundamentally relies on effective human feedback. While significant research has addressed training algorithms, the role of user interface is often overlooked and only treated as an implementation…

Human-Computer Interaction · Computer Science 2026-02-13 Danqing Shi

AI Alignment: A Comprehensive Survey

AI alignment aims to make AI systems behave in line with human intentions and values. As AI systems grow more capable, so do risks from misalignment. To provide a comprehensive and up-to-date overview of the alignment field, in this survey,…

Artificial Intelligence · Computer Science 2025-04-07 Jiaming Ji , Tianyi Qiu , Boyuan Chen , Borong Zhang , Hantao Lou , Kaile Wang , Yawen Duan , Zhonghao He , Lukas Vierling , Donghai Hong , Jiayi Zhou , Zhaowei Zhang , Fanzhi Zeng , Juntao Dai , Xuehai Pan , Kwan Yee Ng , Aidan O'Gara , Hua Xu , Brian Tse , Jie Fu , Stephen McAleer , Yaodong Yang , Yizhou Wang , Song-Chun Zhu , Yike Guo , Wen Gao

Position: Towards Bidirectional Human-AI Alignment

Recent advances in general-purpose AI underscore the urgent need to align AI systems with human goals and values. Yet, the lack of a clear, shared understanding of what constitutes "alignment" limits meaningful progress and…

Human-Computer Interaction · Computer Science 2025-09-30 Hua Shen , Tiffany Knearem , Reshmi Ghosh , Kenan Alkiek , Kundan Krishna , Yachuan Liu , Ziqiao Ma , Savvas Petridis , Yi-Hao Peng , Li Qiwei , Sushrita Rakshit , Chenglei Si , Yutong Xie , Jeffrey P. Bigham , Frank Bentley , Joyce Chai , Zachary Lipton , Qiaozhu Mei , Rada Mihalcea , Michael Terry , Diyi Yang , Meredith Ringel Morris , Paul Resnick , David Jurgens

Understanding the Process of Human-AI Value Alignment

Background: Value alignment in computer science research is often used to refer to the process of aligning artificial intelligence with humans, but the way the phrase is used often lacks precision. Objectives: In this paper, we conduct a…

Computers and Society · Computer Science 2026-03-27 Jack McKinlay , Marina De Vos , Janina A. Hoffmann , Andreas Theodorou

Artificial Intelligence, Values and Alignment

This paper looks at philosophical questions that arise in the context of AI alignment. It defends three propositions. First, normative and technical aspects of the AI alignment problem are interrelated, creating space for productive…

Computers and Society · Computer Science 2020-10-07 Iason Gabriel

Alignment has a Fantasia Problem

Modern AI assistants are trained to follow instructions, implicitly assuming that users can clearly articulate their goals and the kind of assistance they need. Decades of behavioral research, however, show that people often engage with AI…

Artificial Intelligence · Computer Science 2026-04-24 Nathanael Jo , Zoe De Simone , Mitchell Gordon , Ashia Wilson

A Statistical Case Against Empirical Human-AI Alignment

Empirical human-AI alignment aims to make AI systems act in line with observed human behavior. While noble in its goals, we argue that empirical alignment can inadvertently introduce statistical biases that warrant caution. This position…

Artificial Intelligence · Computer Science 2025-05-13 Julian Rodemann , Esteban Garces Arias , Christoph Luther , Christoph Jansen , Thomas Augustin

Alignment-Process-Outcome: Rethinking How AIs and Humans Collaborate

In real-world collaboration, alignment, process structure, and outcome quality do not exhibit a simple linear or one-to-one correspondence: similar alignment may accompany either rapid convergence or extensive multi-branch exploration, and…

Human-Computer Interaction · Computer Science 2026-03-12 Haichang Li , Anjun Zhu , Arpit Narechania

Concept Alignment

Discussion of AI alignment (alignment between humans and AI systems) has focused on value alignment, broadly referring to creating AI systems that share human values. We argue that before we can even attempt to align values, it is…

Machine Learning · Computer Science 2024-01-18 Sunayana Rane , Polyphony J. Bruna , Ilia Sucholutsky , Christopher Kello , Thomas L. Griffiths

Toward a Unified Framework for Collaborative Design of Human-AI Interaction

Human computer interaction is shifting from screen-based systems to multimodal interfaces where artificial intelligence powered systems increasingly interpret user intent through speech, gesture, and gaze. Yet users rarely understand how…

Human-Computer Interaction · Computer Science 2026-05-05 Ankur Bhatt , Sven Mayer

Design Generative AI for Practitioners: Exploring Interaction Approaches Aligned with Creative Practice

Design is a non-linear, reflective process in which practitioners engage with visual, semantic, and other expressive materials to explore, iterate, and refine ideas. As Generative AI (GenAI) becomes integrated into professional design…

Human-Computer Interaction · Computer Science 2026-03-04 Xiaohan Peng , Wendy E. Mackay , Janin Koch

User-Driven Value Alignment: Understanding Users' Perceptions and Strategies for Addressing Biased and Discriminatory Statements in AI Companions

Large language model-based AI companions are increasingly viewed by users as friends or romantic partners, leading to deep emotional bonds. However, they can generate biased, discriminatory, and harmful outputs. Recently, users are taking…

Human-Computer Interaction · Computer Science 2025-02-14 Xianzhe Fan , Qing Xiao , Xuhui Zhou , Jiaxin Pei , Maarten Sap , Zhicong Lu , Hong Shen

Co-Constructing Alignment: A Participatory Approach to Situate AI Values

As AI systems become embedded in everyday practice, value misalignment has emerged as a pressing concern. Yet, dominant alignment approaches remain model centric, treating users as passive recipients of prespecified values rather than as…

Human-Computer Interaction · Computer Science 2026-04-22 Anne Arzberger , Enrico Liscio , Maria Luce Lupetti , Inigo Martinez de Rituerto de Troya , Jie Yang

The AI Alignment Paradox

The field of AI alignment aims to steer AI systems toward human goals, preferences, and ethical principles. Its contributions have been instrumental for improving the output quality, safety, and trustworthiness of today's AI models. This…

Artificial Intelligence · Computer Science 2024-11-26 Robert West , Roland Aydin

Beyond Prompts: Learning from Human Communication for Enhanced AI Intent Alignment

AI intent alignment, ensuring that AI produces outcomes as intended by users, is a critical challenge in human-AI interaction. The emergence of generative AI, including LLMs, has intensified the significance of this problem, as interactions…

Human-Computer Interaction · Computer Science 2024-06-21 Yoonsu Kim , Kihoon Son , Seoyoung Kim , Juho Kim

Designing Interfaces for Human-Computer Communication: An On-Going Collection of Considerations

While we do not always use words, communicating what we want to an AI is a conversation -- with ourselves as well as with it, a recurring loop with optional steps depending on the complexity of the situation and our request. Any given…

Human-Computer Interaction · Computer Science 2023-09-06 Elena L. Glassman

Positive Alignment: Artificial Intelligence for Human Flourishing

Existing alignment research is dominated by concerns about safety and preventing harm: safeguards, controllability, and compliance. This paradigm of alignment parallels early psychology's focus on mental illness: necessary but incomplete.…

Artificial Intelligence · Computer Science 2026-05-15 Ruben Laukkonen , Seb Krier , Chloé Bakalar , Shamil Chandaria , Morten Kringelbach , Adam Elwood , Daniel Ford , Fernando Rosas , Maty Bohacek , Matija Franklin , Nenad Tomašev , Stephanie Chan , Verena Rieser , Roma Patel , Michael Levin , Arun Rao

Scopes of Alignment

Much of the research focus on AI alignment seeks to align large language models and other foundation models to the context-less and generic values of helpfulness, harmlessness, and honesty. Frontier model providers also strive to align…

Computers and Society · Computer Science 2025-01-23 Kush R. Varshney , Zahra Ashktorab , Djallel Bouneffouf , Matthew Riemer , Justin D. Weisz

Demanding and Designing Aligned Cognitive Architectures

With AI systems becoming more powerful and pervasive, there is increasing debate about keeping their actions aligned with the broader goals and needs of humanity. This multi-disciplinary and multi-stakeholder debate must resolve many…

Artificial Intelligence · Computer Science 2021-12-21 Koen Holtman