A computational framework for human values

Nardine Osman; Mark d'Inverno

doi:10.5555/3635637.3663013

A computational framework for human values

Artificial Intelligence 2026-02-09 v2 Computers and Society Multiagent Systems

Authors: Nardine Osman , Mark d'Inverno

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

In the diverse array of work investigating the nature of human values from psychology, philosophy and social sciences, there is a clear consensus that values guide behaviour. More recently, a recognition that values provide a means to engineer ethical AI has emerged. Indeed, Stuart Russell proposed shifting AI's focus away from simply ``intelligence'' towards intelligence ``provably aligned with human values''. This challenge -- the value alignment problem -- with others including an AI's learning of human values, aggregating individual values to groups, and designing computational mechanisms to reason over values, has energised a sustained research effort. Despite this, no formal, computational definition of values has yet been proposed. We address this through a formal conceptual framework rooted in the social sciences, that provides a foundation for the systematic, integrated and interdisciplinary investigation into how human values can support designing ethical AI.

Keywords

ethics artificial intelligence fairness in machine learning

Cite

@article{arxiv.2305.02748,
  title  = {A computational framework for human values},
  author = {Nardine Osman and Mark d'Inverno},
  journal= {arXiv preprint arXiv:2305.02748},
  year   = {2026}
}

A computational framework for human values

Abstract

Keywords

Cite

Related papers