Related papers: Getting aligned on representational alignment

200 papers

It has recently been argued that AI models' representations are becoming aligned as their scale and performance increase. Empirical analyses have been designed to support this idea and conjecture the possible alignment of different…

Machine Learning · Computer Science 2025-02-21 Francesco Insulla, Shuo Huang, Lorenzo Rosasco

As robots are increasingly deployed in real-world scenarios, a key question is how to best transfer knowledge learned in one environment to another, where shifting constraints and human preferences render adaptation challenging. A central…

Human-Computer Interaction · Computer Science 2022-05-18 Andreea Bobu, Andi Peng

As AI adoption expands across human society, the problem of aligning AI models to match human preferences remains a grand challenge. Currently, the AI alignment field is deeply divided between behavioral and representational approaches,…

Computers and Society · Computer Science 2025-08-12 Ben Y. Reis, William La Cava

Artificial and biological systems may evolve similar computational solutions despite fundamental differences in architecture and learning mechanisms -- a form of convergent evolution. We demonstrate this phenomenon through large-scale…

Neurons and Cognition · Quantitative Biology 2025-07-04 Guobin Shen, Dongcheng Zhao, Yiting Dong, Qian Zhang, Yi Zeng

The extent to which different biological and artificial neural systems rely on equivalent internal representations to support similar tasks remains a central question in neuroscience and machine learning. Prior work typically compares…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Jialin Wu, Shreya Saha, Yiqing Bo, Meenakshi Khosla

AI alignment research is the field of study dedicated to ensuring that artificial intelligence (AI) benefits humans. As machine intelligence gets more advanced, this research is becoming increasingly important. Researchers in the field…

Computers and Society · Computer Science 2022-06-08 Jan H. Kirchner, Logan Smith, Jacques Thibodeau, Kyle McDonell, Laria Reynolds

Multi-modal learning is a fast growing area in artificial intelligence. It tries to help machines understand complex things by combining information from different sources, like images, text, and audio. By using the strengths of each…

Machine Learning · Computer Science 2025-12-22 Qihang Jin, Enze Ge, Yuhang Xie, Hongying Luo, Junhao Song, Ziqian Bi, Chia Xin Liang, Jibin Guan, Joe Yeong, Xinyuan Song, Junfeng Hao

The field of AI alignment aims to steer AI systems toward human goals, preferences, and ethical principles. Its contributions have been instrumental for improving the output quality, safety, and trustworthiness of today's AI models. This…

Artificial Intelligence · Computer Science 2024-11-26 Robert West, Roland Aydin

Understanding convergent learning -- the degree to which independently trained neural systems -- whether multiple artificial networks or brains and models -- arrive at similar internal representations -- is crucial for both neuroscience and…

Neurons and Cognition · Quantitative Biology 2026-01-26 Chaitanya Kapoor, Sudhanshu Srivastava, Meenakshi Khosla

Complex networks are frequently employed to model physical or virtual complex systems. When certain entities exist across multiple systems simultaneously, unveiling their corresponding relationships across the networks becomes crucial. This…

Physics and Society · Physics 2025-04-16 Rui Tang, Ziyun Yong, Shuyu Jiang, Xingshu Chen, Yaofang Liu, Yi-Cheng Zhang, Gui-Quan Sun, Wei Wang

Biological intelligence is remarkable in its ability to produce complex behaviour in many diverse situations through data efficient, generalisable and transferable skill acquisition. It is believed that learning "good" sensory…

Neurons and Cognition · Quantitative Biology 2022-03-18 Irina Higgins, Sébastien Racanière, Danilo Rezende

Jurisprudence, the study of how judges should properly decide cases, and alignment, the science of getting AI models to conform to human values, share a fundamental structure. These seemingly distant fields both seek to predict and shape…

Artificial Intelligence · Computer Science 2026-05-12 Nicholas Caputo

Deep neural networks have achieved success across a wide range of applications, including as models of human behavior and neural representations in vision tasks. However, neural network training and human learning differ in fundamental…

Computer Vision and Pattern Recognition · Computer Science 2025-09-04 Lukas Muttenthaler, Klaus Greff, Frieda Born, Bernhard Spitzer, Simon Kornblith, Michael C. Mozer, Klaus-Robert Müller, Thomas Unterthiner, Andrew K. Lampinen

Multimodal representation learning is fundamentally about transforming incomparable modalities into comparable representations. While prior research primarily focused on explicitly aligning these representations through targeted learning…

Machine Learning · Computer Science 2025-06-16 Megan Tjandrasuwita, Chanakya Ekbote, Liu Ziyin, Paul Pu Liang

Discussion of AI alignment (alignment between humans and AI systems) has focused on value alignment, broadly referring to creating AI systems that share human values. We argue that before we can even attempt to align values, it is…

Machine Learning · Computer Science 2024-01-18 Sunayana Rane, Polyphony J. Bruna, Ilia Sucholutsky, Christopher Kello, Thomas L. Griffiths

Informatics studies all aspects of the structure of natural and artificial information systems. Theoretical and abstract approaches to information have made great advances, but human information processing is still unmatched in many areas,…

Artificial Intelligence · Computer Science 2021-01-12 Włodzisław Duch

To act in the world, robots rely on a representation of salient task aspects: for example, to carry a coffee mug, a robot may consider movement efficiency or mug orientation in its behavior. However, if we want robots to act for and with…

Robotics · Computer Science 2024-01-30 Andreea Bobu, Andi Peng, Pulkit Agrawal, Julie Shah, Anca D. Dragan

Alignment is a social phenomenon wherein individuals share a common goal or perspective. Mirroring, or mimicking the behaviors and opinions of another individual, is one mechanism by which individuals can become aligned. Large scale…

Multiagent Systems · Computer Science 2025-02-18 Harvey McGuinness, Tianyu Wang, Carey E. Priebe, Hayden Helm

The increasing prevalence of artificial agents creates a correspondingly increasing need to manage disagreements between humans and artificial agents, as well as between artificial agents themselves. Considering this larger space of…

Neurons and Cognition · Quantitative Biology 2023-10-23 Kerem Oktar, Ilia Sucholutsky, Tania Lombrozo, Thomas L. Griffiths

Humans can effortlessly describe what they see, yet establishing a shared representational format between vision and language remains a significant challenge. Emerging evidence suggests that human brain representations in both vision and…

Neurons and Cognition · Quantitative Biology 2025-07-30 Katerina Marie Simkova, Adrien Doerig, Clayton Hickey, Ian Charest