Related papers: Getting aligned on representational alignment

Towards a Learning Theory of Representation Alignment

It has recently been argued that AI models' representations are becoming aligned as their scale and performance increase. Empirical analyses have been designed to support this idea and conjecture the possible alignment of different…

Machine Learning · Computer Science 2025-02-21 Francesco Insulla , Shuo Huang , Lorenzo Rosasco

Aligning Robot Representations with Humans

As robots are increasingly deployed in real-world scenarios, a key question is how to best transfer knowledge learned in one environment to another, where shifting constraints and human preferences render adaptation challenging. A central…

Human-Computer Interaction · Computer Science 2022-05-18 Andreea Bobu , Andi Peng

Towards Integrated Alignment

As AI adoption expands across human society, the problem of aligning AI models to match human preferences remains a grand challenge. Currently, the AI alignment field is deeply divided between behavioral and representational approaches,…

Computers and Society · Computer Science 2025-08-12 Ben Y. Reis , William La Cava

Alignment between Brains and AI: Evidence for Convergent Evolution across Modalities, Scales and Training Trajectories

Artificial and biological systems may evolve similar computational solutions despite fundamental differences in architecture and learning mechanisms -- a form of convergent evolution. We demonstrate this phenomenon through large-scale…

Neurons and Cognition · Quantitative Biology 2025-07-04 Guobin Shen , Dongcheng Zhao , Yiting Dong , Qian Zhang , Yi Zeng

Comparing and Integrating Different Notions of Representational Correspondence in Neural Systems

The extent to which different biological and artificial neural systems rely on equivalent internal representations to support similar tasks remains a central question in neuroscience and machine learning. Prior work typically compares…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Jialin Wu , Shreya Saha , Yiqing Bo , Meenakshi Khosla

Researching Alignment Research: Unsupervised Analysis

AI alignment research is the field of study dedicated to ensuring that artificial intelligence (AI) benefits humans. As machine intelligence gets more advanced, this research is becoming increasingly important. Researchers in the field…

Computers and Society · Computer Science 2022-06-08 Jan H. Kirchner , Logan Smith , Jacques Thibodeau , Kyle McDonell , Laria Reynolds

Multimodal Representation Learning and Fusion

Multi-modal learning is a fast growing area in artificial intelligence. It tries to help machines understand complex things by combining information from different sources, like images, text, and audio. By using the strengths of each…

Machine Learning · Computer Science 2025-12-22 Qihang Jin , Enze Ge , Yuhang Xie , Hongying Luo , Junhao Song , Ziqian Bi , Chia Xin Liang , Jibin Guan , Joe Yeong , Xinyuan Song , Junfeng Hao

The AI Alignment Paradox

The field of AI alignment aims to steer AI systems toward human goals, preferences, and ethical principles. Its contributions have been instrumental for improving the output quality, safety, and trustworthiness of today's AI models. This…

Artificial Intelligence · Computer Science 2024-11-26 Robert West , Roland Aydin

Bridging Critical Gaps in Convergent Learning: How Representational Alignment Evolves Across Layers, Training, and Distribution Shifts

Understanding convergent learning -- the degree to which independently trained neural systems -- whether multiple artificial networks or brains and models -- arrive at similar internal representations -- is crucial for both neuroscience and…

Neurons and Cognition · Quantitative Biology 2026-01-26 Chaitanya Kapoor , Sudhanshu Srivastava , Meenakshi Khosla

Network Alignment

Complex networks are frequently employed to model physical or virtual complex systems. When certain entities exist across multiple systems simultaneously, unveiling their corresponding relationships across the networks becomes crucial. This…

Physics and Society · Physics 2025-04-16 Rui Tang , Ziyun Yong , Shuyu Jiang , Xingshu Chen , Yaofang Liu , Yi-Cheng Zhang , Gui-Quan Sun , Wei Wang

Symmetry-Based Representations for Artificial and Biological General Intelligence

Biological intelligence is remarkable in its ability to produce complex behaviour in many diverse situations through data efficient, generalisable and transferable skill acquisition. It is believed that learning "good" sensory…

Neurons and Cognition · Quantitative Biology 2022-03-18 Irina Higgins , Sébastien Racanière , Danilo Rezende

Alignment as Jurisprudence

Jurisprudence, the study of how judges should properly decide cases, and alignment, the science of getting AI models to conform to human values, share a fundamental structure. These seemingly distant fields both seek to predict and shape…

Artificial Intelligence · Computer Science 2026-05-12 Nicholas Caputo

Aligning Machine and Human Visual Representations across Abstraction Levels

Deep neural networks have achieved success across a wide range of applications, including as models of human behavior and neural representations in vision tasks. However, neural network training and human learning differ in fundamental…

Computer Vision and Pattern Recognition · Computer Science 2025-09-04 Lukas Muttenthaler , Klaus Greff , Frieda Born , Bernhard Spitzer , Simon Kornblith , Michael C. Mozer , Klaus-Robert Müller , Thomas Unterthiner , Andrew K. Lampinen

Understanding the Emergence of Multimodal Representation Alignment

Multimodal representation learning is fundamentally about transforming incomparable modalities into comparable representations. While prior research primarily focused on explicitly aligning these representations through targeted learning…

Machine Learning · Computer Science 2025-06-16 Megan Tjandrasuwita , Chanakya Ekbote , Liu Ziyin , Paul Pu Liang

Concept Alignment

Discussion of AI alignment (alignment between humans and AI systems) has focused on value alignment, broadly referring to creating AI systems that share human values. We argue that before we can even attempt to align values, it is…

Machine Learning · Computer Science 2024-01-18 Sunayana Rane , Polyphony J. Bruna , Ilia Sucholutsky , Christopher Kello , Thomas L. Griffiths

Neurocognitive Informatics Manifesto

Informatics studies all aspects of the structure of natural and artificial information systems. Theoretical and abstract approaches to information have made great advances, but human information processing is still unmatched in many areas,…

Artificial Intelligence · Computer Science 2021-01-12 Włodzisław Duch

Aligning Robot and Human Representations

To act in the world, robots rely on a representation of salient task aspects: for example, to carry a coffee mug, a robot may consider movement efficiency or mug orientation in its behavior. However, if we want robots to act for and with…

Robotics · Computer Science 2024-01-30 Andreea Bobu , Andi Peng , Pulkit Agrawal , Julie Shah , Anca D. Dragan

Investigating social alignment via mirroring in a system of interacting language models

Alignment is a social phenomenon wherein individuals share a common goal or perspective. Mirroring, or mimicking the behaviors and opinions of another individual, is one mechanism by which individuals can become aligned. Large scale…

Multiagent Systems · Computer Science 2025-02-18 Harvey McGuinness , Tianyu Wang , Carey E. Priebe , Hayden Helm

Dimensions of Disagreement: Unpacking Divergence and Misalignment in Cognitive Science and Artificial Intelligence

The increasing prevalence of artificial agents creates a correspondingly increasing need to manage disagreements between humans and artificial agents, as well as between artificial agents themselves. Considering this larger space of…

Neurons and Cognition · Quantitative Biology 2023-10-23 Kerem Oktar , Ilia Sucholutsky , Tania Lombrozo , Thomas L. Griffiths

Representations in vision and language converge in a shared, multidimensional space of perceived similarities

Humans can effortlessly describe what they see, yet establishing a shared representational format between vision and language remains a significant challenge. Emerging evidence suggests that human brain representations in both vision and…

Neurons and Cognition · Quantitative Biology 2025-07-30 Katerina Marie Simkova , Adrien Doerig , Clayton Hickey , Ian Charest