Related papers: Demystifying Embedding Spaces using Large Language…

Interpreting Embedding Spaces by Conceptualization

One of the main methods for computational interpretation of a text is mapping it into a vector in some embedding space. Such vectors can then be used for a variety of textual processing tasks. Recently, most embedding spaces are a product…

Computation and Language · Computer Science 2023-11-10 Adi Simhi, Shaul Markovitch

LayerFlow: Layer-wise Exploration of LLM Embeddings using Uncertainty-aware Interlinked Projections

Large language models (LLMs) represent words through contextual word embeddings encoding different language properties like semantics and syntax. Understanding these properties is crucial, especially for researchers investigating language…

Computation and Language · Computer Science 2025-04-16 Rita Sevastjanova, Robin Gerling, Thilo Spinner, Mennatallah El-Assady

LLMs Explain't: A Post-Mortem on Semantic Interpretability in Transformer Models

Large Language Models (LLMs) are becoming increasingly popular in pervasive computing due to their versatility and strong performance. However, despite their ubiquitous use, the exact mechanisms underlying their outstanding performance…

Computation and Language · Computer Science 2026-02-02 Alhassan Abdelhalim, Janick Edinger, Sören Laue, Michaela Regneri

Crafting Interpretable Embeddings by Asking LLMs Questions

Large language models (LLMs) have rapidly improved text embeddings for a growing array of natural-language processing tasks. However, their opaqueness and proliferation into scientific domains such as neuroscience have created a growing…

Computation and Language · Computer Science 2024-05-28 Vinamra Benara, Chandan Singh, John X. Morris, Richard Antonello, Ion Stoica, Alexander G. Huth, Jianfeng Gao

Explaining Multi-modal Large Language Models by Analyzing their Vision Perception

Multi-modal Large Language Models (MLLMs) have demonstrated remarkable capabilities in understanding and generating content across various modalities, such as images and text. However, their interpretability remains a challenge, hindering…

Computer Vision and Pattern Recognition · Computer Science 2024-05-29 Loris Giulivi, Giacomo Boracchi

When Text Embedding Meets Large Language Model: A Comprehensive Survey

Text embedding has become a foundational technology in natural language processing (NLP) during the deep learning era, driving advancements across a wide array of downstream tasks. While many natural language understanding challenges can…

Computation and Language · Computer Science 2025-10-22 Zhijie Nie, Zhangchi Feng, Mingxin Li, Cunwang Zhang, Yanzhao Zhang, Dingkun Long, Richong Zhang

Entity Embeddings: Perspectives Towards an Omni-Modality Era for Large Language Models

Large Language Models (LLMs) are evolving to integrate multiple modalities, such as text, image, and audio into a unified linguistic space. We envision a future direction based on this framework where conceptual entities defined in…

Machine Learning · Computer Science 2023-10-31 Eren Unlu, Unver Ciftci

What's in an embedding? Would a rose by any embedding smell as sweet?

Large Language Models (LLMs) are often criticized for lacking true "understanding" and the ability to "reason" with their knowledge, being seen merely as autocomplete systems. We believe that this assessment might be missing a nuanced…

Artificial Intelligence · Computer Science 2024-06-18 Venkat Venkatasubramanian

Concept Layers: Enhancing Interpretability and Intervenability via LLM Conceptualization

The opaque nature of Large Language Models (LLMs) has led to significant research efforts aimed at enhancing their interpretability, primarily through post-hoc methods. More recent in-hoc approaches, such as Concept Bottleneck Models…

Machine Learning · Computer Science 2025-02-20 Or Raphael Bidusa, Shaul Markovitch

Do LLMs Benefit from User and Item Embeddings in Recommendation Tasks?

Large Language Models (LLMs) have emerged as promising recommendation systems, offering novel ways to model user preferences through generative approaches. However, many existing methods often rely solely on text semantics or incorporate…

Machine Learning · Computer Science 2026-01-09 Mir Rayat Imtiaz Hossain, Leo Feng, Leonid Sigal, Mohamed Osama Ahmed

Data Science with LLMs and Interpretable Models

Recent years have seen important advances in the building of interpretable models, machine learning models that are designed to be easily understood by humans. In this work, we show that large language models (LLMs) are remarkably good at…

Machine Learning · Computer Science 2024-02-23 Sebastian Bordt, Ben Lengerich, Harsha Nori, Rich Caruana

User-LLM: Efficient LLM Contextualization with User Embeddings

Large language models (LLMs) have achieved remarkable success across various domains, but effectively incorporating complex and potentially noisy user timeline data into LLMs remains a challenge. Current approaches often involve translating…

Computation and Language · Computer Science 2024-09-11 Lin Ning, Luyang Liu, Jiaxing Wu, Neo Wu, Devora Berlowitz, Sushant Prakash, Bradley Green, Shawn O'Banion, Jun Xie

EmbedLLM: Learning Compact Representations of Large Language Models

With hundreds of thousands of language models available on Huggingface today, efficiently evaluating and utilizing these models across various downstream tasks has become increasingly critical. Many existing methods repeatedly learn…

Computation and Language · Computer Science 2024-10-18 Richard Zhuang, Tianhao Wu, Zhaojin Wen, Andrew Li, Jiantao Jiao, Kannan Ramchandran

Sampling Latent Material-Property Information From LLM-Derived Embedding Representations

Vector embeddings derived from large language models (LLMs) show promise in capturing latent information from the literature. Interestingly, these can be integrated into material embeddings, potentially useful for data-driven predictions of…

Computation and Language · Computer Science 2024-09-19 Luke P. J. Gilligan, Matteo Cobelli, Hasan M. Sayeed, Taylor D. Sparks, Stefano Sanvito

Rethinking Interpretability in the Era of Large Language Models

Interpretable machine learning has exploded as an area of interest over the last decade, sparked by the rise of increasingly large datasets and deep neural networks. Simultaneously, large language models (LLMs) have demonstrated remarkable…

Computation and Language · Computer Science 2024-02-06 Chandan Singh, Jeevana Priya Inala, Michel Galley, Rich Caruana, Jianfeng Gao

Towards Uncovering How Large Language Model Works: An Explainability Perspective

Large language models (LLMs) have led to breakthroughs in language tasks, yet the internal mechanisms that enable their remarkable generalization and reasoning abilities remain opaque. This lack of transparency presents challenges such as…

Computation and Language · Computer Science 2024-04-17 Haiyan Zhao, Fan Yang, Bo Shen, Himabindu Lakkaraju, Mengnan Du

Prediction is not Explanation: Revisiting the Explanatory Capacity of Mapping Embeddings

Understanding what knowledge is implicitly encoded in deep learning models is essential for improving the interpretability of AI systems. This paper examines common methods to explain the knowledge encoded in word embeddings, which are core…

Computation and Language · Computer Science 2025-08-20 Hanna Herasimchyk, Alhassan Abdelhalim, Sören Laue, Michaela Regneri

A Concept-Based Explainability Framework for Large Multimodal Models

Large multimodal models (LMMs) combine unimodal encoders and large language models (LLMs) to perform multimodal tasks. Despite recent advancements towards the interpretability of these models, understanding internal representations of LMMs…

Machine Learning · Computer Science 2024-12-03 Jayneel Parekh, Pegah Khayatan, Mustafa Shukor, Alasdair Newson, Matthieu Cord

Language Models are Universal Embedders

In the large language model (LLM) revolution, embedding is a key component of various systems, such as retrieving knowledge or memories for LLMs or building content moderation filters. As such cases span from English to other natural or…

Computation and Language · Computer Science 2025-05-23 Xin Zhang, Zehan Li, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang

Representing LLMs in Prompt Semantic Task Space

Large language models (LLMs) achieve impressive results over various tasks, and ever-expanding public repositories contain an abundance of pre-trained models. Therefore, identifying the best-performing LLM for a given task is a significant…

Computation and Language · Computer Science 2025-11-13 Idan Kashani, Avi Mendelson, Yaniv Nemcovsky