Su-In Lee — Scifaro

Agents that Matter: Optimizing Multi-Agent LLMs via Removal-Based Attribution

As multi-agent systems (MAS) become increasingly complex, identifying the contributions of individual agents is critical for system optimization. However, existing approaches lack a rigorous, unified framework for credit assignment. In this…

Multiagent Systems · Computer Science 2026-05-28 Mingyu Lu , Yushan Huang , Chris Lin , Su-In Lee

Where to Steer: Input-Dependent Layer Selection for Steering Improves LLM Alignment

Steering vectors have emerged as a lightweight and effective approach for aligning large language models (LLMs) at inference time, enabling modulation over model behaviors by shifting LLM representations towards a target behavior. However,…

Machine Learning · Computer Science 2026-04-07 Soham Gadgil , Chris Lin , Su-In Lee

SurrogateSHAP: Training-Free Contributor Attribution for Text-to-Image (T2I) Models

As Text-to-Image (T2I) diffusion models are increasingly used in real-world creative workflows, a principled framework for valuing contributors who provide a collection of data is essential for fair compensation and sustainable data…

Machine Learning · Computer Science 2026-02-02 Mingyu Lu , Soham Gadgil , Chris Lin , Chanwoo Kim , Su-In Lee

Explainable AI for computational pathology identifies model limitations and tissue biomarkers

Deep learning models show promise in digital pathology, but their opaque decision-making processes limit trust and clinical adoption. To address this challenge, we present HIPPO, an explainable AI method for analyzing weakly-supervised…

Tissues and Organs · Quantitative Biology 2025-12-10 Jakub R. Kaczmarzyk , Chanwoo Kim , Soham Gadgil , Deepika Savant , Zhen Zhao , Joel H. Saltz , Su-In Lee , Peter K. Koo

CellCLIP -- Learning Perturbation Effects in Cell Painting via Text-Guided Contrastive Learning

High-content screening (HCS) assays based on high-throughput microscopy techniques such as Cell Painting have enabled the interrogation of cells' morphological responses to perturbations at an unprecedented scale. The collection of such…

Machine Learning · Computer Science 2025-09-25 Mingyu Lu , Ethan Weinberger , Chanwoo Kim , Su-In Lee

Ensembling Sparse Autoencoders

Sparse autoencoders (SAEs) are used to decompose neural network activations into human-interpretable features. Typically, features learned by a single SAE are used for downstream applications. However, it has recently been shown that SAEs…

Machine Learning · Computer Science 2025-05-23 Soham Gadgil , Chris Lin , Su-In Lee

An Efficient Framework for Crediting Data Contributors of Diffusion Models

As diffusion models are deployed in real-world settings, and their performance is driven by training data, appraising the contribution of data contributors is crucial to creating incentives for sharing quality data and to implementing…

Machine Learning · Computer Science 2025-03-05 Chris Lin , Mingyu Lu , Chanwoo Kim , Su-In Lee

Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution

Many tasks in explainable machine learning, such as data valuation and feature attribution, perform expensive computation for each data point and are intractable for large datasets. These methods require efficient approximations, and…

Machine Learning · Computer Science 2024-10-31 Ian Covert , Chanwoo Kim , Su-In Lee , James Zou , Tatsunori Hashimoto

Estimating Conditional Mutual Information for Dynamic Feature Selection

Dynamic feature selection, where we sequentially query features to make accurate predictions with a minimal budget, is a promising paradigm to reduce feature acquisition costs and provide transparency into a model's predictions. The problem…

Machine Learning · Computer Science 2024-09-10 Soham Gadgil , Ian Covert , Su-In Lee

On the Robustness of Removal-Based Feature Attributions

To explain predictions made by complex machine learning models, many feature attribution methods have been developed that assign importance scores to input features. Some recent work challenges the robustness of these methods by showing…

Machine Learning · Computer Science 2023-11-01 Chris Lin , Ian Covert , Su-In Lee

Feature Selection in the Contrastive Analysis Setting

Contrastive analysis (CA) refers to the exploration of variations uniquely enriched in a target dataset as compared to a corresponding background dataset generated from sources of variation that are irrelevant to a given task. For example,…

Machine Learning · Computer Science 2023-10-31 Ethan Weinberger , Ian Covert , Su-In Lee

Contrastive Corpus Attribution for Explaining Representations

Despite the widespread use of unsupervised models, very few methods are designed to explain them. Most explanation methods explain a scalar model output. However, unsupervised models output representation vectors, the elements of which are…

Machine Learning · Computer Science 2023-06-14 Chris Lin , Hugh Chen , Chanwoo Kim , Su-In Lee

Learning to Maximize Mutual Information for Dynamic Feature Selection

Feature selection helps reduce data acquisition costs in ML, but the standard approach is to train models with static feature subsets. Here, we consider the dynamic feature selection (DFS) problem where a model sequentially queries features…

Machine Learning · Computer Science 2023-06-09 Ian Covert , Wei Qiu , Mingyu Lu , Nayoon Kim , Nathan White , Su-In Lee

Learning to Estimate Shapley Values with Vision Transformers

Transformers have become a default architecture in computer vision, but understanding what drives their predictions remains a challenging problem. Current explanation approaches rely on attention values or input gradients, but these provide…

Computer Vision and Pattern Recognition · Computer Science 2023-03-03 Ian Covert , Chanwoo Kim , Su-In Lee

Explaining a Series of Models by Propagating Shapley Values

Local feature attribution methods are increasingly used to explain complex machine learning models. However, current methods are limited because they are extremely expensive to compute or are not capable of explaining a distributed series…

Machine Learning · Computer Science 2022-10-12 Hugh Chen , Scott M. Lundberg , Su-In Lee

Feature Removal Is a Unifying Principle for Model Explanation Methods

Researchers have proposed a wide variety of model explanation approaches, but it remains unclear how most methods are related or when one method is preferable to another. We examine the literature and find that many methods are based on a…

Machine Learning · Computer Science 2022-08-24 Ian Covert , Scott Lundberg , Su-In Lee

Algorithms to estimate Shapley value feature attributions

Feature attributions based on the Shapley value are popular for explaining machine learning models; however, their estimation is complex from both a theoretical and computational standpoint. We disentangle this complexity into two factors:…

Machine Learning · Computer Science 2022-07-18 Hugh Chen , Ian C. Covert , Scott M. Lundberg , Su-In Lee

A Deep Bayesian Bandits Approach for Anticancer Therapy: Exploration via Functional Prior

Learning personalized cancer treatment with machine learning holds great promise to improve cancer patients' chance of survival. Despite recent advances in machine learning and precision oncology, this approach remains challenging as…

Machine Learning · Computer Science 2022-07-12 Mingyu Lu , Yifang Chen , Su-In Lee

Explaining by Removing: A Unified Framework for Model Explanation

Researchers have proposed a wide variety of model explanation approaches, but it remains unclear how most methods are related or when one method is preferable to another. We describe a new unified class of methods, removal-based…

Machine Learning · Computer Science 2022-05-16 Ian Covert , Scott Lundberg , Su-In Lee

FastSHAP: Real-Time Shapley Value Estimation

Shapley values are widely used to explain black-box models, but they are costly to calculate because they require many model evaluations. We introduce FastSHAP, a method for estimating Shapley values in a single forward pass using a learned…

Machine Learning · Statistics 2022-03-24 Neil Jethani , Mukund Sudarshan , Ian Covert , Su-In Lee , Rajesh Ranganath