Related papers: Localizing Model Behavior with Path Patching

200 papers

In this report we investigate fundamental requirements for the application of classifier patching on neural networks. Neural network patching is an approach for adapting neural network models to handle concept drift in nonstationary…

Machine Learning · Computer Science 2019-01-17 Sebastian Kauschke, David Hermann Lehmann

Mechanistic interpretability seeks to understand the internal mechanisms of machine learning models, where localization -- identifying the important model components -- is a key step. Activation patching, also known as causal tracing or…

Machine Learning · Computer Science 2024-01-18 Fred Zhang, Neel Nanda

Localization phenomena permeate many branches of physics, playing a fundamental role in dynamical processes evolving on heterogeneous networks. These localization analyses are frequently grounded, for example, on eigenvectors of adjacency or…

Physics and Society · Physics 2020-11-24 Diogo H. Silva, Silvio C. Ferreira

Deep neural networks are successfully used in various applications, but show their vulnerability to adversarial examples. With the development of adversarial patches, the feasibility of attacks in physical scenes increases, and the defenses…

Computer Vision and Pattern Recognition · Computer Science 2023-07-27 Junwen Chen, Xingxing Wei

We investigate the capability of localizing node failures in communication networks from binary states (normal/failed) of end-to-end paths. Given a set of nodes of interest, uniquely localizing failures within this set requires that…

Networking and Internet Architecture · Computer Science 2015-09-22 Liang Ma, Ting He, Ananthram Swami, Don Towsley, Kin K. Leung

Mechanistic interpretability aims to understand model behaviors in terms of specific, interpretable features, often hypothesized to manifest as low-dimensional subspaces of activations. Specifically, recent studies have explored subspace…

Machine Learning · Computer Science 2023-12-07 Aleksandar Makelov, Georg Lange, Neel Nanda

Adaptive cognition requires structured internal models of objects and their relations. Predictive neural networks are often proposed to learn such world models, but how these are instantiated and how they support prediction remain unclear.…

Machine Learning · Computer Science 2026-05-11 Linda Ariel Ventura, Victoria Bosch, Tim C Kietzmann, Sushrut Thorat

We propose a method to investigate modular structure in networks based on a fitted probabilistic model, where the connection probability between nodes is related to a set of introduced local attributes. The attributes, as parameters of the…

Physics and Society · Physics 2009-07-03 Xiaofeng Gong, C.-H. Lai

In the scenario of one/multi-shot learning, conventional end-to-end learning strategies without sufficient supervision are usually not powerful enough to learn correct patterns from noisy signals. Thus, given a CNN pre-trained for object…

Computer Vision and Pattern Recognition · Computer Science 2017-11-23 Quanshi Zhang, Ruiming Cao, Shengming Zhang, Mark Redmonds, Ying Nian Wu, Song-Chun Zhu

Current approaches for fixing systematic problems in NLP models (e.g. regex patches, finetuning on more data) are either brittle, or labor-intensive and liable to shortcuts. In contrast, humans often provide corrections to each other…

Computation and Language · Computer Science 2022-11-22 Shikhar Murty, Christopher D. Manning, Scott Lundberg, Marco Tulio Ribeiro

Protein subcellular localization is an important factor in normal cellular processes and disease. While many protein localization resources treat it as static, protein localization is dynamic and heavily influenced by biological context.…

Molecular Networks · Quantitative Biology 2022-12-13 Chris S. Magnano, Anthony Gitter

Localized patterns are coherent structures embedded in a quiescent state and occur in both discrete and continuous media across a wide range of applications. While it is well-understood how domain covering patterns (for example stripes and…

Pattern Formation and Solitons · Physics 2025-03-19 Jason J. Bramburger, Dan J. Hill, David J. B. Lloyd

Most research concerning the influence of network structure on phenomena taking place on the network focuses on relationships between global statistics of the network structure and characteristic properties of those phenomena, even though…

Social and Information Networks · Computer Science 2012-03-07 Tomoyuki Yuasa, Susumu Shirayama

This paper addresses the problem of bearing-based network localization, which aims to localize all the nodes in a static network given the locations of a subset of nodes termed anchors and inter-node bearings measured in a common reference…

Optimization and Control · Mathematics 2016-02-23 Shiyu Zhao, Daniel Zelazo

The performance of neural network models deteriorates due to their unreliable behavior on non-robust features of corrupted samples. Owing to their opaque nature, rectifying models to address this problem often necessitates arduous data…

Machine Learning · Computer Science 2026-03-18 Peiyu Yang, Naveed Akhtar, Jiantong Jiang, Ajmal Mian

Humans routinely retrace paths in a novel environment both forwards and backwards despite uncertainty in their motion. This paper presents an approach for doing so. Given a demonstration of a path, a first network generates a path…

Computer Vision and Pattern Recognition · Computer Science 2018-12-04 Ashish Kumar, Saurabh Gupta, David Fouhey, Sergey Levine, Jitendra Malik

In graph theory and its practical networking applications, e.g., telecommunications and transportation, the problem of finding paths has particular importance. Selecting paths requires giving scores to the alternative solutions to drive a…

Networking and Internet Architecture · Computer Science 2025-11-10 Giovanni Fiaschi, Carlo Vitucci, Thomas Westerbäck, Daniel Sundmark, Thomas Nolte

The classification of time-series data is pivotal for streaming data and comes with many challenges. Although the amount of publicly available datasets increases rapidly, deep neural models are only exploited in a few areas. Traditional…

Machine Learning · Computer Science 2021-09-27 Dominique Mercier, Andreas Dengel, Sheraz Ahmed

The link prediction problem has become increasingly prominent in many domains such as social network analysis, bioinformatics experiments, transportation networks, criminal investigations and so forth. A variety of techniques has been developed…

Artificial Intelligence · Computer Science 2023-05-18 Safiye Ghasemi, Amin Zarei

A neural network is locally specialized to the extent that parts of its computational graph (i.e. structure) can be abstractly represented as performing some comprehensible sub-task relevant to the overall task (i.e. functionality). Are…

Machine Learning · Computer Science 2022-02-09 Shlomi Hod, Daniel Filan, Stephen Casper, Andrew Critch, Stuart Russell