Related papers: A Non-Linear Structural Probe

Hitting "Probe"rty with Non-Linearity, and More

Structural probes learn a linear transformation to find how dependency trees are embedded in the hidden states of language models. This simple design may not allow for full exploitation of the structure of the encoded information. Hence, to…

Computation and Language · Computer Science 2024-02-27 Avik Pal , Madhura Pawar

A Tale of a Probe and a Parser

Measuring what linguistic information is encoded in neural models of language has become popular in NLP. Researchers approach this enterprise by training "probes" - supervised models designed to extract linguistic structure from another…

Computation and Language · Computer Science 2020-05-13 Rowan Hall Maudslay , Josef Valvoda , Tiago Pimentel , Adina Williams , Ryan Cotterell

Introducing Orthogonal Constraint in Structural Probes

With the recent success of pre-trained models in NLP, a significant focus was put on interpreting their representations. One of the most prominent approaches is structural probing (Hewitt and Manning, 2019), where a linear projection of…

Computation and Language · Computer Science 2021-06-25 Tomasz Limisiewicz , David Mareček

Do Syntactic Probes Probe Syntax? Experiments with Jabberwocky Probing

Analysing whether neural language models encode linguistic information has become popular in NLP. One method of doing so, which is frequently cited to support the claim that models like BERT encode syntax, is called probing; probes are…

Computation and Language · Computer Science 2021-06-07 Rowan Hall Maudslay , Ryan Cotterell

Investigating semantic subspaces of Transformer sentence embeddings through linear structural probing

The question of what kinds of linguistic information are encoded in different layers of Transformer-based language models is of considerable interest for the NLP community. Existing work, however, has overwhelmingly focused on word-level…

Computation and Language · Computer Science 2023-10-19 Dmitry Nikolaev , Sebastian Padó

Probing Syntax in Large Language Models: Successes and Remaining Challenges

The syntactic structures of sentences can be readily read-out from the activations of large language models (LLMs). However, the ``structural probes'' that have been developed to reveal this phenomenon are typically evaluated on an…

Computation and Language · Computer Science 2025-08-12 Pablo J. Diego-Simón , Emmanuel Chemla , Jean-Rémi King , Yair Lakretz

Probing the Probing Paradigm: Does Probing Accuracy Entail Task Relevance?

Although neural models have achieved impressive results on several NLP benchmarks, little is understood about the mechanisms they use to perform language tasks. Thus, much recent attention has been devoted to analyzing the sentence…

Computation and Language · Computer Science 2021-03-09 Abhilasha Ravichander , Yonatan Belinkov , Eduard Hovy

INSPECT: Intrinsic and Systematic Probing Evaluation for Code Transformers

Pre-trained models of source code have recently been successfully applied to a wide variety of Software Engineering tasks; they have also seen some practical adoption in practice, e.g. for code completion. Yet, we still know very little…

Software Engineering · Computer Science 2023-12-11 Anjan Karmakar , Romain Robbes

Information-Theoretic Probing for Linguistic Structure

The success of neural networks on a diverse set of NLP tasks has led researchers to question how much these networks actually ``know'' about natural language. Probes are a natural way of assessing this. When probing, a researcher chooses a…

Computation and Language · Computer Science 2020-05-26 Tiago Pimentel , Josef Valvoda , Rowan Hall Maudslay , Ran Zmigrod , Adina Williams , Ryan Cotterell

Do We Really Need GNNs with Explicit Structural Modeling? MLPs Suffice for Language Model Representations

Explicit structural information has been proven to be encoded by Graph Neural Networks (GNNs), serving as auxiliary knowledge to enhance model capabilities and improve performance in downstream NLP tasks. However, recent studies indicate…

Computation and Language · Computer Science 2025-06-30 Li Zhou , Hao Jiang , Junjie Li , Zefeng Zhao , Feng Jiang , Wenyu Chen , Haizhou Li

How to Probe Sentence Embeddings in Low-Resource Languages: On Structural Design Choices for Probing Task Evaluation

Sentence encoders map sentences to real valued vectors for use in downstream applications. To peek into these representations - e.g., to increase interpretability of their results - probing tasks have been designed which query them for…

Computation and Language · Computer Science 2020-10-29 Steffen Eger , Johannes Daxenberger , Iryna Gurevych

Conditional probing: measuring usable information beyond a baseline

Probing experiments investigate the extent to which neural representations make properties -- like part-of-speech -- predictable. One suggests that a representation encodes a property if probing that representation produces higher accuracy…

Computation and Language · Computer Science 2021-09-21 John Hewitt , Kawin Ethayarajh , Percy Liang , Christopher D. Manning

Probing for the Usage of Grammatical Number

A central quest of probing is to uncover how pre-trained models encode a linguistic property within their representations. An encoding, however, might be spurious-i.e., the model might not rely on it when making predictions. In this paper,…

Computation and Language · Computer Science 2024-05-24 Karim Lasri , Tiago Pimentel , Alessandro Lenci , Thierry Poibeau , Ryan Cotterell

Spectral Probing

Linguistic information is encoded at varying timescales (subwords, phrases, etc.) and communicative levels, such as syntax and semantics. Contextualized embeddings have analogously been found to capture these phenomena at distinctive layers…

Computation and Language · Computer Science 2022-10-24 Max Müller-Eberstein , Rob van der Goot , Barbara Plank

Probing Linguistic Features of Sentence-Level Representations in Neural Relation Extraction

Despite the recent progress, little is known about the features captured by state-of-the-art neural relation extraction (RE) models. Common methods encode the source sentence, conditioned on the entity mentions, before classifying the…

Computation and Language · Computer Science 2020-04-20 Christoph Alt , Aleksandra Gabryszak , Leonhard Hennig

Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT

By introducing a small set of additional parameters, a probe learns to solve specific linguistic tasks (e.g., dependency parsing) in a supervised manner using feature representations (e.g., contextualized embeddings). The effectiveness of…

Computation and Language · Computer Science 2021-05-31 Zhiyong Wu , Yun Chen , Ben Kao , Qun Liu

When Does Syntax Mediate Neural Language Model Performance? Evidence from Dropout Probes

Recent causal probing literature reveals when language models and syntactic probes use similar representations. Such techniques may yield "false negative" causality results: models may use representations of syntax, but probes may have…

Computation and Language · Computer Science 2022-04-22 Mycal Tucker , Tiwalayo Eisape , Peng Qian , Roger Levy , Julie Shah

Idioms, Probing and Dangerous Things: Towards Structural Probing for Idiomaticity in Vector Space

The goal of this paper is to learn more about how idiomatic information is structurally encoded in embeddings, using a structural probing method. We repurpose an existing English verbal multi-word expression (MWE) dataset to suit the…

Computation and Language · Computer Science 2023-04-28 Filip Klubička , Vasudevan Nedumpozhimana , John D. Kelleher

A Matter of Framing: The Impact of Linguistic Formalism on Probing Results

Deep pre-trained contextualized encoders like BERT (Delvin et al., 2019) demonstrate remarkable performance on a range of downstream tasks. A recent line of research in probing investigates the linguistic knowledge implicitly learned by…

Computation and Language · Computer Science 2020-05-01 Ilia Kuznetsov , Iryna Gurevych

Examining Cross-lingual Contextual Embeddings with Orthogonal Structural Probes

State-of-the-art contextual embeddings are obtained from large language models available only for a few languages. For others, we need to learn representations using a multilingual model. There is an ongoing debate on whether multilingual…

Computation and Language · Computer Science 2021-09-13 Tomasz Limisiewicz , David Mareček