Related papers: A Non-Linear Structural Probe

200 papers

Structural probes learn a linear transformation to find how dependency trees are embedded in the hidden states of language models. This simple design may not allow for full exploitation of the structure of the encoded information. Hence, to…

Computation and Language · Computer Science · 2024-02-27 · Avik Pal, Madhura Pawar

Measuring what linguistic information is encoded in neural models of language has become popular in NLP. Researchers approach this enterprise by training "probes" - supervised models designed to extract linguistic structure from another…

Computation and Language · Computer Science · 2020-05-13 · Rowan Hall Maudslay, Josef Valvoda, Tiago Pimentel, Adina Williams, Ryan Cotterell

With the recent success of pre-trained models in NLP, a significant focus was put on interpreting their representations. One of the most prominent approaches is structural probing (Hewitt and Manning, 2019), where a linear projection of…

Computation and Language · Computer Science · 2021-06-25 · Tomasz Limisiewicz, David Mareček

Analysing whether neural language models encode linguistic information has become popular in NLP. One method of doing so, which is frequently cited to support the claim that models like BERT encode syntax, is called probing; probes are…

Computation and Language · Computer Science · 2021-06-07 · Rowan Hall Maudslay, Ryan Cotterell

The question of what kinds of linguistic information are encoded in different layers of Transformer-based language models is of considerable interest for the NLP community. Existing work, however, has overwhelmingly focused on word-level…

Computation and Language · Computer Science · 2023-10-19 · Dmitry Nikolaev, Sebastian Padó

The syntactic structures of sentences can be readily read-out from the activations of large language models (LLMs). However, the "structural probes" that have been developed to reveal this phenomenon are typically evaluated on an…

Computation and Language · Computer Science · 2025-08-12 · Pablo J. Diego-Simón, Emmanuel Chemla, Jean-Rémi King, Yair Lakretz

Although neural models have achieved impressive results on several NLP benchmarks, little is understood about the mechanisms they use to perform language tasks. Thus, much recent attention has been devoted to analyzing the sentence…

Computation and Language · Computer Science · 2021-03-09 · Abhilasha Ravichander, Yonatan Belinkov, Eduard Hovy

Pre-trained models of source code have recently been successfully applied to a wide variety of Software Engineering tasks; they have also seen some practical adoption, e.g. for code completion. Yet, we still know very little…

Software Engineering · Computer Science · 2023-12-11 · Anjan Karmakar, Romain Robbes

The success of neural networks on a diverse set of NLP tasks has led researchers to question how much these networks actually "know" about natural language. Probes are a natural way of assessing this. When probing, a researcher chooses a…

Computation and Language · Computer Science · 2020-05-26 · Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay, Ran Zmigrod, Adina Williams, Ryan Cotterell

Explicit structural information has been proven to be encoded by Graph Neural Networks (GNNs), serving as auxiliary knowledge to enhance model capabilities and improve performance in downstream NLP tasks. However, recent studies indicate…

Computation and Language · Computer Science · 2025-06-30 · Li Zhou, Hao Jiang, Junjie Li, Zefeng Zhao, Feng Jiang, Wenyu Chen, Haizhou Li

Sentence encoders map sentences to real valued vectors for use in downstream applications. To peek into these representations - e.g., to increase interpretability of their results - probing tasks have been designed which query them for…

Computation and Language · Computer Science · 2020-10-29 · Steffen Eger, Johannes Daxenberger, Iryna Gurevych

Probing experiments investigate the extent to which neural representations make properties -- like part-of-speech -- predictable. One suggests that a representation encodes a property if probing that representation produces higher accuracy…

Computation and Language · Computer Science · 2021-09-21 · John Hewitt, Kawin Ethayarajh, Percy Liang, Christopher D. Manning

A central quest of probing is to uncover how pre-trained models encode a linguistic property within their representations. An encoding, however, might be spurious, i.e., the model might not rely on it when making predictions. In this paper,…

Computation and Language · Computer Science · 2024-05-24 · Karim Lasri, Tiago Pimentel, Alessandro Lenci, Thierry Poibeau, Ryan Cotterell

Linguistic information is encoded at varying timescales (subwords, phrases, etc.) and communicative levels, such as syntax and semantics. Contextualized embeddings have analogously been found to capture these phenomena at distinctive layers…

Computation and Language · Computer Science · 2022-10-24 · Max Müller-Eberstein, Rob van der Goot, Barbara Plank

Despite the recent progress, little is known about the features captured by state-of-the-art neural relation extraction (RE) models. Common methods encode the source sentence, conditioned on the entity mentions, before classifying the…

Computation and Language · Computer Science · 2020-04-20 · Christoph Alt, Aleksandra Gabryszak, Leonhard Hennig

By introducing a small set of additional parameters, a probe learns to solve specific linguistic tasks (e.g., dependency parsing) in a supervised manner using feature representations (e.g., contextualized embeddings). The effectiveness of…

Computation and Language · Computer Science · 2021-05-31 · Zhiyong Wu, Yun Chen, Ben Kao, Qun Liu

Recent causal probing literature reveals when language models and syntactic probes use similar representations. Such techniques may yield "false negative" causality results: models may use representations of syntax, but probes may have…

Computation and Language · Computer Science · 2022-04-22 · Mycal Tucker, Tiwalayo Eisape, Peng Qian, Roger Levy, Julie Shah

The goal of this paper is to learn more about how idiomatic information is structurally encoded in embeddings, using a structural probing method. We repurpose an existing English verbal multi-word expression (MWE) dataset to suit the…

Computation and Language · Computer Science · 2023-04-28 · Filip Klubička, Vasudevan Nedumpozhimana, John D. Kelleher

Deep pre-trained contextualized encoders like BERT (Devlin et al., 2019) demonstrate remarkable performance on a range of downstream tasks. A recent line of research in probing investigates the linguistic knowledge implicitly learned by…

Computation and Language · Computer Science · 2020-05-01 · Ilia Kuznetsov, Iryna Gurevych

State-of-the-art contextual embeddings are obtained from large language models available only for a few languages. For others, we need to learn representations using a multilingual model. There is an ongoing debate on whether multilingual…

Computation and Language · Computer Science · 2021-09-13 · Tomasz Limisiewicz, David Mareček