Related papers

Related papers: DirectProbe: Studying Representations without Clas…

200 papers

Probing has become an important tool for analyzing representations in Natural Language Processing (NLP). For graphical NLP tasks such as dependency parsing, linear probes are currently limited to extracting undirected or unlabeled parse…

Computation and Language · Computer Science 2022-03-25 Max Müller-Eberstein, Rob van der Goot, Barbara Plank

The question of what kinds of linguistic information are encoded in different layers of Transformer-based language models is of considerable interest for the NLP community. Existing work, however, has overwhelmingly focused on word-level…

Computation and Language · Computer Science 2023-10-19 Dmitry Nikolaev, Sebastian Padó

In this paper we introduce a method for visually analyzing contextualized embeddings produced by deep neural network-based language models. Our approach is inspired by linguistic probes for natural language processing, where tasks are…

Human-Computer Interaction · Computer Science 2020-09-08 Matthew Berger

Although neural models have achieved impressive results on several NLP benchmarks, little is understood about the mechanisms they use to perform language tasks. Thus, much recent attention has been devoted to analyzing the sentence…

Computation and Language · Computer Science 2021-03-09 Abhilasha Ravichander, Yonatan Belinkov, Eduard Hovy

The success of pre-trained contextualized representations has prompted researchers to analyze them for the presence of linguistic information. Indeed, it is natural to assume that these pre-trained representations do encode some level of…

Computation and Language · Computer Science 2025-08-08 Karolina Stańczak, Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell, Isabelle Augenstein

Large language models are able to learn new tasks in context, where they are provided with instructions and a few annotated examples. However, the effectiveness of in-context learning is dependent on the provided context, and the…

Computation and Language · Computer Science 2023-12-25 Afra Amini, Massimiliano Ciaramita

The probing classifiers framework has been employed for interpreting deep neural network models for a variety of natural language processing (NLP) applications. Studies, however, have largely focused on sentence-level NLP tasks. This work is…

Computation and Language · Computer Science 2023-10-25 Barry Wang, Xinya Du, Claire Cardie

In this work we propose a simple and efficient framework for learning sentence representations from unlabelled data. Drawing inspiration from the distributional hypothesis and recent work on learning sentence representations, we reformulate…

Computation and Language · Computer Science 2018-03-09 Lajanugen Logeswaran, Honglak Lee

Pre-trained contextual representations have led to dramatic performance improvements on a range of downstream tasks. Such performance improvements have motivated researchers to quantify and understand the linguistic information encoded in…

Computation and Language · Computer Science 2022-03-28 Alexander Immer, Lucas Torroba Hennigen, Vincent Fortuin, Ryan Cotterell

Probing is widely used to study which features can be decoded from language model representations. However, the common decoding probe approach has two limitations that we aim to solve with our new encoding probe approach: contributions of…

Computation and Language · Computer Science 2026-05-04 Gaofei Shen, Martijn Bentum, Tom Lentz, Afra Alishahi, Grzegorz Chrupała

Probing experiments investigate the extent to which neural representations make properties -- like part-of-speech -- predictable. One suggests that a representation encodes a property if probing that representation produces higher accuracy…

Computation and Language · Computer Science 2021-09-21 John Hewitt, Kawin Ethayarajh, Percy Liang, Christopher D. Manning

Most modern NLP systems make use of pre-trained contextual representations that attain astonishingly high performance on a variety of tasks. Such high performance should not be possible unless some form of linguistic structure inheres in…

Computation and Language · Computer Science 2020-10-07 Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell

Recent multi-modal contrastive learning models have demonstrated the ability to learn an embedding space suitable for building strong vision classifiers, by leveraging the rich information in large-scale image-caption datasets. Our work…

Machine Learning · Computer Science 2023-02-09 Yuhui Zhang, Jeff Z. HaoChen, Shih-Cheng Huang, Kuan-Chieh Wang, James Zou, Serena Yeung

Interpretability benefits the theoretical understanding of representations. Existing word embeddings are generally dense representations. Hence, the meaning of latent dimensions is difficult to interpret. This makes word embeddings like a…

Computation and Language · Computer Science 2023-06-27 Minxue Xia, Hao Zhu

Distributed representations of meaning are a natural way to encode covariance relationships between words and phrases in NLP. By overcoming data sparsity problems, as well as providing information about semantic relatedness which is not…

Computation and Language · Computer Science 2014-03-21 Karl Moritz Hermann, Phil Blunsom

Contextual word representations derived from pre-trained bidirectional language models (biLMs) have recently been shown to provide significant improvements to the state of the art for a wide range of NLP tasks. However, many questions…

Computation and Language · Computer Science 2018-10-01 Matthew E. Peters, Mark Neumann, Luke Zettlemoyer, Wen-tau Yih

The ability to identify and control different kinds of linguistic information encoded in vector representations of words has many use cases, especially for explainability and bias removal. This is usually done via a set of simple…

Computation and Language · Computer Science 2023-10-25 Tal Levy, Omer Goldman, Reut Tsarfaty

Learning neural program embeddings is key to utilizing deep neural networks in program languages research --- precise and efficient program representations enable the application of deep models to a wide range of program analysis tasks.…

Software Engineering · Computer Science 2019-07-12 Ke Wang, Zhendong Su

Deep neural networks have gained tremendous success in a broad range of machine learning tasks due to their remarkable capability to learn semantic-rich features from high-dimensional data. However, they often require large-scale labelled…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Hu Wang, Guansong Pang, Chunhua Shen, Congbo Ma

Contextual embeddings represent a new generation of semantic representations learned from Neural Language Modelling (NLM) that addresses the issue of meaning conflation hampering traditional word embeddings. In this work, we show that…

Computation and Language · Computer Science 2019-06-25 Daniel Loureiro, Alipio Jorge