English
Related papers

Related papers: Finding Structural Knowledge in Multimodal-BERT

200 papers

State-of-the-art contextual embeddings are obtained from large language models available only for a few languages. For others, we need to learn representations using a multilingual model. There is an ongoing debate on whether multilingual…

Computation and Language · Computer Science 2021-09-13 Tomasz Limisiewicz , David Mareček

Recent work has found evidence that Multilingual BERT (mBERT), a transformer-based multilingual masked language model, is capable of zero-shot cross-lingual transfer, suggesting that some aspects of its representations are shared…

Computation and Language · Computer Science 2020-05-21 Ethan A. Chi , John Hewitt , Christopher D. Manning

Learning representations that accurately model semantics is an important goal of natural language processing research. Many semantic phenomena depend on syntactic structure. Recent work examines the extent to which state-of-the-art models…

Computation and Language · Computer Science 2019-08-28 Geoff Bacon , Terry Regier

Current approaches to learning semantic representations of sentences often use prior word-level knowledge. The current study aims to leverage visual information in order to capture sentence level semantics without the need for word…

Computation and Language · Computer Science 2019-09-25 Danny Merkx , Stefan Frank

Recent works have demonstrated that multilingual BERT (mBERT) learns rich cross-lingual representations, that allow for transfer across languages. We study the word-level translation information embedded in mBERT and present two simple…

Computation and Language · Computer Science 2020-10-19 Hila Gonen , Shauli Ravfogel , Yanai Elazar , Yoav Goldberg

Sentence embeddings induced with various transformer architectures encode much semantic and syntactic information in a distributed manner in a one-dimensional array. We investigate whether specific grammatical information can be accessed in…

Computation and Language · Computer Science 2023-12-18 Vivi Nastase , Paola Merlo

Language models have become nearly ubiquitous in natural language processing applications achieving state-of-the-art results in many tasks including prosody. As the model design does not define predetermined linguistic targets during…

Computation and Language · Computer Science 2023-04-26 Sofoklis Kakouros , Johannah O'Mahony

This work describes experiments which probe the hidden representations of several BERT-style models for morphological content. The goal is to examine the extent to which discrete linguistic structure, in the form of morphological features…

Computation and Language · Computer Science 2020-04-08 Daniel Edmiston

Models based on the transformer architecture, such as BERT, have marked a crucial step forward in the field of Natural Language Processing. Importantly, they allow the creation of word embeddings that capture important semantic information…

Computation and Language · Computer Science 2021-01-01 Jacob Turton , David Vinson , Robert Elliott Smith

One of the most remarkable properties of word embeddings is the fact that they capture certain types of semantic and syntactic relationships. Recently, pre-trained language models such as BERT have achieved groundbreaking results across a…

Computation and Language · Computer Science 2019-12-02 Zied Bouraoui , Jose Camacho-Collados , Steven Schockaert

While lexico-semantic elements no doubt capture a large amount of linguistic information, it has been argued that they do not capture all information contained in text. This assumption is central to constructionist approaches to language…

Computation and Language · Computer Science 2020-11-10 Harish Tayyar Madabushi , Laurence Romain , Dagmar Divjak , Petar Milin

In this paper we propose to learn a multimodal image and text embedding from Web and Social Media data, aiming to leverage the semantic knowledge learnt in the text domain and transfer it to a visual model for semantic image retrieval. We…

Computer Vision and Pattern Recognition · Computer Science 2018-08-21 Raul Gomez , Lluis Gomez , Jaume Gibert , Dimosthenis Karatzas

Multi-modal word semantics aims to enhance embeddings with perceptual input, assuming that human meaning representation is grounded in sensory experience. Most research focuses on evaluation involving direct visual input, however, visual…

Computation and Language · Computer Science 2021-10-07 Anita L. Verő , Ann Copestake

The goal of this paper is to learn more about how idiomatic information is structurally encoded in embeddings, using a structural probing method. We repurpose an existing English verbal multi-word expression (MWE) dataset to suit the…

Computation and Language · Computer Science 2023-04-28 Filip Klubička , Vasudevan Nedumpozhimana , John D. Kelleher

The ability to model intra-modal and inter-modal interactions is fundamental in multimodal machine learning. The current state-of-the-art models usually adopt deep learning models with fixed structures. They can achieve exceptional…

Computer Vision and Pattern Recognition · Computer Science 2023-06-27 Qingpei Guo , Kaisheng Yao , Wei Chu

Representing the semantics of words is a long-standing problem for the natural language processing community. Most methods compute word semantics given their textual context in large corpora. More recently, researchers attempted to…

Computation and Language · Computer Science 2017-11-10 Éloi Zablocki , Benjamin Piwowarski , Laure Soulier , Patrick Gallinari

In this paper, we explore the learning of neural network embeddings for natural images and speech waveforms describing the content of those images. These embeddings are learned directly from the waveforms without the use of linguistic…

Computation and Language · Computer Science 2018-04-10 David Harwath , Galen Chuang , James Glass

We introduce an extensive dataset for multilingual probing of morphological information in language models (247 tasks across 42 languages from 10 families), each consisting of a sentence with a target word and a morphological tag as the…

Computation and Language · Computer Science 2024-11-20 Judit Acs , Endre Hamerlik , Roy Schwartz , Noah A. Smith , Andras Kornai

While large language models like BERT demonstrate strong empirical performance on semantic tasks, whether this reflects true conceptual competence or surface-level statistical association remains unclear. I investigate whether BERT encodes…

Computation and Language · Computer Science 2025-06-16 Cole Gawin

Recently, the bidirectional encoder representations from transformers (BERT) model has attracted much attention in the field of natural language processing, owing to its high performance in language understanding-related tasks. The BERT…

Machine Learning · Computer Science 2020-04-16 Kazuki Miyazawa , Tatsuya Aoki , Takato Horii , Takayuki Nagai
‹ Prev 1 2 3 10 Next ›