Related papers: Static Embeddings as Efficient Knowledge Bases?

Efficient Knowledge Probing of Large Language Models by Adapting Pre-trained Embeddings

Large language models (LLMs) acquire knowledge across diverse domains such as science, history, and geography encountered during generative pre-training. However, due to their stochasticity, it is difficult to predict what LLMs have…

Computation and Language · Computer Science 2026-01-27 Kartik Sharma , Yiqiao Jin , Rakshit Trivedi , Srijan Kumar

Multilingual LAMA: Investigating Knowledge in Multilingual Pretrained Language Models

Recently, it has been found that monolingual English language models can be used as knowledge bases. Instead of structural knowledge base queries, masked sentences such as "Paris is the capital of [MASK]" are used as probes. We translate…

Computation and Language · Computer Science 2021-02-02 Nora Kassner , Philipp Dufter , Hinrich Schütze

Breaking Down Word Semantics from Pre-trained Language Models through Layer-wise Dimension Selection

Contextual word embeddings obtained from pre-trained language model (PLM) have proven effective for various natural language processing tasks at the word level. However, interpreting the hidden aspects within embeddings, such as syntax and…

Computation and Language · Computer Science 2023-10-10 Nayoung Choi

Fixed-Length Protein Embeddings using Contextual Lenses

The Basic Local Alignment Search Tool (BLAST) is currently the most popular method for searching databases of biological sequences. BLAST compares sequences via similarity defined by a weighted edit distance, which results in it being…

Biomolecules · Quantitative Biology 2020-10-29 Amir Shanehsazzadeh , David Belanger , David Dohan

Question Answering over Knowledge Base using Language Model Embeddings

Knowledge Base, represents facts about the world, often in some form of subsumption ontology, rather than implicitly, embedded in procedural code, the way a conventional computer program does. While there is a rapid growth in knowledge…

Computation and Language · Computer Science 2020-10-20 Sai Sharath Japa , Rekabdar Banafsheh

Contextual Embeddings: When Are They Worth It?

We study the settings for which deep contextual embeddings (e.g., BERT) give large improvements in performance relative to classic pretrained embeddings (e.g., GloVe), and an even simpler baseline---random word embeddings---focusing on the…

Computation and Language · Computer Science 2020-05-20 Simran Arora , Avner May , Jian Zhang , Christopher Ré

How Can BERT Help Lexical Semantics Tasks?

Contextualized embeddings such as BERT can serve as strong input representations to NLP tasks, outperforming their static embeddings counterparts such as skip-gram, CBOW and GloVe. However, such embeddings are dynamic, calculated according…

Computation and Language · Computer Science 2020-04-07 Yile Wang , Leyang Cui , Yue Zhang

Language Models as Knowledge Bases?

Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the…

Computation and Language · Computer Science 2019-09-05 Fabio Petroni , Tim Rocktäschel , Patrick Lewis , Anton Bakhtin , Yuxiang Wu , Alexander H. Miller , Sebastian Riedel

Can Generative Pre-trained Language Models Serve as Knowledge Bases for Closed-book QA?

Recent work has investigated the interesting question using pre-trained language models (PLMs) as knowledge bases for answering open questions. However, existing work is limited in using small benchmarks with high test-train overlaps. We…

Computation and Language · Computer Science 2021-06-04 Cunxiang Wang , Pai Liu , Yue Zhang

Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases

Previous literatures show that pre-trained masked language models (MLMs) such as BERT can achieve competitive factual knowledge extraction performance on some datasets, indicating that MLMs can potentially be a reliable knowledge source. In…

Computation and Language · Computer Science 2021-06-18 Boxi Cao , Hongyu Lin , Xianpei Han , Le Sun , Lingyong Yan , Meng Liao , Tong Xue , Jin Xu

Probing Pretrained Language Models for Lexical Semantics

The success of large pretrained language models (LMs) such as BERT and RoBERTa has sparked interest in probing their representations, in order to unveil what types of knowledge they implicitly capture. While prior research focused on…

Computation and Language · Computer Science 2020-10-13 Ivan Vulić , Edoardo Maria Ponti , Robert Litschko , Goran Glavaš , Anna Korhonen

GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge

Contextualized embeddings based on large language models (LLMs) are available for various languages, but their coverage is often limited for lower resourced languages. Using LLMs for such languages is often difficult due to a high…

Computation and Language · Computer Science 2025-01-28 Daniil Gurgurov , Rishu Kumar , Simon Ostermann

Pre-training Language Models with Deterministic Factual Knowledge

Previous works show that Pre-trained Language Models (PLMs) can capture factual knowledge. However, some analyses reveal that PLMs fail to perform it robustly, e.g., being sensitive to the changes of prompts when extracting factual…

Computation and Language · Computer Science 2022-10-21 Shaobo Li , Xiaoguang Li , Lifeng Shang , Chengjie Sun , Bingquan Liu , Zhenzhou Ji , Xin Jiang , Qun Liu

MLMLM: Link Prediction with Mean Likelihood Masked Language Model

Knowledge Bases (KBs) are easy to query, verifiable, and interpretable. They however scale with man-hours and high-quality data. Masked Language Models (MLMs), such as BERT, scale with computing power as well as unstructured raw text data.…

Computation and Language · Computer Science 2020-09-16 Louis Clouatre , Philippe Trempe , Amal Zouaq , Sarath Chandar

Probing for Understanding of English Verb Classes and Alternations in Large Pre-trained Language Models

We investigate the extent to which verb alternation classes, as described by Levin (1993), are encoded in the embeddings of Large Pre-trained Language Models (PLMs) such as BERT, RoBERTa, ELECTRA, and DeBERTa using selectively constructed…

Computation and Language · Computer Science 2022-09-13 David K. Yi , James V. Bruno , Jiayu Han , Peter Zukerman , Shane Steinert-Threlkeld

An Exploratory Study on Code Attention in BERT

Many recent models in software engineering introduced deep neural models based on the Transformer architecture or use transformer-based Pre-trained Language Models (PLM) trained on code. Although these models achieve the state of the arts…

Software Engineering · Computer Science 2022-04-22 Rishab Sharma , Fuxiang Chen , Fatemeh Fard , David Lo

Can Language Models Act as Knowledge Bases at Scale?

Large language models (LLMs) have demonstrated remarkable proficiency in understanding and generating responses to complex queries through large-scale pre-training. However, the efficacy of these models in memorizing and reasoning among…

Computation and Language · Computer Science 2024-02-23 Qiyuan He , Yizhong Wang , Wenya Wang

Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models

Pre-trained Language Models (PLMs) are trained on vast unlabeled data, rich in world knowledge. This fact has sparked the interest of the community in quantifying the amount of factual knowledge present in PLMs, as this explains their…

Computation and Language · Computer Science 2023-12-06 Paul Youssef , Osman Alperen Koraş , Meijie Li , Jörg Schlötterer , Christin Seifert

Learning Meta-Embeddings by Using Ensembles of Embedding Sets

Word embeddings -- distributed representations of words -- in deep learning are beneficial for many tasks in natural language processing (NLP). However, different embedding sets vary greatly in quality and characteristics of the captured…

Computation and Language · Computer Science 2015-12-31 Wenpeng Yin , Hinrich Schütze

Explainable Knowledge Tracing via Probabilistic Embeddings and Pattern-based Reasoning

Knowledge Tracing (KT) models students' knowledge states based on learning interactions to predict performance. While deep learning-based KT models have boosted predictive accuracy, most models rely on deterministic vector embeddings and…

Artificial Intelligence · Computer Science 2026-05-12 Siyu Wu , Cong Xu , Wei Zhang