Related papers: Static Embeddings as Efficient Knowledge Bases?

200 papers

Large language models (LLMs) acquire knowledge across diverse domains such as science, history, and geography encountered during generative pre-training. However, due to their stochasticity, it is difficult to predict what LLMs have…

Computation and Language · Computer Science 2026-01-27 Kartik Sharma , Yiqiao Jin , Rakshit Trivedi , Srijan Kumar

Recently, it has been found that monolingual English language models can be used as knowledge bases. Instead of structural knowledge base queries, masked sentences such as "Paris is the capital of [MASK]" are used as probes. We translate…

Computation and Language · Computer Science 2021-02-02 Nora Kassner , Philipp Dufter , Hinrich Schütze

Contextual word embeddings obtained from pre-trained language model (PLM) have proven effective for various natural language processing tasks at the word level. However, interpreting the hidden aspects within embeddings, such as syntax and…

Computation and Language · Computer Science 2023-10-10 Nayoung Choi

The Basic Local Alignment Search Tool (BLAST) is currently the most popular method for searching databases of biological sequences. BLAST compares sequences via similarity defined by a weighted edit distance, which results in it being…

Biomolecules · Quantitative Biology 2020-10-29 Amir Shanehsazzadeh , David Belanger , David Dohan

A knowledge base represents facts about the world explicitly, often in some form of subsumption ontology, rather than implicitly, embedded in procedural code, the way a conventional computer program does. While there is a rapid growth in knowledge…

Computation and Language · Computer Science 2020-10-20 Sai Sharath Japa , Rekabdar Banafsheh

We study the settings for which deep contextual embeddings (e.g., BERT) give large improvements in performance relative to classic pretrained embeddings (e.g., GloVe), and an even simpler baseline---random word embeddings---focusing on the…

Computation and Language · Computer Science 2020-05-20 Simran Arora , Avner May , Jian Zhang , Christopher Ré

Contextualized embeddings such as BERT can serve as strong input representations to NLP tasks, outperforming their static embeddings counterparts such as skip-gram, CBOW and GloVe. However, such embeddings are dynamic, calculated according…

Computation and Language · Computer Science 2020-04-07 Yile Wang , Leyang Cui , Yue Zhang

Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the…

Computation and Language · Computer Science 2019-09-05 Fabio Petroni , Tim Rocktäschel , Patrick Lewis , Anton Bakhtin , Yuxiang Wu , Alexander H. Miller , Sebastian Riedel

Recent work has investigated the interesting question of using pre-trained language models (PLMs) as knowledge bases for answering open questions. However, existing work is limited in using small benchmarks with high test-train overlaps. We…

Computation and Language · Computer Science 2021-06-04 Cunxiang Wang , Pai Liu , Yue Zhang

Previous literature shows that pre-trained masked language models (MLMs) such as BERT can achieve competitive factual knowledge extraction performance on some datasets, indicating that MLMs can potentially be a reliable knowledge source. In…

Computation and Language · Computer Science 2021-06-18 Boxi Cao , Hongyu Lin , Xianpei Han , Le Sun , Lingyong Yan , Meng Liao , Tong Xue , Jin Xu

The success of large pretrained language models (LMs) such as BERT and RoBERTa has sparked interest in probing their representations, in order to unveil what types of knowledge they implicitly capture. While prior research focused on…

Computation and Language · Computer Science 2020-10-13 Ivan Vulić , Edoardo Maria Ponti , Robert Litschko , Goran Glavaš , Anna Korhonen

Contextualized embeddings based on large language models (LLMs) are available for various languages, but their coverage is often limited for lower resourced languages. Using LLMs for such languages is often difficult due to a high…

Computation and Language · Computer Science 2025-01-28 Daniil Gurgurov , Rishu Kumar , Simon Ostermann

Previous works show that Pre-trained Language Models (PLMs) can capture factual knowledge. However, some analyses reveal that PLMs fail to perform it robustly, e.g., being sensitive to the changes of prompts when extracting factual…

Computation and Language · Computer Science 2022-10-21 Shaobo Li , Xiaoguang Li , Lifeng Shang , Chengjie Sun , Bingquan Liu , Zhenzhou Ji , Xin Jiang , Qun Liu

Knowledge Bases (KBs) are easy to query, verifiable, and interpretable. They however scale with man-hours and high-quality data. Masked Language Models (MLMs), such as BERT, scale with computing power as well as unstructured raw text data.…

Computation and Language · Computer Science 2020-09-16 Louis Clouatre , Philippe Trempe , Amal Zouaq , Sarath Chandar

We investigate the extent to which verb alternation classes, as described by Levin (1993), are encoded in the embeddings of Large Pre-trained Language Models (PLMs) such as BERT, RoBERTa, ELECTRA, and DeBERTa using selectively constructed…

Computation and Language · Computer Science 2022-09-13 David K. Yi , James V. Bruno , Jiayu Han , Peter Zukerman , Shane Steinert-Threlkeld

Many recent models in software engineering introduced deep neural models based on the Transformer architecture or use transformer-based Pre-trained Language Models (PLM) trained on code. Although these models achieve the state of the arts…

Software Engineering · Computer Science 2022-04-22 Rishab Sharma , Fuxiang Chen , Fatemeh Fard , David Lo

Large language models (LLMs) have demonstrated remarkable proficiency in understanding and generating responses to complex queries through large-scale pre-training. However, the efficacy of these models in memorizing and reasoning among…

Computation and Language · Computer Science 2024-02-23 Qiyuan He , Yizhong Wang , Wenya Wang

Pre-trained Language Models (PLMs) are trained on vast unlabeled data, rich in world knowledge. This fact has sparked the interest of the community in quantifying the amount of factual knowledge present in PLMs, as this explains their…

Computation and Language · Computer Science 2023-12-06 Paul Youssef , Osman Alperen Koraş , Meijie Li , Jörg Schlötterer , Christin Seifert

Word embeddings -- distributed representations of words -- in deep learning are beneficial for many tasks in natural language processing (NLP). However, different embedding sets vary greatly in quality and characteristics of the captured…

Computation and Language · Computer Science 2015-12-31 Wenpeng Yin , Hinrich Schütze

Knowledge Tracing (KT) models students' knowledge states based on learning interactions to predict performance. While deep learning-based KT models have boosted predictive accuracy, most models rely on deterministic vector embeddings and…

Artificial Intelligence · Computer Science 2026-05-12 Siyu Wu , Cong Xu , Wei Zhang