Related papers: Instilling Type Knowledge in Language Models via M…

SKILL: Structured Knowledge Infusion for Large Language Models

Large language models (LLMs) have demonstrated human-level performance on a vast spectrum of natural language tasks. However, it is largely unexplored whether they can better internalize knowledge from a structured data, such as a knowledge…

Computation and Language · Computer Science 2022-05-18 Fedor Moiseev , Zhe Dong , Enrique Alfonseca , Martin Jaggi

Improving Multilingual Named Entity Recognition with Wikipedia Entity Type Mapping

The state-of-the-art named entity recognition (NER) systems are statistical machine learning models that have strong generalization capability (i.e., can recognize unseen entities that do not appear in training data) based on lexical and…

Computation and Language · Computer Science 2019-11-04 Jian Ni , Radu Florian

Entity Type Prediction in Knowledge Graphs using Embeddings

Open Knowledge Graphs (such as DBpedia, Wikidata, YAGO) have been recognized as the backbone of diverse applications in the field of data mining and information retrieval. Hence, the completeness and correctness of the Knowledge Graphs…

Computation and Language · Computer Science 2020-05-07 Russa Biswas , Radina Sofronova , Mehwish Alam , Harald Sack

Wiki-TabNER: Integrating Named Entity Recognition into Wikipedia Tables

Interest in solving table interpretation tasks has grown over the years, yet it still relies on existing datasets that may be overly simplified. This is potentially reducing the effectiveness of the dataset for thorough evaluation and…

Artificial Intelligence · Computer Science 2025-05-05 Aneta Koleva , Martin Ringsquandl , Ahmed Hatem , Thomas Runkler , Volker Tresp

Embedded Topic Models Enhanced by Wikification

Topic modeling analyzes a collection of documents to learn meaningful patterns of words. However, previous topic models consider only the spelling of words and do not take into consideration the homography of words. In this study, we…

Computation and Language · Computer Science 2024-10-04 Takashi Shibuya , Takehito Utsuro

Statistical and Neural Methods for Cross-lingual Entity Label Mapping in Knowledge Graphs

Knowledge bases such as Wikidata amass vast amounts of named entity information, such as multilingual labels, which can be extremely useful for various multilingual and cross-lingual applications. However, such labels are not guaranteed to…

Computation and Language · Computer Science 2022-06-20 Gabriel Amaral , Mārcis Pinnis , Inguna Skadiņa , Odinaldo Rodrigues , Elena Simperl

KEPLET: Knowledge-Enhanced Pretrained Language Model with Topic Entity Awareness

In recent years, Pre-trained Language Models (PLMs) have shown their superiority by pre-training on unstructured text corpus and then fine-tuning on downstream tasks. On entity-rich textual resources like Wikipedia, Knowledge-Enhanced PLMs…

Computation and Language · Computer Science 2023-05-04 Yichuan Li , Jialong Han , Kyumin Lee , Chengyuan Ma , Benjamin Yao , Derek Liu

Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text

Knowledge graphs can represent information about the real-world using entities and their relations in a structured and semantically rich manner and they enable a variety of downstream applications such as question-answering, recommendation…

Computation and Language · Computer Science 2023-05-16 Hanieh Khorashadizadeh , Nandana Mihindukulasooriya , Sanju Tiwari , Jinghua Groppe , Sven Groppe

Large language models converge toward human-like concept organization

Large language models show human-like performance in knowledge extraction, reasoning and dialogue, but it remains controversial whether this performance is best explained by memorization and pattern matching, or whether it reflects…

Machine Learning · Computer Science 2023-08-30 Mathias Lykke Gammelgaard , Jonathan Gabel Christiansen , Anders Søgaard

WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia

We present WikiReading, a large-scale natural language understanding task and publicly-available dataset with 18 million instances. The task is to predict textual values from the structured knowledge base Wikidata by reading the text of the…

Computation and Language · Computer Science 2017-03-17 Daniel Hewlett , Alexandre Lacoste , Llion Jones , Illia Polosukhin , Andrew Fandrianto , Jay Han , Matthew Kelcey , David Berthelot

An Evaluation Framework for Mapping News Headlines to Event Classes in a Knowledge Graph

Mapping ongoing news headlines to event-related classes in a rich knowledge base can be an important component in a knowledge-based event analysis and forecasting solution. In this paper, we present a methodology for creating a benchmark…

Computation and Language · Computer Science 2023-12-06 Steve Fonin Mbouadeu , Martin Lorenzo , Ken Barker , Oktie Hassanzadeh

Context-aware Entity Typing in Knowledge Graphs

Knowledge graph entity typing aims to infer entities' missing types in knowledge graphs which is an important but under-explored issue. This paper proposes a novel method for this task by utilizing entities' contextual information.…

Computation and Language · Computer Science 2021-09-17 Weiran Pan , Wei Wei , Xian-Ling Mao

Multi-Multi-View Learning: Multilingual and Multi-Representation Entity Typing

Knowledge bases (KBs) are paramount in NLP. We employ multiview learning for increasing accuracy and coverage of entity type information in KBs. We rely on two metaviews: language and representation. For language, we consider high-resource…

Computation and Language · Computer Science 2018-10-25 Yadollah Yaghoobzadeh , Hinrich Schütze

Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning

In this work, we aim at equipping pre-trained language models with structured knowledge. We present two self-supervised tasks learning over raw text with the guidance from knowledge graphs. Building upon entity-level masked language models,…

Computation and Language · Computer Science 2020-04-30 Tao Shen , Yi Mao , Pengcheng He , Guodong Long , Adam Trischler , Weizhu Chen

ENT-DESC: Entity Description Generation by Exploring Knowledge Graph

Previous works on knowledge-to-text generation take as input a few RDF triples or key-value pairs conveying the knowledge of some entities to generate a natural language description. Existing datasets, such as WIKIBIO, WebNLG, and E2E,…

Computation and Language · Computer Science 2020-10-27 Liying Cheng , Dekun Wu , Lidong Bing , Yan Zhang , Zhanming Jie , Wei Lu , Luo Si

WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset

We present a new dataset of Wikipedia articles each paired with a knowledge graph, to facilitate the research in conditional text generation, graph generation and graph representation learning. Existing graph-text paired datasets typically…

Computation and Language · Computer Science 2021-07-21 Luyu Wang , Yujia Li , Ozlem Aslan , Oriol Vinyals

Answer Candidate Type Selection: Text-to-Text Language Model for Closed Book Question Answering Meets Knowledge Graphs

Pre-trained Text-to-Text Language Models (LMs), such as T5 or BART yield promising results in the Knowledge Graph Question Answering (KGQA) task. However, the capacity of the models is limited and the quality decreases for questions with…

Computation and Language · Computer Science 2023-10-12 Mikhail Salnikov , Maria Lysyuk , Pavel Braslavski , Anton Razzhigaev , Valentin Malykh , Alexander Panchenko

Towards Knowledge-Grounded Natural Language Understanding and Generation

This thesis investigates how natural language understanding and generation with transformer models can benefit from grounding the models with knowledge representations and addresses the following key research questions: (i) Can knowledge of…

Computation and Language · Computer Science 2024-03-25 Chenxi Whitehouse

KI-BERT: Infusing Knowledge Context for Better Language and Domain Understanding

Contextualized entity representations learned by state-of-the-art transformer-based language models (TLMs) like BERT, GPT, T5, etc., leverage the attention mechanism to learn the data context from training data corpus. However, these models…

Computation and Language · Computer Science 2021-09-06 Keyur Faldu , Amit Sheth , Prashant Kikani , Hemang Akbari

Entity Context Graph: Learning Entity Representations fromSemi-Structured Textual Sources on the Web

Knowledge is captured in the form of entities and their relationships and stored in knowledge graphs. Knowledge graphs enhance the capabilities of applications in many different areas including Web search, recommendation, and natural…

Machine Learning · Computer Science 2021-03-31 Kalpa Gunaratna , Yu Wang , Hongxia Jin