Related papers: Structural Regularities in Text-based Entity Vecto…

Semantic Entity Retrieval Toolkit

Unsupervised learning of low-dimensional, semantic representations of words and entities has recently gained attention. In this paper we describe the Semantic Entity Retrieval Toolkit (SERT) that provides implementations of our previously…

Computation and Language · Computer Science 2017-07-18 Christophe Van Gysel, Maarten de Rijke, Evangelos Kanoulas

Semantic Document Clustering on Named Entity Features

Keyword-based information processing has limitations due to simple treatment of words. In this paper, we introduce named entities as objectives into document clustering, which are the key elements defining document semantics and in many…

Information Retrieval · Computer Science 2018-07-23 Tru H. Cao, Vuong M. Ngo, Dung T. Hong, Tho T. Quan

Inclusion of Role into Named Entity Recognition and Ranking

Most of the Natural Language Processing systems are involved in entity-based processing for several tasks like Information Extraction, Question-Answering, Text-Summarization and so on. A new challenge comes when entities play roles…

Computation and Language · Computer Science 2025-11-11 Neelesh Kumar Shukla, Sanasam Ranbir Singh

Nested and Balanced Entity Recognition using Multi-Task Learning

Entity Recognition (ER) within a text is a fundamental exercise in Natural Language Processing, enabling further depending tasks such as Knowledge Extraction, Text Summarisation, or Keyphrase Extraction. An entity consists of single words…

Computation and Language · Computer Science 2021-06-14 Andreas Waldis, Luca Mazzola

Entity Context Graph: Learning Entity Representations from Semi-Structured Textual Sources on the Web

Knowledge is captured in the form of entities and their relationships and stored in knowledge graphs. Knowledge graphs enhance the capabilities of applications in many different areas including Web search, recommendation, and natural…

Machine Learning · Computer Science 2021-03-31 Kalpa Gunaratna, Yu Wang, Hongxia Jin

Neural Entity Linking on Technical Service Tickets

Entity linking, the task of mapping textual mentions to known entities, has recently been tackled using contextualized neural networks. We address the question whether these results -- reported for large, high-quality datasets such as…

Computation and Language · Computer Science 2020-05-20 Nadja Kurz, Felix Hamann, Adrian Ulges

Entity Type Recognition using an Ensemble of Distributional Semantic Models to Enhance Query Understanding

We present an ensemble approach for categorizing search query entities in the recruitment domain. Understanding the types of entities expressed in a search query (Company, Skill, Job Title, etc.) enables more intelligent information…

Computation and Language · Computer Science 2016-11-17 Walid Shalaby, Khalifeh Al Jadda, Mohammed Korayem, Trey Grainger

Ontology-Based Query Expansion with Latently Related Named Entities for Semantic Text Search

Traditional information retrieval systems represent documents and queries by keyword sets. However, the content of a document or a query is mainly defined by both keywords and named entities occurring in it. Named entities have ontological…

Information Retrieval · Computer Science 2018-07-17 Vuong M. Ngo, Tru H. Cao

Topics as Entity Clusters: Entity-based Topics from Large Language Models and Graph Neural Networks

Topic models aim to reveal latent structures within a corpus of text, typically through the use of term-frequency statistics over bag-of-words representations from documents. In recent years, conceptual entities -- interpretable,…

Computation and Language · Computer Science 2024-08-27 Manuel V. Loureiro, Steven Derby, Tri Kurniawan Wijaya

Analyzing Structures in the Semantic Vector Space: A Framework for Decomposing Word Embeddings

Word embeddings are rich word representations, which in combination with deep neural networks, lead to large performance gains for many NLP tasks. However, word embeddings are represented by dense, real-valued vectors and they are therefore…

Computation and Language · Computer Science 2019-12-24 Andreas Hanselowski, Iryna Gurevych

Interpretable Entity Representations through Large-Scale Typing

In standard methodology for natural language processing, entities in text are typically embedded in dense vector spaces with pre-trained models. The embeddings produced this way are effective when fed into downstream models, but they…

Computation and Language · Computer Science 2020-10-14 Yasumasa Onoe, Greg Durrett

Clustering Prominent People and Organizations in Topic-Specific Text Corpora

Named entities in text documents are the names of people, organization, location or other types of objects in the documents that exist in the real world. A persisting research challenge is to use computational techniques to identify such…

Computation and Language · Computer Science 2019-07-09 Abdulkareem Alsudais, Hovig Tchalian

WISER: A Semantic Approach for Expert Finding in Academia based on Entity Linking

We present WISER, a new semantic search engine for expert finding in academia. Our system is unsupervised and it jointly combines classical language modeling techniques, based on text evidences, with the Wikipedia Knowledge Graph, via…

Information Retrieval · Computer Science 2019-06-11 Paolo Cifariello, Paolo Ferragina, Marco Ponza

Text Classification Models for Form Entity Linking

Forms are a widespread type of template-based document used in a great variety of fields including, among others, administration, medicine, finance, or insurance. The automatic extraction of the information included in these documents is…

Computation and Language · Computer Science 2021-12-15 María Villota, César Domínguez, Jónathan Heras, Eloy Mata, Vico Pascual

Retrieving Multi-Entity Associations: An Evaluation of Combination Modes for Word Embeddings

Word embeddings have gained significant attention as learnable representations of semantic relations between words, and have been shown to improve upon the results of traditional word representations. However, little effort has been devoted…

Information Retrieval · Computer Science 2019-05-23 Gloria Feher, Andreas Spitz, Michael Gertz

VectorSearch: Enhancing Document Retrieval with Semantic Embeddings and Optimized Search

Traditional retrieval methods have been essential for assessing document similarity but struggle with capturing semantic nuances. Despite advancements in latent semantic analysis (LSA) and deep learning, achieving comprehensive semantic…

Information Retrieval · Computer Science 2024-09-27 Solmaz Seyed Monir, Irene Lau, Shubing Yang, Dongfang Zhao

Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering

Recent advances in machine learning, particularly Large Language Models (LLMs) such as BERT and GPT, provide rich contextual embeddings that improve text representation. However, current document clustering approaches often ignore the…

Computation and Language · Computer Science 2024-12-20 Imed Keraghel, Mohamed Nadif

Entity Retrieval for Answering Entity-Centric Questions

The similarity between the question and indexed documents is a crucial factor in document retrieval for retrieval-augmented question answering. Although this is typically the only method for obtaining the relevant documents, it is not the…

Information Retrieval · Computer Science 2024-08-07 Hassan S. Shavarani, Anoop Sarkar

Unsupervised Matching of Data and Text

Entity resolution is a widely studied problem with several proposals to match records across relations. Matching textual content is a widespread task in many applications, such as question answering and search. While recent methods achieve…

Databases · Computer Science 2021-12-17 Naser Ahmadi, Hansjorg Sand, Paolo Papotti

A Survey on Deep Learning for Named Entity Recognition

Named entity recognition (NER) is the task to identify mentions of rigid designators from text belonging to predefined semantic types such as person, location, organization etc. NER always serves as the foundation for many natural language…

Computation and Language · Computer Science 2023-04-26 Jing Li, Aixin Sun, Jianglei Han, Chenliang Li