Related papers: Entity Extraction with Knowledge from Web Scale Co…

A Technical Report: Entity Extraction using Both Character-based and Token-based Similarity

Entity extraction is fundamental to many text mining tasks such as organisation name recognition. A popular approach to entity extraction is based on matching sub-string candidates in a document against a dictionary of entities. To handle…

Databases · Computer Science 2017-02-14 Zeyi Wen , Dong Deng , Rui Zhang , Kotagiri Ramamohanarao

A Web Scale Entity Extraction System

Understanding the semantic meaning of content on the web through the lens of entities and concepts has many practical advantages. However, when building large-scale entity extraction systems, practitioners are facing unique challenges…

Computation and Language · Computer Science 2021-10-04 Xuanting Cai , Quanbin Ma , Pan Li , Jianyu Liu , Qi Zeng , Zhengkan Yang , Pushkar Tripathi

An Operator for Entity Extraction in MapReduce

Dictionary-based entity extraction involves finding mentions of dictionary entities in text. Text mentions are often noisy, containing spurious or missing words. Efficient algorithms for detecting approximate entity mentions follow one of…

Databases · Computer Science 2015-12-17 Ndapandula Nakashole

Hypertext Entity Extraction in Webpage

Webpage entity extraction is a fundamental natural language processing task in both research and applications. Nowadays, the majority of webpage entity extraction models are trained on structured datasets which strive to retain textual…

Computation and Language · Computer Science 2024-03-05 Yifei Yang , Tianqiao Liu , Bo Shao , Hai Zhao , Linjun Shou , Ming Gong , Daxin Jiang

A New Entity Extraction Method Based on Machine Reading Comprehension

Entity extraction is a key technology for obtaining information from massive texts in natural language processing. The further interaction between them does not meet the standards of human reading comprehension, thus limiting the…

Computation and Language · Computer Science 2021-08-23 Xiaobo Jiang , Kun He , Jiajun He , Guangyu Yan

Automatic Labeling for Entity Extraction in Cyber Security

Timely analysis of cyber-security information necessitates automated information extraction from unstructured text. While state-of-the-art extraction methods produce extremely accurate results, they require ample training data, which is…

Information Retrieval · Computer Science 2014-06-11 Robert A. Bridges , Corinne L. Jones , Michael D. Iannacone , Kelly M. Testa , John R. Goodall

Entity Identifier: A Natural Text Parsing-based Framework For Entity Relation Extraction

The field of programming has a diversity of paradigms that are used according to the working framework. While current neural code generation methods are able to learn and generate code directly from text, we believe that this approach is…

Computation and Language · Computer Science 2023-07-12 El Mehdi Chouham , Jessica López Espejel , Mahaman Sanoussi Yahaya Alassan , Walid Dahhane , El Hassane Ettifouri

Contextual Augmentation for Entity Linking using Large Language Models

Entity Linking involves detecting and linking entity mentions in natural language texts to a knowledge graph. Traditional methods use a two-step process with separate models for entity recognition and disambiguation, which can be…

Computation and Language · Computer Science 2025-10-23 Daniel Vollmers , Hamada M. Zahera , Diego Moussallem , Axel-Cyrille Ngonga Ngomo

Techniques for Jointly Extracting Entities and Relations: A Survey

Relation Extraction is an important task in Information Extraction which deals with identifying semantic relations between entity mentions. Traditionally, relation extraction is carried out after entity extraction in a "pipeline" fashion,…

Computation and Language · Computer Science 2021-03-11 Sachin Pawar , Pushpak Bhattacharyya , Girish K. Palshikar

SEE: Syntax-aware Entity Embedding for Neural Relation Extraction

Distant supervised relation extraction is an efficient approach to scale relation extraction to very large corpora, and has been widely used to find novel relational facts from plain text. Recent studies on neural relation extraction have…

Computation and Language · Computer Science 2018-01-12 Zhengqiu He , Wenliang Chen , Zhenghua Li , Meishan Zhang , Wei Zhang , Min Zhang

Interpretable Entity Representations through Large-Scale Typing

In standard methodology for natural language processing, entities in text are typically embedded in dense vector spaces with pre-trained models. The embeddings produced this way are effective when fed into downstream models, but they…

Computation and Language · Computer Science 2020-10-14 Yasumasa Onoe , Greg Durrett

Entity Ranking in Wikipedia

The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document collections. Examples of named entities…

Information Retrieval · Computer Science 2007-11-21 Anne-Marie Vercoustre , James A. Thom , Jovan Pehcevski

Inclusion of Role into Named Entity Recognition and Ranking

Most of the Natural Language Processing systems are involved in entity-based processing for several tasks like Information Extraction, Question-Answering, Text-Summarization and so on. A new challenge comes when entities play roles…

Computation and Language · Computer Science 2025-11-11 Neelesh Kumar Shukla , Sanasam Ranbir Singh

Entity Tagging: Extracting Entities in Text Without Mention Supervision

Detection and disambiguation of all entities in text is a crucial task for a wide range of applications. The typical formulation of the problem involves two stages: detect mention boundaries and link all mentions to a knowledge base. For a…

Information Retrieval · Computer Science 2022-09-14 Christina Du , Kashyap Popat , Louis Martin , Fabio Petroni

Entity Context Graph: Learning Entity Representations fromSemi-Structured Textual Sources on the Web

Knowledge is captured in the form of entities and their relationships and stored in knowledge graphs. Knowledge graphs enhance the capabilities of applications in many different areas including Web search, recommendation, and natural…

Machine Learning · Computer Science 2021-03-31 Kalpa Gunaratna , Yu Wang , Hongxia Jin

Entity Retrieval for Answering Entity-Centric Questions

The similarity between the question and indexed documents is a crucial factor in document retrieval for retrieval-augmented question answering. Although this is typically the only method for obtaining the relevant documents, it is not the…

Information Retrieval · Computer Science 2024-08-07 Hassan S. Shavarani , Anoop Sarkar

SpanRE: Entities and Overlapping Relations Extraction Based on Spans and Entity Attention

Extracting entities and relations is an essential task of information extraction. Triplets extracted from a sentence might overlap with each other. Previous methods either did not address the overlapping issues or solved overlapping issues…

Computation and Language · Computer Science 2023-04-07 Hao Zhang

Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme

Joint extraction of entities and relations is an important task in information extraction. To tackle this problem, we firstly propose a novel tagging scheme that can convert the joint extraction task to a tagging problem. Then, based on our…

Computation and Language · Computer Science 2017-06-19 Suncong Zheng , Feng Wang , Hongyun Bao , Yuexing Hao , Peng Zhou , Bo Xu

Bridging Queries and Tables through Entities in Table Retrieval

Table retrieval is essential for accessing information stored in structured tabular formats; however, it remains less explored than text retrieval. The content of the table primarily consists of phrases and words, which include a large…

Information Retrieval · Computer Science 2025-04-10 Da Li , Keping Bi , Jiafeng Guo , Xueqi Cheng

Citation Data-set for Machine Learning Citation Styles and Entity Extraction from Citation Strings

Citation parsing is fundamental for search engines within academia and the protection of intellectual property. Meticulous extraction is further needed when evaluating the similarity of documents and calculating their citation impact.…

Digital Libraries · Computer Science 2018-05-23 Niall Martin Ryan