Related papers: Entity Extraction with Knowledge from Web Scale Co…

200 papers

Entity extraction is fundamental to many text mining tasks such as organisation name recognition. A popular approach to entity extraction is based on matching sub-string candidates in a document against a dictionary of entities. To handle…

Databases · Computer Science 2017-02-14 Zeyi Wen, Dong Deng, Rui Zhang, Kotagiri Ramamohanarao

Understanding the semantic meaning of content on the web through the lens of entities and concepts has many practical advantages. However, when building large-scale entity extraction systems, practitioners are facing unique challenges…

Computation and Language · Computer Science 2021-10-04 Xuanting Cai, Quanbin Ma, Pan Li, Jianyu Liu, Qi Zeng, Zhengkan Yang, Pushkar Tripathi

Dictionary-based entity extraction involves finding mentions of dictionary entities in text. Text mentions are often noisy, containing spurious or missing words. Efficient algorithms for detecting approximate entity mentions follow one of…

Databases · Computer Science 2015-12-17 Ndapandula Nakashole

Webpage entity extraction is a fundamental natural language processing task in both research and applications. Nowadays, the majority of webpage entity extraction models are trained on structured datasets which strive to retain textual…

Computation and Language · Computer Science 2024-03-05 Yifei Yang, Tianqiao Liu, Bo Shao, Hai Zhao, Linjun Shou, Ming Gong, Daxin Jiang

Entity extraction is a key technology for obtaining information from massive texts in natural language processing. The further interaction between them does not meet the standards of human reading comprehension, thus limiting the…

Computation and Language · Computer Science 2021-08-23 Xiaobo Jiang, Kun He, Jiajun He, Guangyu Yan

Timely analysis of cyber-security information necessitates automated information extraction from unstructured text. While state-of-the-art extraction methods produce extremely accurate results, they require ample training data, which is…

Information Retrieval · Computer Science 2014-06-11 Robert A. Bridges, Corinne L. Jones, Michael D. Iannacone, Kelly M. Testa, John R. Goodall

The field of programming has a diversity of paradigms that are used according to the working framework. While current neural code generation methods are able to learn and generate code directly from text, we believe that this approach is…

Computation and Language · Computer Science 2023-07-12 El Mehdi Chouham, Jessica López Espejel, Mahaman Sanoussi Yahaya Alassan, Walid Dahhane, El Hassane Ettifouri

Entity Linking involves detecting and linking entity mentions in natural language texts to a knowledge graph. Traditional methods use a two-step process with separate models for entity recognition and disambiguation, which can be…

Computation and Language · Computer Science 2025-10-23 Daniel Vollmers, Hamada M. Zahera, Diego Moussallem, Axel-Cyrille Ngonga Ngomo

Relation Extraction is an important task in Information Extraction which deals with identifying semantic relations between entity mentions. Traditionally, relation extraction is carried out after entity extraction in a "pipeline" fashion,…

Computation and Language · Computer Science 2021-03-11 Sachin Pawar, Pushpak Bhattacharyya, Girish K. Palshikar

Distant supervised relation extraction is an efficient approach to scale relation extraction to very large corpora, and has been widely used to find novel relational facts from plain text. Recent studies on neural relation extraction have…

Computation and Language · Computer Science 2018-01-12 Zhengqiu He, Wenliang Chen, Zhenghua Li, Meishan Zhang, Wei Zhang, Min Zhang

In standard methodology for natural language processing, entities in text are typically embedded in dense vector spaces with pre-trained models. The embeddings produced this way are effective when fed into downstream models, but they…

Computation and Language · Computer Science 2020-10-14 Yasumasa Onoe, Greg Durrett

The traditional entity extraction problem lies in the ability of extracting named entities from plain text using natural language processing techniques and intensive training from large document collections. Examples of named entities…

Information Retrieval · Computer Science 2007-11-21 Anne-Marie Vercoustre, James A. Thom, Jovan Pehcevski

Most of the Natural Language Processing systems are involved in entity-based processing for several tasks like Information Extraction, Question-Answering, Text-Summarization and so on. A new challenge comes when entities play roles…

Computation and Language · Computer Science 2025-11-11 Neelesh Kumar Shukla, Sanasam Ranbir Singh

Detection and disambiguation of all entities in text is a crucial task for a wide range of applications. The typical formulation of the problem involves two stages: detect mention boundaries and link all mentions to a knowledge base. For a…

Information Retrieval · Computer Science 2022-09-14 Christina Du, Kashyap Popat, Louis Martin, Fabio Petroni

Knowledge is captured in the form of entities and their relationships and stored in knowledge graphs. Knowledge graphs enhance the capabilities of applications in many different areas including Web search, recommendation, and natural…

Machine Learning · Computer Science 2021-03-31 Kalpa Gunaratna, Yu Wang, Hongxia Jin

The similarity between the question and indexed documents is a crucial factor in document retrieval for retrieval-augmented question answering. Although this is typically the only method for obtaining the relevant documents, it is not the…

Information Retrieval · Computer Science 2024-08-07 Hassan S. Shavarani, Anoop Sarkar

Extracting entities and relations is an essential task of information extraction. Triplets extracted from a sentence might overlap with each other. Previous methods either did not address the overlapping issues or solved overlapping issues…

Computation and Language · Computer Science 2023-04-07 Hao Zhang

Joint extraction of entities and relations is an important task in information extraction. To tackle this problem, we firstly propose a novel tagging scheme that can convert the joint extraction task to a tagging problem. Then, based on our…

Computation and Language · Computer Science 2017-06-19 Suncong Zheng, Feng Wang, Hongyun Bao, Yuexing Hao, Peng Zhou, Bo Xu

Table retrieval is essential for accessing information stored in structured tabular formats; however, it remains less explored than text retrieval. The content of the table primarily consists of phrases and words, which include a large…

Information Retrieval · Computer Science 2025-04-10 Da Li, Keping Bi, Jiafeng Guo, Xueqi Cheng

Citation parsing is fundamental for search engines within academia and the protection of intellectual property. Meticulous extraction is further needed when evaluating the similarity of documents and calculating their citation impact.…

Digital Libraries · Computer Science 2018-05-23 Niall Martin Ryan