Related papers: Constructing Reference Sets from Unstructured, Ung…

Simple Large-scale Relation Extraction from Unstructured Text

Knowledge-based question answering relies on the availability of facts, the majority of which cannot be found in structured sources (e.g. Wikipedia info-boxes, Wikidata). One of the major components of extracting facts from unstructured…

Computation and Language · Computer Science 2018-03-28 Christos Christodoulopoulos, Arpit Mittal

CERES: Distantly Supervised Relation Extraction from the Semi-Structured Web

The web contains countless semi-structured websites, which can be a rich source of information for populating knowledge bases. Existing methods for extracting relations from the DOM trees of semi-structured webpages can achieve high…

Artificial Intelligence · Computer Science 2018-04-13 Colin Lockard, Xin Luna Dong, Arash Einolghozati, Prashant Shiralkar

Recognizing Referential Links: An Information Extraction Perspective

We present an efficient and robust reference resolution algorithm in an end-to-end state-of-the-art information extraction system, which must work with a considerably impoverished syntactic analysis of the input sentences. Considering this…

cmp-lg · Computer Science 2008-02-03 Megumi Kameyama

Content-Based Table Retrieval for Web Queries

Understanding the connections between unstructured text and semi-structured table is an important yet neglected problem in natural language processing. In this work, we focus on content-based table retrieval. Given a query, the task is to…

Computation and Language · Computer Science 2017-06-09 Zhao Yan, Duyu Tang, Nan Duan, Junwei Bao, Yuanhua Lv, Ming Zhou, Zhoujun Li

Biographical: A Semi-Supervised Relation Extraction Dataset

Extracting biographical information from online documents is a popular research topic among the information extraction (IE) community. Various natural language processing (NLP) techniques such as text classification, text summarisation and…

Information Retrieval · Computer Science 2022-05-03 Alistair Plum, Tharindu Ranasinghe, Spencer Jones, Constantin Orasan, Ruslan Mitkov

PageRank without hyperlinks: Structural re-ranking using links induced by language models

Inspired by the PageRank and HITS (hubs and authorities) algorithms for Web search, we propose a structural re-ranking approach to ad hoc information retrieval: we reorder the documents in an initially retrieved set by exploiting asymmetric…

Information Retrieval · Computer Science 2007-05-23 Oren Kurland, Lillian Lee

Neural Relation Extraction via Inner-Sentence Noise Reduction and Transfer Learning

Extracting relations is critical for knowledge base completion and construction in which distant supervised methods are widely used to extract relational facts automatically with the existing knowledge bases. However, the automatically…

Computation and Language · Computer Science 2018-11-09 Tianyi Liu, Xinsong Zhang, Wanhao Zhou, Weijia Jia

Distantly Supervised Morpho-Syntactic Model for Relation Extraction

The task of Information Extraction (IE) involves automatically converting unstructured textual content into structured data. Most research in this field concentrates on extracting all facts or a specific set of relationships from documents.…

Computation and Language · Computer Science 2024-01-19 Nicolas Gutehrlé, Iana Atanassova

Relational Learning and Feature Extraction by Querying over Heterogeneous Information Networks

Many real world systems need to operate on heterogeneous information networks that consist of numerous interacting components of different types. Examples include systems that perform data analysis on biological information networks; social…

Artificial Intelligence · Computer Science 2017-07-26 Parisa Kordjamshidi, Sameer Singh, Daniel Khashabi, Christos Christodoulopoulos, Mark Sammons, Saurabh Sinha, Dan Roth

Representation Learning for Weakly Supervised Relation Extraction

Recent years have seen rapid development in Information Extraction, as well as its subtask, Relation Extraction. Relation Extraction is able to detect semantic relations between entities in sentences. Currently, many efficient approaches…

Computation and Language · Computer Science 2024-03-19 Zhuang Li

Learning to Revise References for Faithful Summarization

In real-world scenarios with naturally occurring datasets, reference summaries are noisy and may contain information that cannot be inferred from the source text. On large news corpora, removing low quality samples has been shown to reduce…

Computation and Language · Computer Science 2022-10-13 Griffin Adams, Han-Chin Shing, Qing Sun, Christopher Winestock, Kathleen McKeown, Noémie Elhadad

Exploit Multiple Reference Graphs for Semi-supervised Relation Extraction

Manual annotation of the labeled data for relation extraction is time-consuming and labor-intensive. Semi-supervised methods can offer helping hands for this problem and have aroused great research interests. Existing work focuses on…

Computation and Language · Computer Science 2020-10-23 Wanli Li, Tieyun Qian

The Links Have It: Infobox Generation by Summarization over Linked Entities

Online encyclopedias such as Wikipedia have become one of the best sources of knowledge. Much effort has been devoted to expanding and enriching the structured data by automatic information extraction from unstructured text in Wikipedia.…

Information Retrieval · Computer Science 2014-06-26 Kezun Zhang, Yanghua Xiao, Hanghang Tong, Haixun Wang, Wei Wang

Semi-Automated Construction of Food Composition Knowledge Base

A food composition knowledge base, which stores the essential phyto-, micro-, and macro-nutrients of foods is useful for both research and industrial applications. Although many existing knowledge bases attempt to curate such information,…

Computation and Language · Computer Science 2023-01-27 Jason Youn, Fangzhou Li, Ilias Tagkopoulos

Semi-Automated Labeling of Requirement Datasets for Relation Extraction

Creating datasets manually by human annotators is a laborious task that can lead to biased and inhomogeneous labels. We propose a flexible, semi-automatic framework for labeling data for relation extraction. Furthermore, we provide a…

Software Engineering · Computer Science 2021-09-07 Jeremias Bohn, Jannik Fischbach, Martin Schmitt, Hinrich Schütze, Andreas Vogelsang

Construction et enrichissement automatique d'ontologie à partir de ressources externes

Automatic construction of ontologies from text is generally based on retrieving text content. For a much more rich ontology we extend these approaches by taking into account the document structure and some external resources (like thesaurus…

Information Retrieval · Computer Science 2010-02-02 Eric Kergosien, Mouna Kamel, Christian Sallaberry, Marie-Noëlle Bessagnet, Nathalie Aussenac-Gilles, Mauro Gaio

Relation Extraction : A Survey

With the advent of the Internet, a large amount of digital text is generated every day in the form of news articles, research publications, blogs, question answering forums and social media. It is important to develop techniques for extracting…

Computation and Language · Computer Science 2017-12-15 Sachin Pawar, Girish K. Palshikar, Pushpak Bhattacharyya

Content-Based Book Recommending Using Learning for Text Categorization

Recommender systems improve access to relevant products and information by making personalized suggestions based on previous examples of a user's likes and dislikes. Most existing recommender systems use social filtering methods that base…

Digital Libraries · Computer Science 2007-05-23 Raymond J. Mooney, Loriene Roy

Concept Trees: Building Dynamic Concepts from Semi-Structured Data using Nature-Inspired Methods

This paper describes a method for creating structure from heterogeneous sources, as part of an information database, or more specifically, a 'concept base'. Structures called 'concept trees' can grow from the semi-structured sources when…

Information Retrieval · Computer Science 2015-03-17 Kieran Greer

Making Sense of Unstructured Text Data

Many network analysis tasks in social sciences rely on pre-existing data sources that were created with explicit relations or interactions between entities under consideration. Examples include email logs, friends and followers networks on…

Social and Information Networks · Computer Science 2017-04-20 Lin Li, William M. Campbell, Cagri Dagli, Joseph P. Campbell