Related papers: Structured Knowledge Discovery from Massive Text C…

Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning

In this work, we aim at equipping pre-trained language models with structured knowledge. We present two self-supervised tasks learning over raw text with the guidance from knowledge graphs. Building upon entity-level masked language models,…

Computation and Language · Computer Science 2020-04-30 Tao Shen , Yi Mao , Pengcheng He , Guodong Long , Adam Trischler , Weizhu Chen

Learning Joint Query Interpretation and Response Ranking

Thanks to information extraction and semantic Web efforts, search on unstructured text is increasingly refined using semantic annotations and structured knowledge bases. However, most users cannot become familiar with the schema of…

Information Retrieval · Computer Science 2012-12-27 Uma Sawant , Soumen Chakrabarti

Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach

Knowledge-enhanced text generation aims to enhance the quality of generated text by utilizing internal or external knowledge sources. While language models have demonstrated impressive capabilities in generating coherent and fluent text,…

Computation and Language · Computer Science 2026-01-15 Shuqi Liu , Han Wu , Guanzhi Deng , Jianshu Chen , Xiaoyang Wang , Linqi Song

StrucText-Eval: Evaluating Large Language Model's Reasoning Ability in Structure-Rich Text

The effective utilization of structured data, integral to corporate data strategies, has been challenged by the rise of large language models (LLMs) capable of processing unstructured information. This shift prompts the question: can LLMs…

Computation and Language · Computer Science 2024-10-22 Zhouhong Gu , Haoning Ye , Xingzhou Chen , Zeyang Zhou , Hongwei Feng , Yanghua Xiao

MetaPAD: Meta Pattern Discovery from Massive Text Corpora

Mining textual patterns in news, tweets, papers, and many other kinds of text corpora has been an active theme in text mining and NLP research. Previous studies adopt a dependency parsing-based pattern discovery approach. However, the…

Computation and Language · Computer Science 2017-03-16 Meng Jiang , Jingbo Shang , Taylor Cassidy , Xiang Ren , Lance M. Kaplan , Timothy P. Hanratty , Jiawei Han

Machine Understanding of Scientific Language

Scientific information expresses human understanding of nature. This knowledge is largely disseminated in different forms of text, including scientific papers, news articles, and discourse among people on social media. While important for…

Computation and Language · Computer Science 2025-07-01 Dustin Wright

Unsupervised Pre-training with Structured Knowledge for Improving Natural Language Inference

While recent research on natural language inference has considerably benefited from large annotated datasets, the amount of inference-related knowledge (including commonsense) provided in the annotated data is still rather limited. There…

Computation and Language · Computer Science 2021-09-10 Xiaoyu Yang , Xiaodan Zhu , Zhan Shi , Tianda Li

Implicit Knowledge in Argumentative Texts: An Annotated Corpus

When speaking or writing, people omit information that seems clear and evident, such that only part of the message is expressed in words. Especially in argumentative texts it is very common that (important) parts of the argument are implied…

Computation and Language · Computer Science 2019-12-24 Maria Becker , Katharina Korfhage , Anette Frank

Generating Navigable Semantic Maps from Social Sciences Corpora

It is now commonplace to observe that we are facing a deluge of online information. Researchers have of course long acknowledged the potential value of this information since digital traces make it possible to directly observe, describe and…

Computation and Language · Computer Science 2015-07-09 Thierry Poibeau , Pablo Ruiz

A Framework for Capturing and Analyzing Unstructured and Semi-structured Data for a Knowledge Management System

Mainstream knowledge management researchers generally agree that knowledge extracted from unstructured data and semi-structured data have become imperative for organizational strategic decision making. In this research, we develop a…

Information Retrieval · Computer Science 2020-07-15 Gerald Onwujekwe , Kweku-Muata Osei-Bryson , Nnatubemugo Ngwum

Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey

Incorporating external knowledge into dialogue generation has been proven to benefit the performance of an open-domain Dialogue System (DS), such as generating informative or stylized responses, controlling conversation topics. In this…

Computation and Language · Computer Science 2024-11-15 Longxuan Ma , Mingda Li , Weinan Zhang , Jiapeng Li , Ting Liu

Towards information-rich, logical text generation with knowledge-enhanced neural models

Text generation system has made massive promising progress contributed by deep learning techniques and has been widely applied in our life. However, existing end-to-end neural models suffer from the problem of tending to generate…

Artificial Intelligence · Computer Science 2020-03-03 Hao Wang , Bin Guo , Wei Wu , Zhiwen Yu

Unfolding the Structure of a Document using Deep Learning

Understanding and extracting of information from large documents, such as business opportunities, academic articles, medical documents and technical reports, poses challenges not present in short documents. Such large documents may be…

Computation and Language · Computer Science 2019-10-10 Muhammad Mahbubur Rahman , Tim Finin

Open-world Story Generation with Structured Knowledge Enhancement: A Comprehensive Survey

Storytelling and narrative are fundamental to human experience, intertwined with our social and cultural engagement. As such, researchers have long attempted to create systems that can generate stories automatically. In recent years,…

Computation and Language · Computer Science 2023-09-13 Yuxin Wang , Jieru Lin , Zhiwei Yu , Wei Hu , Börje F. Karlsson

Logical Structure as Knowledge: Enhancing LLM Reasoning via Structured Logical Knowledge Density Estimation

The reasoning capabilities of Large Language Models (LLMs) are increasingly attributed to training data quality rather than mere parameter scaling. However, existing data-centric paradigms often equate quality with factuality or diversity…

Artificial Intelligence · Computer Science 2026-02-13 Zhen Bi , Zhenlin Hu , Xueshu Chen , Mingyang Chen , Cheng Deng , Yida Xue , Zhen Wang , Qing Shen , Ningyu Zhang , Jungang Lou

Unsupervised Construction of Knowledge Graphs From Text and Code

The scientific literature is a rich source of information for data mining with conceptual knowledge graphs; the open science movement has enriched this literature with complementary source code that implements scientific models. To exploit…

Machine Learning · Computer Science 2019-08-27 Kun Cao , James Fairbanks

Learning from Web: Review of Approaches

Knowledge discovery is defined as non-trivial extraction of implicit, previously unknown and potentially useful information from given data. Knowledge extraction from web documents deals with unstructured, free-format documents whose number…

Neural and Evolutionary Computing · Computer Science 2007-05-23 Vitaly Schetinin

From Text to Knowledge with Graphs: modelling, querying and exploiting textual content

This paper highlights the challenges, current trends, and open issues related to the representation, querying and analytics of content extracted from texts. The internet contains vast text-based information on various subjects, including…

Databases · Computer Science 2023-10-11 Genoveva Vargas-Solar , Mirian Halfeld Ferrari Alves , Anne-Lyse Minard Forst

A Richly Annotated Corpus for Different Tasks in Automated Fact-Checking

Automated fact-checking based on machine learning is a promising approach to identify false information distributed on the web. In order to achieve satisfactory performance, machine learning methods require a large corpus with reliable…

Computation and Language · Computer Science 2019-11-05 Andreas Hanselowski , Christian Stab , Claudia Schulz , Zile Li , Iryna Gurevych

Reasoning Factual Knowledge in Structured Data with Large Language Models

Large language models (LLMs) have made remarkable progress in various natural language processing tasks as a benefit of their capability to comprehend and reason with factual knowledge. However, a significant amount of factual knowledge is…

Computation and Language · Computer Science 2024-08-23 Sirui Huang , Yanggan Gu , Xuming Hu , Zhonghao Li , Qing Li , Guandong Xu