Related papers: DeepKE: A Deep Learning Based Knowledge Extraction…

OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System

We introduce OneKE, a dockerized schema-guided knowledge extraction system, which can extract knowledge from the Web and raw PDF Books, and support various domains (science, news, etc.). Specifically, we design OneKE with multiple agents…

Computation and Language · Computer Science 2025-02-07 Yujie Luo, Xiangyuan Ru, Kangwei Liu, Lin Yuan, Mengshu Sun, Ningyu Zhang, Lei Liang, Zhiqiang Zhang, Jun Zhou, Lanning Wei, Da Zheng, Haofen Wang, Huajun Chen

Open Domain Knowledge Extraction for Knowledge Graphs

The quality of a knowledge graph directly impacts the quality of downstream applications (e.g. the number of answerable questions using the graph). One ongoing challenge when building a knowledge graph is to ensure completeness and…

Computation and Language · Computer Science 2023-12-18 Kun Qian, Anton Belyi, Fei Wu, Samira Khorshidi, Azadeh Nikfarjam, Rahul Khot, Yisi Sang, Katherine Luna, Xianqi Chu, Eric Choi, Yash Govind, Chloe Seivwright, Yiwen Sun, Ahmed Fakhry, Theo Rekatsinas, Ihab Ilyas, Xiaoguang Qi, Yunyao Li

OpenKI: Integrating Open Information Extraction and Knowledge Bases with Relation Inference

In this paper, we consider advancing web-scale knowledge extraction and alignment by integrating OpenIE extractions in the form of (subject, predicate, object) triples with Knowledge Bases (KB). Traditional techniques from universal schema…

Information Retrieval · Computer Science 2019-04-30 Dongxu Zhang, Subhabrata Mukherjee, Colin Lockard, Xin Luna Dong, Andrew McCallum

OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction

OpenNRE is an open-source and extensible toolkit that provides a unified framework to implement neural models for relation extraction (RE). Specifically, by implementing typical RE methods, OpenNRE not only allows developers to train custom…

Computation and Language · Computer Science 2019-10-01 Xu Han, Tianyu Gao, Yuan Yao, Demin Ye, Zhiyuan Liu, Maosong Sun

ODKE+: Ontology-Guided Open-Domain Knowledge Extraction with LLMs

Knowledge graphs (KGs) are foundational to many AI applications, but maintaining their freshness and completeness remains costly. We present ODKE+, a production-grade system that automatically extracts and ingests millions of open-domain…

Computation and Language · Computer Science 2025-09-08 Samira Khorshidi, Azadeh Nikfarjam, Suprita Shankar, Yisi Sang, Yash Govind, Hyun Jang, Ali Kasgari, Alexis McClimans, Mohamed Soliman, Vishnu Konda, Ahmed Fakhry, Xiaoguang Qi

KnowIt: Deep Time Series Modeling and Interpretation

KnowIt (Knowledge discovery in time series data) is a flexible framework for building deep time series models and interpreting them. It is implemented as a Python toolkit, with source code and documentation available from…

Machine Learning · Computer Science 2026-02-19 M. W. Theunissen, R. Rabe, H. L. Potgieter, M. H. Davel

TNNT: The Named Entity Recognition Toolkit

Extraction of categorised named entities from text is a complex task given the availability of a variety of Named Entity Recognition (NER) models and the unstructured information encoded in different source document formats. Processing the…

Computation and Language · Computer Science 2021-12-07 Sandaru Seneviratne, Sergio J. Rodríguez Méndez, Xuecheng Zhang, Pouya G. Omran, Kerry Taylor, Armin Haller

DeepRec: An Open-source Toolkit for Deep Learning based Recommendation

Deep learning based recommender systems have been extensively explored in recent years. However, the large number of models proposed each year poses a big challenge for both researchers and practitioners in reproducing the results for…

Information Retrieval · Computer Science 2019-05-28 Shuai Zhang, Yi Tay, Lina Yao, Bin Wu, Aixin Sun

Disentangling Knowledge Representations for Large Language Model Editing

Knowledge Editing has emerged as a promising solution for efficiently updating embedded knowledge in large language models (LLMs). While existing approaches demonstrate effectiveness in integrating new knowledge and preserving the original…

Computation and Language · Computer Science 2026-03-26 Mengqi Zhang, Zisheng Zhou, Xiaotian Ye, Qiang Liu, Zhaochun Ren, Zhumin Chen, Pengjie Ren

CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction

In order to construct or extend entity-centric and event-centric knowledge graphs (KG and EKG), the information extraction (IE) annotation toolkit is essential. However, existing IE toolkits have several non-trivial problems, such as not…

Computation and Language · Computer Science 2023-07-04 Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu, Wenjuan Han

Interpretable Multi-Step Reasoning with Knowledge Extraction on Complex Healthcare Question Answering

Healthcare question answering assistance aims to provide customer healthcare information, which widely appears in both Web and mobile Internet. The questions usually require the assistance to have proficient healthcare background knowledge…

Artificial Intelligence · Computer Science 2020-09-29 Ye Liu, Shaika Chowdhury, Chenwei Zhang, Cornelia Caragea, Philip S. Yu

PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts using Transfer Learning

The challenge of recognizing named entities in a given text has been a very dynamic field in recent years. This is due to the advances in neural network architectures, increase of computing power and the availability of diverse labeled…

Computation and Language · Computer Science 2023-01-10 Nasi Jofche, Kostadin Mishev, Riste Stojanov, Milos Jovanovik, Dimitar Trajanov

REKnow: Enhanced Knowledge for Joint Entity and Relation Extraction

Relation extraction is an important but challenging task that aims to extract all hidden relational facts from the text. With the development of deep language models, relation extraction methods have achieved good performance on various…

Computation and Language · Computer Science 2022-08-17 Sheng Zhang, Patrick Ng, Zhiguo Wang, Bing Xiang

TableLab: An Interactive Table Extraction System with Adaptive Deep Learning

Table extraction from PDF and image documents is a ubiquitous task in the real-world. Perfect extraction quality is difficult to achieve with one single out-of-box model due to (1) the wide variety of table styles, (2) the lack of training…

Human-Computer Interaction · Computer Science 2021-02-18 Nancy Xin Ru Wang, Douglas Burdick, Yunyao Li

A Dynamic Self-Evolving Extraction System

The extraction of structured information from raw text is a fundamental component of many NLP applications, including document retrieval, ranking, and relevance estimation. High-quality extractions often require domain-specific accuracy,…

Computation and Language · Computer Science 2026-03-10 Moin Amin-Naseri, Hannah Kim, Estevam Hruschka

Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models

Recent knowledge editing methods have primarily focused on modifying structured knowledge in large language models. However, this task setting overlooks the fact that a significant portion of real-world knowledge is stored in an…

Computation and Language · Computer Science 2025-02-26 Jingcheng Deng, Zihao Wei, Liang Pang, Hanxing Ding, Huawei Shen, Xueqi Cheng

Wikidata-lite for Knowledge Extraction and Exploration

Wikidata is the largest collaborative general knowledge graph supported by a worldwide community. It includes many helpful topics for knowledge exploration and data science applications. However, due to the enormous size of Wikidata, it is…

Databases · Computer Science 2022-11-11 Phuc Nguyen, Hideaki Takeda

NeuralKG: An Open Source Library for Diverse Representation Learning of Knowledge Graphs

NeuralKG is an open-source Python-based library for diverse representation learning of knowledge graphs. It implements three different series of Knowledge Graph Embedding (KGE) methods, including conventional KGEs, GNN-based KGEs, and…

Machine Learning · Computer Science 2022-02-28 Wen Zhang, Xiangnan Chen, Zhen Yao, Mingyang Chen, Yushan Zhu, Hongtao Yu, Yufeng Huang, Zezhong Xu, Yajing Xu, Ningyu Zhang, Zonggang Yuan, Feiyu Xiong, Huajun Chen

Metaknowledge Extraction Based on Multi-Modal Documents

The triple-based knowledge in large-scale knowledge bases is most likely lacking in structural logic and problematic of conducting knowledge hierarchy. In this paper, we introduce the concept of metaknowledge to knowledge engineering…

Computer Vision and Pattern Recognition · Computer Science 2021-02-08 Shukan Liu, Ruilin Xu, Boying Geng, Qiao Sun, Li Duan, Yiming Liu

Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models

Even for a conservative estimate, 80% of enterprise data reside in unstructured files, stored in data lakes that accommodate heterogeneous formats. Classical search engines can no longer meet information seeking needs, especially when the…

Computation and Language · Computer Science 2024-06-06 Qiang Sun, Yuanyi Luo, Wenxiao Zhang, Sirui Li, Jichunyang Li, Kai Niu, Xiangrui Kong, Wei Liu