Related papers: Taxonomy Induction using Hypernym Subsequences

Improving Hypernymy Extraction with Distributional Semantic Classes

In this paper, we show how distributionally-induced semantic classes can be helpful for extracting hypernyms. We present methods for inducing sense-aware semantic classes using distributional semantics and using these induced semantic…

Computation and Language · Computer Science 2018-03-01 Alexander Panchenko , Dmitry Ustalov , Stefano Faralli , Simone P. Ponzetto , Chris Biemann

Taxonomy Enrichment with Text and Graph Vector Representations

Knowledge graphs such as DBpedia, Freebase or Wikidata always contain a taxonomic backbone that allows the arrangement and structuring of various concepts in accordance with the hypo-hypernym ("class-subclass") relationship. With the rapid…

Computation and Language · Computer Science 2022-01-24 Irina Nikishina , Mikhail Tikhomirov , Varvara Logacheva , Yuriy Nazarov , Alexander Panchenko , Natalia Loukachevitch

End-to-End Reinforcement Learning for Automatic Taxonomy Induction

We present a novel end-to-end reinforcement learning approach to automatic taxonomy induction from a set of terms. While prior methods treat the problem as a two-phase task (i.e., detecting hypernymy pairs followed by organizing these pairs…

Computation and Language · Computer Science 2018-05-14 Yuning Mao , Xiang Ren , Jiaming Shen , Xiaotao Gu , Jiawei Han

INRIASAC: Simple Hypernym Extraction Methods

Given a set of terms from a given domain, how can we structure them into a taxonomy without manual intervention? This is the task 17 of SemEval 2015. Here we present our simple taxonomy structuring techniques which, despite their…

Computation and Language · Computer Science 2016-01-07 Gregory Grefenstette

Learning Concept Taxonomies from Multi-modal Data

We study the problem of automatically building hypernym taxonomies from textual and visual data. Previous works in taxonomy induction generally ignore the increasingly prominent visual data, which encode important perceptual semantics.…

Computation and Language · Computer Science 2016-06-30 Hao Zhang , Zhiting Hu , Yuntian Deng , Mrinmaya Sachan , Zhicheng Yan , Eric P. Xing

Matroids Hitting Sets and Unsupervised Dependency Grammar Induction

This paper formulates a novel problem on graphs: find the minimal subset of edges in a fully connected graph, such that the resulting graph contains all spanning trees for a set of specifed sub-graphs. This formulation is motivated by an…

Discrete Mathematics · Computer Science 2017-07-18 Nicholas Harvey , Vahab Mirrokni , David Karger , Virginia Savova , Leonid Peshkin

HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion

Taxonomies are of great value to many knowledge-rich applications. As the manual taxonomy curation costs enormous human effects, automatic taxonomy construction is in great demand. However, most existing automatic taxonomy construction…

Computation and Language · Computer Science 2019-10-21 Jiaming Shen , Zeqiu Wu , Dongming Lei , Chao Zhang , Xiang Ren , Michelle T. Vanni , Brian M. Sadler , Jiawei Han

A Self-enhancement Multitask Framework for Unsupervised Aspect Category Detection

Our work addresses the problem of unsupervised Aspect Category Detection using a small set of seed words. Recent works have focused on learning embedding spaces for seed words and sentences to establish similarities between sentences and…

Computation and Language · Computer Science 2023-11-17 Thi-Nhung Nguyen , Hoang Ngo , Kiem-Hieu Nguyen , Tuan-Dung Cao

Semi-Supervised Learning for Bilingual Lexicon Induction

We consider the problem of aligning two sets of continuous word representations, corresponding to languages, to a common space in order to infer a bilingual lexicon. It was recently shown that it is possible to infer such lexicon, without…

Computation and Language · Computer Science 2024-02-13 Paul Garnier , Gauthier Guinet

PSG: Prompt-based Sequence Generation for Acronym Extraction

Acronym extraction aims to find acronyms (i.e., short-forms) and their meanings (i.e., long-forms) from the documents, which is important for scientific document understanding (SDU@AAAI-22) tasks. Previous works are devoted to modeling this…

Computation and Language · Computer Science 2021-12-10 Bin Li , Fei Xia , Yixuan Weng , Xiusheng Huang , Bin Sun , Shutao Li

Efficient Induction of Language Models Via Probabilistic Concept Formation

This paper presents a novel approach to the acquisition of language models from corpora. The framework builds on Cobweb, an early system for constructing taxonomic hierarchies of probabilistic concepts that used a tabular, attribute-value…

Computation and Language · Computer Science 2022-12-23 Christopher J. MacLellan , Peter Matsakis , Pat Langley

Watset: Automatic Induction of Synsets from a Graph of Synonyms

This paper presents a new graph-based approach that induces synsets using synonymy dictionaries and word embeddings. First, we build a weighted graph of synonyms extracted from commonly available resources, such as Wiktionary. Second, we…

Computation and Language · Computer Science 2018-05-21 Dmitry Ustalov , Alexander Panchenko , Chris Biemann

Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model

Taxonomies, which organize domain concepts into hierarchical structures, are crucial for building knowledge systems and downstream applications. As domain knowledge evolves, taxonomies need to be continuously updated to include new…

Computation and Language · Computer Science 2024-06-26 Fei Xia , Yixuan Weng , Shizhu He , Kang Liu , Jun Zhao

On Eliciting Syntax from Language Models via Hashing

Unsupervised parsing, also known as grammar induction, aims to infer syntactic structure from raw text. Recently, binary representation has exhibited remarkable information-preserving capabilities at both lexicon and syntax levels. In this…

Computation and Language · Computer Science 2024-10-08 Yiran Wang , Masao Utiyama

Neural Relation Extraction via Inner-Sentence Noise Reduction and Transfer Learning

Extracting relations is critical for knowledge base completion and construction in which distant supervised methods are widely used to extract relational facts automatically with the existing knowledge bases. However, the automatically…

Computation and Language · Computer Science 2018-11-09 Tianyi Liu , Xinsong Zhang , Wanhao Zhou , Weijia Jia

Every child should have parents: a taxonomy refinement algorithm based on hyperbolic term embeddings

We introduce the use of Poincar\'e embeddings to improve existing state-of-the-art approaches to domain-specific taxonomy induction from text as a signal for both relocating wrong hyponym terms within a (pre-induced) taxonomy as well as for…

Computation and Language · Computer Science 2019-06-06 Rami Aly , Shantanu Acharya , Alexander Ossa , Arne Köhn , Chris Biemann , Alexander Panchenko

Seed Word Selection for Weakly-Supervised Text Classification with Unsupervised Error Estimation

Weakly-supervised text classification aims to induce text classifiers from only a few user-provided seed words. The vast majority of previous work assumes high-quality seed words are given. However, the expert-annotated seed words are…

Computation and Language · Computer Science 2021-04-21 Yiping Jin , Akshay Bhatia , Dittaya Wanvarie

Predictive Subsampling for Scalable Inference in Networks

Network datasets appear across a wide range of scientific fields, including biology, physics, and the social sciences. To enable data-driven discoveries from these networks, statistical inference techniques like estimation and hypothesis…

Methodology · Statistics 2026-02-19 Arpan Kumar , Minh Tang , Srijan Sengupta

STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths

Taxonomies are important knowledge ontologies that underpin numerous applications on a daily basis, but many taxonomies used in practice suffer from the low coverage issue. We study the taxonomy expansion problem, which aims to expand…

Computation and Language · Computer Science 2020-06-19 Yue Yu , Yinghao Li , Jiaming Shen , Hao Feng , Jimeng Sun , Chao Zhang

Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning

Labeling datasets is a noteworthy challenge in machine learning, both in terms of cost and time. This research, however, leverages an efficient answer. By exploring label propagation in semi-supervised learning, we can significantly reduce…

Machine Learning · Computer Science 2024-10-16 Minoo Jafarlou , Mario M. Kubek