Related papers: HyperMiner: Topic Taxonomy Mining with Hyperbolic …

Embedding Text in Hyperbolic Spaces

Natural language text exhibits hierarchical structure in a variety of respects. Ideally, we could incorporate our prior knowledge of this hierarchical structure into unsupervised learning algorithms that work on text data. Recent work by…

Computation and Language · Computer Science 2018-06-13 Bhuwan Dhingra, Christopher J. Shallue, Mohammad Norouzi, Andrew M. Dai, George E. Dahl

Fine-Grained Entity Typing in Hyperbolic Space

How can we represent hierarchical information present in large type inventories for entity typing? We study the ability of hyperbolic embeddings to capture hierarchical relations between mentions in context and their target types in a…

Computation and Language · Computer Science 2019-06-07 Federico López, Benjamin Heinzerling, Michael Strube

Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions

Natural language definitions possess a recursive, self-explanatory semantic structure that can support representation learning methods able to preserve explicit conceptual relations and constraints in the latent space. This paper presents a…

Computation and Language · Computer Science 2024-02-19 Marco Valentino, Danilo S. Carvalho, André Freitas

Unit Ball Model for Embedding Hierarchical Structures in the Complex Hyperbolic Space

Learning the representation of data with hierarchical structures in the hyperbolic space attracts increasing attention in recent years. Due to the constant negative curvature, the hyperbolic space resembles tree metrics and captures the…

Machine Learning · Computer Science 2022-02-21 Huiru Xiao, Caigao Jiang, Yangqiu Song, James Zhang, Junwu Xiong

Hyperbolic Neural Networks

Hyperbolic spaces have recently gained momentum in the context of machine learning due to their high capacity and tree-likeliness properties. However, the representational power of hyperbolic geometry is not yet on par with Euclidean…

Machine Learning · Computer Science 2018-06-29 Octavian-Eugen Ganea, Gary Bécigneul, Thomas Hofmann

TopicNet: Semantic Graph-Guided Topic Discovery

Existing deep hierarchical topic models are able to extract semantically meaningful topics from a text corpus in an unsupervised manner and automatically organize them into a topic hierarchy. However, it is unclear how to incorporate prior…

Machine Learning · Computer Science 2021-10-28 Zhibin Duan, Yishi Xu, Bo Chen, Dongsheng Wang, Chaojie Wang, Mingyuan Zhou

Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding

Mining a set of meaningful topics organized into a hierarchy is intuitively appealing since topic correlations are ubiquitous in massive text corpora. To account for potential hierarchical topic structures, hierarchical topic models…

Computation and Language · Computer Science 2020-07-21 Yu Meng, Yunyi Zhang, Jiaxin Huang, Yu Zhang, Chao Zhang, Jiawei Han

Hierarchical Graph Topic Modeling with Topic Tree-based Transformer

Textual documents are commonly connected in a hierarchical graph structure where a central document links to others with an exponentially growing connectivity. Though Hyperbolic Graph Neural Networks (HGNNs) excel at capturing such graph…

Computation and Language · Computer Science 2025-02-18 Delvin Ce Zhang, Menglin Yang, Xiaobao Wu, Jiasheng Zhang, Hady W. Lauw

Self-supervised Topic Taxonomy Discovery in the Box Embedding Space

Topic taxonomy discovery aims at uncovering topics of different abstraction levels and constructing hierarchical relations between them. Unfortunately, most of prior work can hardly model semantic scopes of words and topics by holding the…

Computation and Language · Computer Science 2024-08-28 Yuyin Lu, Hegang Chen, Pengbo Mao, Yanghui Rao, Haoran Xie, Fu Lee Wang, Qing Li

Discovering Multi-Scale Semantic Structure in Text Corpora Using Density-Based Trees and LLM Embeddings

Recent advances in large language models enable documents to be represented as dense semantic embeddings, supporting similarity-based operations over large text collections. However, many web-scale systems still rely on flat clustering or…

Computation and Language · Computer Science 2026-01-30 Thomas Haschka, Joseph Bakarji

Towards Better Understanding with Uniformity and Explicit Regularization of Embeddings in Embedding-based Neural Topic Models

Embedding-based neural topic models could explicitly represent words and topics by embedding them to a homogeneous feature space, which shows higher interpretability. However, there are no explicit constraints for the training of…

Computation and Language · Computer Science 2022-06-17 Wei Shao, Lei Huang, Shuqi Liu, Shihua Ma, Linqi Song

Inferring Concept Hierarchies from Text Corpora via Hyperbolic Embeddings

We consider the task of inferring is-a relationships from large text corpora. For this purpose, we propose a new method combining hyperbolic embeddings and Hearst patterns. This approach allows us to set appropriate constraints for…

Computation and Language · Computer Science 2019-02-05 Matt Le, Stephen Roller, Laetitia Papaxanthos, Douwe Kiela, Maximilian Nickel

HyperExpan: Taxonomy Expansion with Hyperbolic Representation Learning

Taxonomies are valuable resources for many applications, but the limited coverage due to the expensive manual curation process hinders their general applicability. Prior works attempt to automatically expand existing taxonomies to improve…

Computation and Language · Computer Science 2021-09-23 Mingyu Derek Ma, Muhao Chen, Te-Lin Wu, Nanyun Peng

Hyperbolic Multimodal Representation Learning for Biological Taxonomies

Taxonomic classification in biodiversity research involves organizing biological specimens into structured hierarchies based on evidence, which can come from multiple modalities such as images and genetic information. We investigate whether…

Machine Learning · Computer Science 2025-08-26 ZeMing Gong, Chuanqi Tang, Xiaoliang Huo, Nicholas Pellegrino, Austin T. Wang, Graham W. Taylor, Angel X. Chang, Scott C. Lowe, Joakim Bruslund Haurum

Hyperbolic Interaction Model For Hierarchical Multi-Label Classification

Different from the traditional classification tasks which assume mutual exclusion of labels, hierarchical multi-label classification (HMLC) aims to assign multiple labels to every instance with the labels organized under hierarchical…

Machine Learning · Computer Science 2019-09-05 Boli Chen, Xin Huang, Lin Xiao, Zixin Cai, Liping Jing

Topic Modeling in Embedding Spaces

Topic modeling analyzes documents to learn meaningful patterns of words. However, existing topic models fail to learn interpretable topics when working with large and heavy-tailed vocabularies. To this end, we develop the Embedded Topic…

Information Retrieval · Computer Science 2019-07-12 Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei

Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence

Topic models extract groups of words from documents, whose interpretation as a topic hopefully allows for a better understanding of the data. However, the resulting word groups are often not coherent, making them harder to interpret.…

Computation and Language · Computer Science 2021-06-18 Federico Bianchi, Silvia Terragni, Dirk Hovy

Compositional Entailment Learning for Hyperbolic Vision-Language Models

Image-text representation learning forms a cornerstone in vision-language models, where pairs of images and textual descriptions are contrastively aligned in a shared embedding space. Since visual and textual concepts are naturally…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Avik Pal, Max van Spengler, Guido Maria D'Amely di Melendugno, Alessandro Flaborea, Fabio Galasso, Pascal Mettes

HyperText: Endowing FastText with Hyperbolic Geometry

Natural language data exhibit tree-like hierarchical structures such as the hypernym-hyponym relations in WordNet. FastText, as the state-of-the-art text classifier based on shallow neural network in Euclidean space, may not model such…

Computation and Language · Computer Science 2021-12-20 Yudong Zhu, Di Zhou, Jinghui Xiao, Xin Jiang, Xiao Chen, Qun Liu

HyHTM: Hyperbolic Geometry based Hierarchical Topic Models

Hierarchical Topic Models (HTMs) are useful for discovering topic hierarchies in a collection of documents. However, traditional HTMs often produce hierarchies where lower-level topics are unrelated and not specific enough to their…

Information Retrieval · Computer Science 2023-05-17 Simra Shahid, Tanay Anand, Nikitha Srikanth, Sumit Bhatia, Balaji Krishnamurthy, Nikaash Puri