English
Related papers

Related papers: Nested Hierarchical Dirichlet Processes

200 papers

We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP is a generalization of the nested Chinese restaurant process (nCRP) that allows each word to follow its own path to a topic node according…

Machine Learning · Statistics 2013-01-17 John Paisley , Chong Wang , David Blei , Michael I. Jordan

Dirichlet Process(DP) is a Bayesian non-parametric prior for infinite mixture modeling, where the number of mixture components grows with the number of data items. The Hierarchical Dirichlet Process (HDP), is an extension of DP for grouped…

Machine Learning · Statistics 2015-09-02 Lavanya Sita Tekumalla , Priyanka Agrawal , Indrajit Bhattacharya

Nested Chinese Restaurant Process (nCRP) topic models are powerful nonparametric Bayesian methods to extract a topic hierarchy from a given text corpus, where the hierarchical structure is automatically determined by the data. Hierarchical…

Machine Learning · Statistics 2017-02-24 Jianfei Chen , Jun Zhu , Jie Lu , Shixia Liu

This paper focuses on the problem of hierarchical non-overlapping clustering of a dataset. In such a clustering, each data item is associated with exactly one leaf node and each internal node is associated with all the data items stored in…

Machine Learning · Statistics 2021-05-26 Weipeng Huang , Nishma Laitonjam , Guangyuan Piao , Neil Hurley

We study the problem of topic modeling in corpora whose documents are organized in a multi-level hierarchy. We explore a parametric approach to this problem, assuming that the number of topics is known or can be estimated by…

Machine Learning · Statistics 2015-04-14 Do-kyum Kim , Geoffrey M. Voelker , Lawrence K. Saul

Tree structures are ubiquitous in data across many domains, and many datasets are naturally modelled by unobserved tree structures. In this paper, first we review the theory of random fragmentation processes [Bertoin, 2006], and a number of…

Machine Learning · Statistics 2015-09-17 Hong Ge , Yarin Gal , Zoubin Ghahramani

Distributed word representations are widely used for modeling words in NLP tasks. Most of the existing models generate one representation per word and do not consider different meanings of a word. We present two approaches to learn multiple…

Computation and Language · Computer Science 2018-02-14 Marzieh Fadaee , Arianna Bisazza , Christof Monz

We propose the supervised hierarchical Dirichlet process (sHDP), a nonparametric generative model for the joint distribution of a group of observations and a response variable directly associated with that whole group. We compare the sHDP…

Machine Learning · Statistics 2014-12-18 Andrew M. Dai , Amos J. Storkey

To scale non-parametric extensions of probabilistic topic models such as Latent Dirichlet allocation to larger data sets, practitioners rely increasingly on parallel and distributed systems. In this work, we study data-parallel training for…

Machine Learning · Statistics 2020-10-07 Alexander Terenin , Måns Magnusson , Leif Jonsson

A tree-based dictionary learning model is developed for joint analysis of imagery and associated text. The dictionary learning may be applied directly to the imagery from patches, or to general feature vectors extracted from patches or…

Machine Learning · Computer Science 2012-10-19 Lingbo Li , XianXing Zhang , Mingyuan Zhou , Lawrence Carin

Time-varying mixture densities occur in many scenarios, for example, the distributions of keywords that appear in publications may evolve from year to year, video frame features associated with multiple targets may evolve in a sequence. Any…

Machine Learning · Statistics 2016-04-19 Cheng Luo , Yang Xiang , Richard Yi Da Xu

The cooperative hierarchical structure is a common and significant data structure observed in, or adopted by, many research areas, such as: text mining (author-paper-word) and multi-label classification (label-instance-feature). Renowned…

Machine Learning · Computer Science 2017-07-19 Junyu Xuan , Jie Lu , Guangquan Zhang , Richard Yi Da Xu

Social media has now become the de facto information source on real world events. The challenge, however, due to the high volume and velocity nature of social media streams, is in how to follow all posts pertaining to a given event over…

Information Retrieval · Computer Science 2016-06-14 P. K. Srijith , Mark Hepple , Kalina Bontcheva , Daniel Preotiuc-Pietro

We introduce a non-parametric hierarchical Bayesian approach for open-ended 3D object categorization, named the Local Hierarchical Dirichlet Process (Local-HDP). This method allows an agent to learn independent topics for each category…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 H. Ayoobi , H. Kasaei , M. Cao , R. Verbrugge , B. Verheij

Bayesian nonparametric hierarchical priors are highly effective in providing flexible models for latent data structures exhibiting sharing of information between and across groups. Most prominent is the Hierarchical Dirichlet Process (HDP),…

Statistics Theory · Mathematics 2021-03-23 Lancelot F. James , Juho Lee , Abhinav Pandey

We consider an array of random variables, taking values in a complete and separable metric space, that exhibits a kind of symmetry which we call row exchangeability. Given such an array, a natural model for Bayesian nonparametric inference…

Statistics Theory · Mathematics 2025-10-10 Evan Donald , Jason Swanson

We propose a novel document generation process based on hierarchical latent tree models (HLTMs) learned from data. An HLTM has a layer of observed word variables at the bottom and multiple layers of latent variables on top. For each…

Computation and Language · Computer Science 2019-07-01 Peixian Chen , Zhourong Chen , Nevin L. Zhang

Community detection has been an active research area for decades. Among all probabilistic models, Stochastic Block Model has been the most popular one. This paper introduces a novel probabilistic model: RW-HDP, based on random walks and…

Applications · Statistics 2016-08-04 Ruimin Zhu , Wenxin Jiang

In Natural Language Processing (NLP), predicting linguistic structures, such as parsing and chunking, has mostly relied on manual annotations of syntactic structures. This paper introduces an unsupervised approach to chunking, a syntactic…

Computation and Language · Computer Science 2025-12-19 Zijun Wu , Anup Anand Deshmukh , Yongkang Wu , Jimmy Lin , Lili Mou

This paper presents a new Bayesian non-parametric model by extending the usage of Hierarchical Dirichlet Allocation to extract tree structured word clusters from text data. The inference algorithm of the model collects words in a cluster if…

Computation and Language · Computer Science 2016-01-22 Halid Ziya Yerebakan , Fitsum Reda , Yiqiang Zhan , Yoshihisa Shinagawa
‹ Prev 1 2 3 10 Next ›