Related papers: Dirichlet Fragmentation Processes

200 papers

The Dirichlet Process (DP) is a Bayesian non-parametric prior for infinite mixture modeling, where the number of mixture components grows with the number of data items. The Hierarchical Dirichlet Process (HDP) is an extension of DP for grouped…

Machine Learning · Statistics 2015-09-02 Lavanya Sita Tekumalla, Priyanka Agrawal, Indrajit Bhattacharya

We present the nested Chinese restaurant process (nCRP), a stochastic process which assigns probability distributions to infinitely-deep, infinitely-branching trees. We show how this stochastic process can be used as a prior distribution in…

Machine Learning · Statistics 2009-08-27 David M. Blei, Thomas L. Griffiths, Michael I. Jordan

This paper focuses on the problem of hierarchical non-overlapping clustering of a dataset. In such a clustering, each data item is associated with exactly one leaf node and each internal node is associated with all the data items stored in…

Machine Learning · Statistics 2021-05-26 Weipeng Huang, Nishma Laitonjam, Guangyuan Piao, Neil Hurley

The two parameter Poisson-Dirichlet Process (PDP), a generalisation of the Dirichlet Process, is increasingly being used for probabilistic modelling in discrete areas such as language technology, bioinformatics, and image analysis. There is…

Statistics Theory · Mathematics 2012-02-17 Wray Buntine, Marcus Hutter

We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP is a generalization of the nested Chinese restaurant process (nCRP) that allows each word to follow its own path to a topic node according…

Machine Learning · Statistics 2016-11-17 John Paisley, Chong Wang, David M. Blei, Michael I. Jordan

While reinforcement learning (RL) algorithms are achieving state-of-the-art performance in various challenging tasks, they can easily encounter catastrophic forgetting or interference when faced with lifelong streaming information. In the…

Machine Learning · Computer Science 2022-05-24 Zhi Wang, Chunlin Chen, Daoyi Dong

We define the beta diffusion tree, a random tree structure with a set of leaves that defines a collection of overlapping subsets of objects, known as a feature allocation. A generative process for the tree structure is defined in terms of…

Machine Learning · Statistics 2015-04-06 Creighton Heaukulani, David A. Knowles, Zoubin Ghahramani

We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP is a generalization of the nested Chinese restaurant process (nCRP) that allows each word to follow its own path to a topic node according…

Machine Learning · Statistics 2013-01-17 John Paisley, Chong Wang, David Blei, Michael I. Jordan

The Chinese restaurant process (CRP) and the stick-breaking process are the two most commonly used representations of the Dirichlet process. However, the usual proof of the connection between them is indirect, relying on abstract properties…

Statistics Theory · Mathematics 2018-10-16 Jeffrey W. Miller

The Hierarchical Dirichlet process is a discrete random measure serving as an important prior in Bayesian non-parametrics. It is motivated with the study of groups of clustered data. Each group is modelled through a level two Dirichlet…

Probability · Mathematics 2022-10-25 Shui Feng

Random partition distribution is a crucial tool for model-based clustering. This study advances the field of random partition in the context of functional spatial data, focusing on the challenges posed by hourly population data across…

Methodology · Statistics 2025-06-05 Tomoya Wakayama, Shonosuke Sugasawa, Genya Kobayashi

We use a natural ordered extension of the Chinese Restaurant Process to grow a two-parameter family of binary self-similar continuum fragmentation trees. We provide an explicit embedding of Ford's sequence of alpha model trees in the…

Probability · Mathematics 2009-09-25 Jim Pitman, Matthias Winkel

Learning from a continuous stream of non-stationary data in an unsupervised manner is arguably one of the most common and most challenging settings facing intelligent agents. Here, we attack learning under all three conditions…

Machine Learning · Computer Science 2023-05-23 Rylan Schaeffer, Gabrielle Kaili-May Liu, Yilun Du, Scott Linderman, Ila Rani Fiete

We develop the distance dependent Chinese restaurant process (CRP), a flexible class of distributions over partitions that allows for non-exchangeability. This class can be used to model many kinds of dependencies between data in infinite…

Machine Learning · Statistics 2011-08-11 David M. Blei, Peter I. Frazier

The Distributional Random Forest (DRF) is a recently introduced Random Forest algorithm to estimate multivariate conditional distributions. Due to its general estimation procedure, it can be employed to estimate a wide range of targets such…

Statistics Theory · Mathematics 2023-12-20 Jeffrey Näf, Corinne Emmenegger, Peter Bühlmann, Nicolai Meinshausen

In this paper, we provide an explicit probability distribution for classification purposes. It is derived from the Bayesian nonparametric mixture of Dirichlet process model, but with suitable modifications which remove unsuitable aspects of…

Applications · Statistics 2009-05-05 Ruth Fuentes-Garcia, Ramses H Mena, Stephen G Walker

We introduce the Pitman Yor Diffusion Tree (PYDT) for hierarchical clustering, a generalization of the Dirichlet Diffusion Tree (Neal, 2001) which removes the restriction to binary branching structure. The generative process is described…

Machine Learning · Statistics 2011-06-17 David A. Knowles, Zoubin Ghahramani

We begin by reviewing some probabilistic results about the Dirichlet Process and its close relatives, focussing on their implications for statistical modelling and analysis. We then introduce a class of simple mixture models in which…

Methodology · Statistics 2010-03-23 Peter J. Green

The Bayesian approach to inference stands out for naturally allowing borrowing information across heterogeneous populations, with different samples possibly sharing the same distribution. A popular Bayesian nonparametric model for…

Methodology · Statistics 2022-01-25 Antonio Lijoi, Igor Prünster, Giovanni Rebaudo

We consider the problem of clustering grouped data with possibly non-exchangeable groups whose dependencies can be characterized by a known directed acyclic graph. To allow the sharing of clusters among the non-exchangeable groups, we…
