English
Related papers

Related papers: Latent Tree Analysis

200 papers

In data analysis, latent variables play a central role because they help provide powerful insights into a wide variety of phenomena, ranging from biological to human sciences. The latent tree model, a particular type of probabilistic…

Machine Learning · Computer Science 2014-02-05 Raphaël Mourad , Christine Sinoquet , Nevin L. Zhang , Tengfei Liu , Philippe Leray

Latent tree models are graphical models defined on trees, in which only a subset of variables is observed. They were first discussed by Judea Pearl as tree-decomposable distributions to generalise star-decomposable distributions such as the…

Statistics Theory · Mathematics 2017-08-03 Piotr Zwiernik

Model performance is frequently reported only for the overall population under consideration. However, due to heterogeneity, overall performance measures often do not accurately represent model performance within specific subgroups. We…

Methodology · Statistics 2025-06-03 Ruotao Zhang , Constantine Gatsonis , Jon Steingrimsson

Individuals do not respond uniformly to treatments, events, or interventions. Sociologists routinely partition samples into subgroups to explore how the effects of treatments vary by covariates like race, gender, and socioeconomic status.…

Other Statistics · Statistics 2019-09-23 Jennie E. Brand , Jiahui Xu , Bernard Koch , Pablo Geraldo

Latent tree learning models represent sentences by composing their words according to an induced parse tree, all based on a downstream task. These models often outperform baselines which use (externally provided) syntax trees to drive the…

Computation and Language · Computer Science 2020-01-16 Jean Maillard , Stephen Clark

We study the problem of learning a latent tree graphical model where samples are available only from a subset of variables. We propose two consistent and computationally efficient algorithms for learning minimal latent trees, that is, trees…

Machine Learning · Statistics 2010-09-15 Myung Jin Choi , Vincent Y. F. Tan , Animashree Anandkumar , Alan S. Willsky

Probabilistic Latent Semantic Analysis is a novel statistical technique for the analysis of two-mode and co-occurrence data, which has applications in information retrieval and filtering, natural language processing, machine learning from…

Machine Learning · Computer Science 2013-01-30 Thomas Hofmann

We study a parametric family of latent variable models, namely topic models, equipped with a hierarchical structure among the topic variables. Such models may be viewed as a finite mixture of the latent Dirichlet allocation (LDA) induced…

Statistics Theory · Mathematics 2024-08-27 Sunrit Chakraborty , Rayleigh Lei , XuanLong Nguyen

In this paper we investigate the use of staged tree models for discrete longitudinal data. Staged trees are a type of probabilistic graphical model for finite sample space processes. They are a natural fit for longitudinal data because a…

Methodology · Statistics 2024-01-10 Jack Storror Carter , Manuele Leonelli , Eva Riccomagno , Alessandro Ugolini

As the key to sentiment analysis, sentiment composition considers the classification of a constituent via classifications of its contained sub-constituents and rules operated on them. Such compositionality has been widely studied previously…

Computation and Language · Computer Science 2023-09-01 Zhongtao Jiang , Yuanzhe Zhang , Cao Liu , Jiansong Chen , Jun Zhao , Kang Liu

Based on decision trees, many fields have arguably made tremendous progress in recent years. In simple words, decision trees use the strategy of "divide-and-conquer" to divide the complex problem on the dependency between input features and…

Machine Learning · Computer Science 2021-01-22 Jinxiong Zhang

Connectivity estimation is challenging in the context of high-dimensional data. A useful preprocessing step is to group variables into clusters, however, it is not always clear how to do so from the perspective of connectivity estimation.…

Machine Learning · Statistics 2018-05-25 Ricardo Pio Monti , Aapo Hyvärinen

Latent Gaussian models and boosting are widely used techniques in statistics and machine learning. Tree-boosting shows excellent prediction accuracy on many data sets, but potential drawbacks are that it assumes conditional independence of…

Machine Learning · Computer Science 2022-08-24 Fabio Sigrist

Although regression trees were originally designed for large datasets, they can profitably be used on small datasets as well, including those from replicated or unreplicated complete factorial experiments. We show that in the latter…

Statistics Theory · Mathematics 2007-06-13 Wei-Yin Loh

Scalable probabilistic modeling and prediction in high dimensional multivariate time-series is a challenging problem, particularly for systems with hidden sources of dependence and/or homogeneity. Examples of such problems include dynamic…

Social and Information Networks · Computer Science 2016-06-07 Forough Arabshahi , Furong Huang , Animashree Anandkumar , Carter T. Butts , Sean M. Fitshugh

Many types of data from fields including natural language processing, computer vision, and bioinformatics, are well represented by discrete, compositional structures such as trees, sequences, or matchings. Latent structure models are a…

Machine Learning · Computer Science 2026-02-04 Vlad Niculae , Caio F. Corro , Nikita Nangia , Tsvetomila Mihaylova , André F. T. Martins

Model-based clustering methods for continuous data are well established and commonly used in a wide range of applications. However, model-based clustering methods for categorical data are less standard. Latent class analysis is a commonly…

Methodology · Statistics 2013-02-20 Isabella Gollini , Thomas Brendan Murphy

Dynamic regression trees are an attractive option for automatic regression and classification with complicated response surfaces in on-line application settings. We create a sequential tree model whose state changes in time with the…

Methodology · Statistics 2010-11-23 Matthew A. Taddy , Robert B. Gramacy , Nicholas G. Polson

A new family of tree models is proposed, which we call "differential trees." A differential tree model is constructed from multiple data sets and aims to detect distributional differences between them. The new methodology differs from the…

Methodology · Statistics 2012-10-01 Yong Wang , Ilze Ziedins , Mark Holmes , Neil Challands

This paper proposes FREEtree, a tree-based method for high dimensional longitudinal data with correlated features. Popular machine learning approaches, like Random Forests, commonly used for variable selection do not perform well when there…

Machine Learning · Statistics 2020-06-18 Yuancheng Xu , Athanasse Zafirov , R. Michael Alvarez , Dan Kojis , Min Tan , Christina M. Ramirez
‹ Prev 1 2 3 10 Next ›