Related papers: Learning Polytrees

Optimal estimation of Gaussian (poly)trees

We develop optimal algorithms for learning undirected Gaussian trees and directed Gaussian polytrees from data. We consider both problems of distribution learning (i.e. in KL distance) and structure learning (i.e. exact recovery). The first…

Machine Learning · Computer Science 2024-02-12 Yuhao Wang , Ming Gao , Wai Ming Tai , Bryon Aragam , Arnab Bhattacharyya

Maximum Likelihood Bounded Tree-Width Markov Networks

Chow and Liu (1968) studied the problem of learning a maximumlikelihood Markov tree. We generalize their work to more complexMarkov networks by considering the problem of learning a maximumlikelihood Markov network of bounded complexity. We…

Machine Learning · Computer Science 2013-01-14 Nathan Srebro

Exact and Approximate Algorithms for Polytree Learning

Polytrees are a subclass of Bayesian networks that seek to capture the conditional dependencies between a set of $n$ variables as a directed forest and are motivated by their more efficient inference and improved interpretability. Since the…

Data Structures and Algorithms · Computer Science 2026-05-06 Juha Harviainen , Frank Sommer , Manuel Sorge

Chow-Liu++: Optimal Prediction-Centric Learning of Tree Ising Models

We consider the problem of learning a tree-structured Ising model from data, such that subsequent predictions computed using the model are accurate. Concretely, we aim to learn a model such that posteriors $P(X_i|X_S)$ for small sets of…

Machine Learning · Computer Science 2021-11-25 Enric Boix-Adsera , Guy Bresler , Frederic Koehler

Latent Tree Approximation in Linear Model

We consider the problem of learning underlying tree structure from noisy, mixed data obtained from a linear model. To achieve this, we use the expectation maximization algorithm combined with Chow-Liu minimum spanning tree algorithm. This…

Information Theory · Computer Science 2017-10-06 Navid Tafaghodi Khajavi

Learning High-Dimensional Markov Forest Distributions: Analysis of Error Rates

The problem of learning forest-structured discrete graphical models from i.i.d. samples is considered. An algorithm based on pruning of the Chow-Liu tree through adaptive thresholding is proposed. It is shown that this algorithm is both…

Information Theory · Computer Science 2011-02-15 Vincent Y. F. Tan , Animashree Anandkumar , Alan S. Willsky

Sample-Optimal and Efficient Learning of Tree Ising models

We show that $n$-variable tree-structured Ising models can be learned computationally-efficiently to within total variation distance $\epsilon$ from an optimal $O(n \ln n/\epsilon^2)$ samples, where $O(\cdot)$ hides an absolute constant…

Machine Learning · Computer Science 2020-12-01 Constantinos Daskalakis , Qinxuan Pan

Finding a Path is Harder than Finding a Tree

I consider the problem of learning an optimal path graphical model from data and show the problem to be NP-hard for the maximum likelihood and minimum description length approaches and a Bayesian approach. This hardness result holds despite…

Artificial Intelligence · Computer Science 2011-06-10 C. Meek

Optimal Rates for Learning Hidden Tree Structures

We provide high probability finite sample complexity guarantees for hidden non-parametric structure learning of tree-shaped graphical models, whose hidden and observable nodes are discrete random variables with either finite or countable…

Machine Learning · Statistics 2021-04-01 Konstantinos E. Nikolakakis , Dionysios S. Kalogerias , Anand D. Sarwate

Efficient Learning of Optimal Markov Network Topology with k-Tree Modeling

The seminal work of Chow and Liu (1968) shows that approximation of a finite probabilistic system by Markov trees can achieve the minimum information loss with the topology of a maximum spanning tree. Our current paper generalizes the…

Data Structures and Algorithms · Computer Science 2018-01-23 Liang Ding , Di Chang , Russell Malmberg , Aaron Martinez , David Robinson , Matthew Wicker , Hongfei Yan , Liming Cai

Near-Optimal Learning of Tree-Structured Distributions by Chow-Liu

We provide finite sample guarantees for the classical Chow-Liu algorithm (IEEE Trans.~Inform.~Theory, 1968) to learn a tree-structured graphical model of a distribution. For a distribution $P$ on $\Sigma^n$ and a tree $T$ on $n$ nodes, we…

Data Structures and Algorithms · Computer Science 2021-07-23 Arnab Bhattacharyya , Sutanu Gayen , Eric Price , N. V. Vinodchandran

Properly Learning Decision Trees with Queries Is NP-Hard

We prove that it is NP-hard to properly PAC learn decision trees with queries, resolving a longstanding open problem in learning theory (Bshouty 1993; Guijarro-Lavin-Raghavan 1999; Mehta-Raghavan 2002; Feldman 2016). While there has been a…

Computational Complexity · Computer Science 2023-07-11 Caleb Koch , Carmen Strassle , Li-Yang Tan

Forest Learning from Data and its Universal Coding

This paper considers structure learning from data with $n$ samples of $p$ variables, assuming that the structure is a forest, using the Chow-Liu algorithm. Specifically, for incomplete data, we construct two model selection algorithms that…

Information Theory · Computer Science 2018-08-06 Joe Suzuki

On Finding Optimal Polytrees

Inferring probabilistic networks from data is a notoriously difficult task. Under various goodness-of-fit measures, finding an optimal network is NP-hard, even if restricted to polytrees of bounded in-degree. Polynomial-time algorithms are…

Data Structures and Algorithms · Computer Science 2012-08-16 Serge Gaspers , Mikko Koivisto , Mathieu Liedloff , Sebastian Ordyniak , Stefan Szeider

Learning to Branch

Tree search algorithms, such as branch-and-bound, are the most widely used tools for solving combinatorial and nonconvex problems. For example, they are the foremost method for solving (mixed) integer programs and constraint satisfaction…

Artificial Intelligence · Computer Science 2018-05-18 Maria-Florina Balcan , Travis Dick , Tuomas Sandholm , Ellen Vitercik

A Short Proof that Phylogenetic Tree Reconstruction by Maximum Likelihood is Hard

Maximum likelihood is one of the most widely used techniques to infer evolutionary histories. Although it is thought to be intractable, a proof of its hardness has been lacking. Here, we give a short proof that computing the maximum…

Probability · Mathematics 2011-09-30 S. Roch

On the Parameterized Complexity of Polytree Learning

A Bayesian network is a directed acyclic graph that represents statistical dependencies between variables of a joint probability distribution. A fundamental task in data science is to learn a Bayesian network from observed data.…

Data Structures and Algorithms · Computer Science 2021-05-21 Niels Grüttemeier , Christian Komusiewicz , Nils Morawietz

The tree reconstruction game: phylogenetic reconstruction using reinforcement learning

We propose a reinforcement-learning algorithm to tackle the challenge of reconstructing phylogenetic trees. The search for the tree that best describes the data is algorithmically challenging, thus all current algorithms for phylogeny…

Populations and Evolution · Quantitative Biology 2023-03-14 Dana Azouri , Oz Granit , Michael Alburquerque , Yishay Mansour , Tal Pupko , Itay Mayrose

Active Learning for Decision Trees with Provable Guarantees

This paper advances the theoretical understanding of active learning label complexity for decision trees as binary classifiers. We make two main contributions. First, we provide the first analysis of the disagreement coefficient for…

Machine Learning · Computer Science 2026-02-20 Arshia Soltani Moakhar , Tanapoom Laoaron , Faraz Ghahremani , Kiarash Banihashem , MohammadTaghi Hajiaghayi

Choosing among alternative histories of a tree

The structure of an evolving network contains information about its past. Extracting this information efficiently, however, is, in general, a difficult challenge. We formulate a fast and efficient method to estimate the most likely history…

Physics and Society · Physics 2020-09-16 Gábor Timár , Rui A. da Costa , Sergey N. Dorogovtsev , José F. F. Mendes