Related papers: Treelets--An adaptive multi-scale basis for sparse…

Discussion of: Treelets--An adaptive multi-scale basis for sparse unordered data

This is a discussion of paper "Treelets--An adaptive multi-scale basis for sparse unordered data" [arXiv:0707.0481] by Ann B. Lee, Boaz Nadler and Larry Wasserman. In this paper the authors defined a new type of dimension reduction…

Applications · Statistics 2008-07-28 Xing Qiu

Discussion of: Treelets--An adaptive multi-scale basis for sparse unordered data

We congratulate Lee, Nadler and Wasserman (henceforth LNW) on a very interesting paper on new methodology and supporting theory [arXiv:0707.0481]. Treelets seem to tackle two important problems of modern data analysis at once. For datasets…

Applications · Statistics 2008-07-28 Nicolai Meinshausen , Peter Bühlmann

Discussion of: Treelets--An adaptive multi-scale basis for sparse unordered data

We would like to congratulate Lee, Nadler and Wasserman on their contribution to clustering and data reduction methods for high $p$ and low $n$ situations. A composite of clustering and traditional principal components analysis, treelets is…

Applications · Statistics 2008-07-28 Catherine Tuglus , Mark J. van der Laan

Discussion of: Treelets--An adaptive multi-Scale basis for sparse unordered data

Discussion of "Treelets--An adaptive multi-Scale basis for sparse unordered data" [arXiv:0707.0481]

Applications · Statistics 2008-07-28 Fionn Murtagh

Discussion of: Treelets--An adaptive multi-scale basis for sparse unordered data

Discussion of "Treelets--An adaptive multi-scale basis for sparse unordered data" [arXiv:0707.0481]

Applications · Statistics 2008-07-28 Robert Tibshirani

Discussion of: Treelets--An adaptive multi-scale basis for sparse unordered data

Discussion of "Treelets--An adaptive multi-scale basis for sparse unordered data" [arXiv:0707.0481]

Applications · Statistics 2008-07-28 Peter J. Bickel , Ya'acov Ritov

Sparse Variable Selection on High Dimensional Heterogeneous Data with Tree Structured Responses

We consider the problem of sparse variable selection on high dimension heterogeneous data sets, which has been taking on renewed interest recently due to the growth of biological and medical data sets with complex, non-i.i.d. structures and…

Methodology · Statistics 2024-04-22 Hui Liu , Xiang Liu , Jing Diao , Wenting Ye , Xueling Liu , Dehui Wei

A Tree-based Dictionary Learning Framework

We propose a new outline for adaptive dictionary learning methods for sparse encoding based on a hierarchical clustering of the training data. Through recursive application of a clustering method, the data is organized into a binary…

Machine Learning · Computer Science 2020-06-11 Renato Budinich , Gerlind Plonka

Trainlets: Dictionary Learning in High Dimensions

Sparse representation has been shown to be a very powerful model for real-world signals, and has enabled the development of applications with notable performance. Combined with the ability to learn a dictionary from signal examples,…

Computer Vision and Pattern Recognition · Computer Science 2016-05-13 Jeremias Sulam , Boaz Ophir , Michael Zibulevsky , Michael Elad

Bayesian Regression Tree Ensembles that Adapt to Smoothness and Sparsity

Ensembles of decision trees are a useful tool for obtaining flexible estimates of regression functions. Examples of these methods include gradient boosted decision trees, random forests, and Bayesian CART. Two potential…

Methodology · Statistics 2018-09-18 Antonio Ricardo Linero , Yun Yang

Exploring data subsets with vtree

Variable trees are a new method for the exploration of discrete multivariate data. They display nested subsets and corresponding frequencies and percentages. Manual calculation of these quantities can be laborious, especially when there are…

Computation · Statistics 2021-02-08 Nick Barrowman , Richard J. Webster

FREEtree: A Tree-based Approach for High Dimensional Longitudinal Data With Correlated Features

This paper proposes FREEtree, a tree-based method for high dimensional longitudinal data with correlated features. Popular machine learning approaches, like Random Forests, commonly used for variable selection do not perform well when there…

Machine Learning · Statistics 2020-06-18 Yuancheng Xu , Athanasse Zafirov , R. Michael Alvarez , Dan Kojis , Min Tan , Christina M. Ramirez

New Algorithms on Wavelet Trees and Applications to Information Retrieval

Wavelet trees are widely used in the representation of sequences, permutations, text collections, binary relations, discrete points, and other succinct data structures. We show, however, that this still falls short of exploiting all of the…

Data Structures and Algorithms · Computer Science 2010-11-23 Travis Gagie , Gonzalo Navarro , Simon J. Puglisi

A Survey on Latent Tree Models and Applications

In data analysis, latent variables play a central role because they help provide powerful insights into a wide variety of phenomena, ranging from biological to human sciences. The latent tree model, a particular type of probabilistic…

Machine Learning · Computer Science 2014-02-05 Raphaël Mourad , Christine Sinoquet , Nevin L. Zhang , Tengfei Liu , Philippe Leray

Dynamic Trees for Learning and Design

Dynamic regression trees are an attractive option for automatic regression and classification with complicated response surfaces in on-line application settings. We create a sequential tree model whose state changes in time with the…

Methodology · Statistics 2010-11-23 Matthew A. Taddy , Robert B. Gramacy , Nicholas G. Polson

Data-Driven Tree Transforms and Metrics

We consider the analysis of high dimensional data given in the form of a matrix with columns consisting of observations and rows consisting of features. Often the data is such that the observations do not reside on a regular grid, and the…

Machine Learning · Statistics 2017-08-22 Gal Mishne , Ronen Talmon , Israel Cohen , Ronald R. Coifman , Yuval Kluger

Sparse data to structured imageset transformation

Machine learning problems involving sparse datasets may benefit from the use of convolutional neural networks if the numbers of samples and features are very large. Such datasets are increasingly encountered in a variety of…

Image and Video Processing · Electrical Eng. & Systems 2020-05-21 Baris Kanber

Flexible tree-structured regression for clustered data with an application to quality of life in older adults

Tree-structured models are a powerful alternative to parametric regression models if non-linear effects and interactions are present in the data. Yet, classical tree-structured models might not be appropriate if data comes in clusters of…

Methodology · Statistics 2025-01-23 Nikolai Spuck , Matthias Schmid , Moritz Berger

Random Subspace with Trees for Feature Selection Under Memory Constraints

Dealing with datasets of very high dimension is a major challenge in machine learning. In this paper, we consider the problem of feature selection in applications where the memory is not large enough to contain all features. In this…

Machine Learning · Statistics 2017-09-07 Antonio Sutera , Célia Châtel , Gilles Louppe , Louis Wehenkel , Pierre Geurts

Highly Efficient Structural Learning of Sparse Staged Trees

Several structural learning algorithms for staged tree models, an asymmetric extension of Bayesian networks, have been defined. However, they do not scale efficiently as the number of variables considered increases. Here we introduce the…

Machine Learning · Statistics 2022-06-15 Manuele Leonelli , Gherardo Varando