Related papers: Consistent Estimation for Partition-wise Regressio…

Consistent Regression using Data-Dependent Coverings

In this paper, we introduce a novel method to generate interpretable regression function estimators. The idea is based on called data-dependent coverings. The aim is to extract from the data a covering of the feature space instead of a…

Statistics Theory · Mathematics 2021-01-27 Vincent Margot , Jean-Patrick Baudry , Frédéric Guilloux , Olivier Wintenberger

Consistent and Flexible Selectivity Estimation for High-Dimensional Data

Selectivity estimation aims at estimating the number of database objects that satisfy a selection criterion. Answering this problem accurately and efficiently is essential to many applications, such as density estimation, outlier detection,…

Databases · Computer Science 2021-05-28 Yaoshu Wang , Chuan Xiao , Jianbin Qin , Rui Mao , Onizuka Makoto , Wei Wang , Rui Zhang , Yoshiharu Ishikawa

Partition-wise Linear Models

Region-specific linear models are widely used in practical applications because of their non-linear but highly interpretable model representations. One of the key challenges in their use is non-convexity in simultaneous optimization of…

Machine Learning · Statistics 2014-11-03 Hidekazu Oiwa , Ryohei Fujimaki

Partition Function Estimation: A Quantitative Study

Probabilistic graphical models have emerged as a powerful modeling tool for several real-world scenarios where one needs to reason under uncertainty. A graphical model's partition function is a central quantity of interest, and its…

Artificial Intelligence · Computer Science 2021-05-25 Durgesh Agrawal , Yash Pote , Kuldeep S Meel

Large Sample Properties of Partitioning-Based Series Estimators

We present large sample results for partitioning-based least squares nonparametric regression, a popular method for approximating conditional expectation functions in statistics, econometrics, and machine learning. First, we obtain a…

Statistics Theory · Mathematics 2020-07-20 Matias D. Cattaneo , Max H. Farrell , Yingjie Feng

Random Partitioning and Distribution-based Thresholding for Iterative Variable Screening in High Dimensions

In big data analysis, a simple task such as linear regression can become very challenging as the variable dimension $p$ grows. As a result, variable screening is inevitable in many scientific studies. In recent years, randomized algorithms…

Methodology · Statistics 2019-02-13 Yu-Hsiang Cheng , Tzee-Ming Huang , Su-Yun Huang

Sparse Partitioning: Nonlinear regression with binary or tertiary predictors, with application to association studies

This paper presents Sparse Partitioning, a Bayesian method for identifying predictors that either individually or in combination with others affect a response variable. The method is designed for regression problems involving binary or…

Quantitative Methods · Quantitative Biology 2011-08-31 Doug Speed , Simon Tavaré

Differentiable Random Partition Models

Partitioning a set of elements into an unknown number of mutually exclusive subsets is essential in many machine learning problems. However, assigning elements, such as samples in a dataset or neurons in a network layer, to an unknown and…

Machine Learning · Computer Science 2023-11-10 Thomas M. Sutter , Alain Ryser , Joram Liebeskind , Julia E. Vogt

Predicting Nearly As Well As the Optimal Twice Differentiable Regressor

We study nonlinear regression of real valued data in an individual sequence manner, where we provide results that are guaranteed to hold without any statistical assumptions. We address the convergence and undertraining issues of…

Machine Learning · Computer Science 2014-10-08 N. Denizcan Vanli , Muhammed O. Sayin , Suleyman S. Kozat

Informed Random Partition Models with Temporal Dependence

Model-based clustering is a powerful tool that is often used to discover hidden structure in data by grouping observational units that exhibit similar response values. Recently, clustering methods have been developed that permit…

Methodology · Statistics 2025-06-24 Sally Paganin , Garritt L. Page , Fernando Andrés Quintana

Building Consistent Regression Trees From Complex Sample Data

In the past several years a wide range of methods for the construction of regression trees and other estimators based on the recursive partitioning of samples have appeared in the statistics literature. Many applications involve data…

Methodology · Statistics 2014-07-07 Daniell Toth , John Eltinge

Real-Time Regression with Dividing Local Gaussian Processes

The increased demand for online prediction and the growing availability of large data sets drives the need for computationally efficient models. While exact Gaussian process regression shows various favorable theoretical properties…

Machine Learning · Computer Science 2021-08-02 Armin Lederer , Alejandro Jose Ordonez Conejo , Korbinian Maier , Wenxin Xiao , Jonas Umlauft , Sandra Hirche

Adaptive Partitioning Design and Analysis for Emulation of a Complex Computer Code

Computer models are used as replacements for physical experiments in a large variety of applications. Nevertheless, direct use of the computer model for the ultimate scientific objective is often limited by the complexity and cost of the…

Methodology · Statistics 2019-07-03 Sonja Surjanovic , William J. Welch

Scatter Matrix Concordance: A Diagnostic for Regressions on Subsets of Data

Linear regression models depend directly on the design matrix and its properties. Techniques that efficiently estimate model coefficients by partitioning rows of the design matrix are increasingly popular for large-scale problems because…

Machine Learning · Statistics 2019-07-23 Michael J. Kane , Bryan Lewis , Sekhar Tatikonda , Simon Urbanek

Consistent distribution-free $K$-sample and independence tests for univariate random variables

A popular approach for testing if two univariate random variables are statistically independent consists of partitioning the sample space into bins, and evaluating a test statistic on the binned data. The partition size matters, and the…

Methodology · Statistics 2016-04-28 Ruth Heller , Yair Heller , Shachar Kaufman , Barak Brill , Malka Gorfine

A Latent-Variable Bayesian Nonparametric Regression Model

We introduce a random partition model for Bayesian nonparametric regression. The model is based on infinitely-many disjoint regions of the range of a latent covariate-dependent Gaussian process. Given a realization of the process, the…

Methodology · Statistics 2013-01-04 George Karabatsos , Stephen G. Walker

Finding the Best Partitioning Policy for Efficient Verification of Autonomous Systems at Runtime

The autonomous systems need to decide how to react to the changes at runtime efficiently. The ability to rigorously analyze the environment and the system together is theoretically possible by the model-driven approaches; however, the model…

Software Engineering · Computer Science 2021-10-28 Melika Dastranj , Mehran Alidoost Nia , Mehdi Kargahi

Fitting statistical models is computationally challenging when the sample size or the dimension of the dataset is huge. An attractive approach for down-scaling the problem size is to first partition the dataset into subsets and then fit…

Methodology · Statistics 2016-02-15 Xiangyu Wang , David Dunson , Chenlei Leng

Variable Partitioning for Distributed Optimization

This paper is about how to partition decision variables while decomposing a large-scale optimization problem for the best performance of distributed solution methods. Solving a large-scale optimization problem sequen- tially can be…

Optimization and Control · Mathematics 2017-10-26 Yuchen Zheng , Ilbin Lee , Nicoleta Serban

An Image Segmentation Model Based on a Variational Formulation

Starting from a variational formulation, we present a model for image segmentation that employs both region statistics and edge information. This combination allows for improved flexibility, making the proposed model suitable to process a…

Analysis of PDEs · Mathematics 2019-10-15 Carlos M. Paniagua Mejia