Related papers: Random Subset Averaging

Overcoming bias in representational similarity analysis

Representational similarity analysis (RSA) is a multivariate technique to investigate cortical representations of objects or constructs. While avoiding ill-posed matrix inversions that plague multivariate approaches in the presence of many…

Methodology · Statistics 2021-12-03 Roberto Viviani

Complete Subset Averaging for Quantile Regressions

We propose a novel conditional quantile prediction method based on complete subset averaging (CSA) for quantile regressions. All models under consideration are potentially misspecified and the dimension of regressors goes to infinity as the…

Econometrics · Economics 2022-08-11 Ji Hyung Lee, Youngki Shin

Random Machines Regression Approach: an ensemble support vector regression model with free kernel choice

Machine learning techniques aim to reduce the generalized prediction error. To that end, ensemble methods offer a good approach, combining several models to achieve greater forecasting capacity. The Random Machines…

Machine Learning · Statistics 2020-03-31 Anderson Ara, Mateus Maia, Samuel Macêdo, Francisco Louzada

Is Representational Similarity Analysis Reliable? A Comparison with Regression

Representational Similarity Analysis (RSA) is a popular method for analyzing neuroimaging and behavioral data. Here we evaluate the accuracy and reliability of RSA in the context of model selection, and compare it to that of regression.…

Methodology · Statistics 2025-11-18 Chuanji Gao, Gang Chen, Svetlana V. Shinkareva, Rutvik H. Desai

Robust and Efficient Estimation of Potential Outcome Means under Random Assignment

We study efficiency improvements in randomized experiments for estimating a vector of potential outcome means using regression adjustment (RA) when there are more than two treatment levels. We show that linear RA which estimates separate…

Econometrics · Economics 2025-01-13 Akanksha Negi, Jeffrey M. Wooldridge

RaSE: A Variable Screening Framework via Random Subspace Ensembles

Variable screening methods have been shown to be effective in dimension reduction under the ultra-high dimensional setting. Most existing screening methods are designed to rank the predictors according to their individual contributions to…

Methodology · Statistics 2022-02-08 Ye Tian, Yang Feng

Optimal Sparse Sliced Inverse Regression via Random Projection

We propose a novel sparse sliced inverse regression method based on random projections in a large $p$ small $n$ setting. Embedded in a generalized eigenvalue framework, the proposed approach finally reduces to parallel execution of…

Methodology · Statistics 2023-08-04 Jia Zhang, Runxiong Wu, Xin Chen

Meta-Learning with Generalized Ridge Regression: High-dimensional Asymptotics, Optimality and Hyper-covariance Estimation

Meta-learning involves training models on a variety of training tasks in a way that enables them to generalize well on new, unseen test tasks. In this work, we consider meta-learning within the framework of high-dimensional multivariate…

Statistics Theory · Mathematics 2024-04-01 Yanhao Jin, Krishnakumar Balasubramanian, Debashis Paul

Adaptive Random SubSpace Learning (RSSL) Algorithm for Prediction

We present a novel adaptive random subspace learning algorithm (RSSL) for prediction. The new framework is flexible in that it can be adapted to any learning technique. In this paper, we tested the algorithm for regression and…

Machine Learning · Computer Science 2015-02-10 Mohamed Elshrif, Ernest Fokoue

Sample Weight Averaging for Stable Prediction

The challenge of Out-of-Distribution (OOD) generalization poses a foundational concern for the application of machine learning algorithms to risk-sensitive areas. Inspired by traditional importance weighting and propensity weighting…

Machine Learning · Computer Science 2025-02-12 Han Yu, Yue He, Renzhe Xu, Dongbai Li, Jiayin Zhang, Wenchao Zou, Peng Cui

Optimal Weighted Random Forests

The random forest (RF) algorithm has become a very popular prediction method for its great flexibility and promising accuracy. In RF, it is conventional to put equal weights on all the base learners (trees) to aggregate their predictions.…

Machine Learning · Statistics 2023-05-18 Xinyu Chen, Dalei Yu, Xinyu Zhang

Randomized maximum-contrast selection: subagging for large-scale regression

We introduce a very general method for sparse and large-scale variable selection. The large-scale regression setting is such that both the number of parameters and the number of samples are extremely large. The proposed method is based on…

Statistics Theory · Mathematics 2019-07-31 Jelena Bradic

Simultaneous Best Subset Selection and Dimension Reduction via Primal-Dual Iterations

Sparse reduced rank regression is an essential statistical learning method. In the contemporary literature, estimation is typically formulated as a nonconvex optimization that often yields a local optimum in numerical computation. Yet,…

Methodology · Statistics 2022-12-06 Canhong Wen, Ruipeng Dong, Xueqin Wang, Weiyu Li, Heping Zhang

REDS: Random Ensemble Deep Spatial prediction

There has been a great deal of recent interest in the development of spatial prediction algorithms for very large datasets and/or prediction domains. These methods have primarily been developed in the spatial statistics community, but there…

Computation · Statistics 2022-11-10 Ranadeep Daw, Christopher K. Wikle

Systematic Ensemble Learning for Regression

The motivation of this work is to improve the performance of standard stacking approaches or ensembles, which are composed of simple, heterogeneous base models, through the integration of the generation and selection stages for regression…

Machine Learning · Statistics 2014-03-31 Roberto Aldave, Jean-Pierre Dussault

RaSE: Random Subspace Ensemble Classification

We propose a flexible ensemble classification framework, Random Subspace Ensemble (RaSE), for sparse classification. In the RaSE algorithm, we aggregate many weak learners, where each weak learner is a base classifier trained in a subspace…

Machine Learning · Statistics 2021-06-01 Ye Tian, Yang Feng

Soft Random Sampling: A Theoretical and Empirical Analysis

Soft random sampling (SRS) is a simple yet effective approach for efficient training of large-scale deep neural networks when dealing with massive data. SRS selects a subset uniformly at random with replacement from the full data set in…

Machine Learning · Computer Science 2023-11-27 Xiaodong Cui, Ashish Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury

Sample-aware RandAugment: Search-free Automatic Data Augmentation for Effective Image Recognition

Automatic data augmentation (AutoDA) plays an important role in enhancing the generalization of neural networks. However, mainstream AutoDA methods often encounter two challenges: either the search process is excessively time-consuming,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-12 Anqi Xiao, Weichen Yu, Hongyuan Yu

Rejection Sampling with Autodifferentiation - Case study: Fitting a Hadronization Model

We present an autodifferentiable rejection sampling algorithm termed Rejection Sampling with Autodifferentiation (RSA). In conjunction with reweighting, we show that RSA can be used for efficient parameter estimation and model exploration.…

High Energy Physics - Phenomenology · Physics 2024-12-09 Nick Heller, Phil Ilten, Tony Menzo, Stephen Mrenna, Benjamin Nachman, Andrzej Siodmok, Manuel Szewc, Ahmed Youssef

PopulAtion Parameter Averaging (PAPA)

Ensemble methods combine the predictions of multiple models to improve performance, but they require significantly higher computation costs at inference time. To avoid these costs, multiple neural networks can be combined into one by…

Machine Learning · Computer Science 2024-05-07 Alexia Jolicoeur-Martineau, Emy Gervais, Kilian Fatras, Yan Zhang, Simon Lacoste-Julien