Related papers: Automated Modal Parameter Estimation Using Correla…

Modal-set estimation with an application to clustering

We present a first procedure that can estimate -- with statistical consistency guarantees -- any local maxima of a density, under benign distributional conditions. The procedure estimates all such local maxima, or $\textit{modal-sets}$, of…

Machine Learning · Statistics 2017-05-30 Heinrich Jiang, Samory Kpotufe

Bootstrapped Adaptive Threshold Selection for Statistical Model Selection and Estimation

A central goal of neuroscience is to understand how activity in the nervous system is related to features of the external world, or to features of the nervous system itself. A common approach is to model neural responses as a weighted…

Machine Learning · Statistics 2015-05-14 Kristofer E. Bouchard

Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution

Many tasks in explainable machine learning, such as data valuation and feature attribution, perform expensive computation for each data point and are intractable for large datasets. These methods require efficient approximations, and…

Machine Learning · Computer Science 2024-10-31 Ian Covert, Chanwoo Kim, Su-In Lee, James Zou, Tatsunori Hashimoto

Bootstrap Model Aggregation for Distributed Statistical Learning

In distributed, or privacy-preserving learning, we are often given a set of probabilistic models estimated from different local repositories, and asked to combine them into a single model that gives efficient statistical estimation. A…

Machine Learning · Statistics 2017-03-01 Jun Han, Qiang Liu

Parameter estimation by implicit sampling

Implicit sampling is a weighted sampling method that is used in data assimilation, where one sequentially updates estimates of the state of a stochastic model based on a stream of noisy or incomplete data. Here we describe how to use…

Numerical Analysis · Mathematics 2016-01-20 Matthias Morzfeld, Xuemin Tu, Jon Wilkening, Alexandre J. Chorin

Optimal Subsampling Bootstrap for Massive Data

The bootstrap is a widely used procedure for statistical inference because of its simplicity and attractive statistical properties. However, the vanilla version of bootstrap is no longer feasible computationally for many modern massive…

Methodology · Statistics 2023-02-16 Yingying Ma, Chenlei Leng, Hansheng Wang

Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels

The label noise transition matrix, characterizing the probabilities of a training instance being wrongly annotated, is crucial to designing popular solutions to learning with noisy labels. Existing works heavily rely on finding "anchor…

Machine Learning · Computer Science 2021-07-15 Zhaowei Zhu, Yiwen Song, Yang Liu

Conjugate Mixture Models for Clustering Multimodal Data

The problem of multimodal clustering arises whenever the data are gathered with several physically different sensors. Observations from different modalities are not necessarily aligned in the sense that there is no obvious way to associate…

Machine Learning · Statistics 2020-12-10 Vasil Khalidov, Florence Forbes, Radu Horaud

Parametric Modal Regression with Error in Covariates

An inference procedure is proposed to provide consistent estimators of parameters in a modal regression model with a covariate prone to measurement error. A score-based diagnostic tool exploiting parametric bootstrap is developed to assess…

Methodology · Statistics 2024-07-02 Qingyang Liu, Xianzheng Huang

Modal Analysis Using Sparse and Co-prime Arrays

Let a measurement consist of a linear combination of damped complex exponential modes, plus noise. The problem is to estimate the parameters of these modes, as in line spectrum estimation, vibration analysis, speech processing, system…

Information Theory · Computer Science 2016-05-04 Pooria Pakrooh, Louis L. Scharf, Ali Pezeshki

A Simultaneous Sparse Approximation Method for Multidimensional Harmonic Retrieval

In this paper, a sparse-based method for the estimation of the parameters of multidimensional ($R$-D) modal (harmonic or damped) complex signals in noise is presented. The problem is formulated as $R$ simultaneous sparse approximations of…

Information Theory · Computer Science 2015-11-02 Souleymen Sahnoun, El-Hadi Djermoune, David Brie, Pierre Comon

Convergence of uncertainty estimates in Ensemble and Bayesian sparse model discovery

Sparse model identification enables nonlinear dynamical system discovery from data. However, the control of false discoveries for sparse model identification is challenging, especially in the low-data and high-noise limit. In this paper, we…

Machine Learning · Computer Science 2023-04-28 L. Mars Gao, Urban Fasel, Steven L. Brunton, J. Nathan Kutz

CAST: Corpus-Aware Self-similarity Enhanced Topic modelling

Topic modelling is a pivotal unsupervised machine learning technique for extracting valuable insights from large document collections. Existing neural topic modelling methods often encode contextual information of documents, while ignoring…

Computation and Language · Computer Science 2025-02-07 Yanan Ma, Chenghao Xiao, Chenhan Yuan, Sabine N van der Veer, Lamiece Hassan, Chenghua Lin, Goran Nenadic

Modified Multidimensional Scaling and High Dimensional Clustering

Multidimensional scaling is an important dimension reduction tool in statistics and machine learning. Yet few theoretical results characterizing its statistical performance exist, not to mention any in high dimensions. By considering a…

Methodology · Statistics 2022-03-30 Xiucai Ding, Qiang Sun

Bootstrapping multiple systems estimates to account for model selection

Multiple systems estimation using a Poisson loglinear model is a standard approach to quantifying hidden populations where data sources are based on lists of known cases. Information criteria are often used for selecting between the large…

Methodology · Statistics 2023-11-23 Bernard W. Silverman, Lax Chan, Kyle Vincent

Mastering Complex Modes: A New Method for Real-Time Modal Identification of Vibrating Systems

A novel algorithm for real-time modal identification in linear vibrating systems with complex modes is introduced, utilizing a combination of first order eigen-perturbation and second order separation techniques. In practical settings,…

Systems and Control · Electrical Eng. & Systems 2023-04-27 Satyam Panda, Sanghamitra Das, Basuraj Bhowmik, Budhaditya Hazra

Adaptive Resampling with Bootstrap for Noisy Multi-Objective Optimization Problems

The challenge of noisy multi-objective optimization lies in the constant trade-off between exploring new decision points and improving the precision of known points through resampling. This decision should take into account both the…

Machine Learning · Computer Science 2025-04-25 Timo Budszuhn, Mark Joachim Krallmann, Daniel Horn

Modeling with Categorical Features via Exact Fusion and Sparsity Regularisation

We study the high-dimensional linear regression problem with categorical predictors that have many levels. We propose a new estimation approach, which performs model compression via two mechanisms by simultaneously encouraging (a)…

Methodology · Statistics 2026-03-30 Kayhan Behdin, Riade Benbaki, Peter Radchenko, Rahul Mazumder

Subsampling to Enhance Efficiency in Input Uncertainty Quantification

In stochastic simulation, input uncertainty refers to the output variability arising from the statistical noise in specifying the input models. This uncertainty can be measured by a variance contribution in the output, which, in the…

Methodology · Statistics 2021-05-20 Henry Lam, Huajie Qian

Nonparametric Stochastic Subspaces via the Bootstrap for Characterizing Model Error

Reliable forward uncertainty quantification in engineering requires methods that account for aleatory and epistemic uncertainties. In many applications, epistemic effects arising from uncertain parameters and model form dominate prediction…

Computational Engineering, Finance, and Science · Computer Science 2025-12-18 Akash Yadav, Ruda Zhang