Related papers: Optimal Sampling Density for Nonparametric Regress…

Optimal Centered Active Excitation in Linear System Identification

We propose an active learning algorithm for linear system identification with optimal centered noise excitation. Notably, our algorithm, based on ordinary least squares and semidefinite programming, attains the minimal sample complexity…

Optimization and Control · Mathematics 2026-04-08 Kaito Ito, Alexandre Proutiere

Active Learning Based Sampling for High-Dimensional Nonlinear Partial Differential Equations

The deep-learning-based least squares method has shown successful results in solving high-dimensional non-linear partial differential equations (PDEs). However, this method usually converges slowly. To speed up the convergence of this…

Numerical Analysis · Mathematics 2025-07-10 Wenhan Gao, Chunmei Wang

Adaptivity to Noise Parameters in Nonparametric Active Learning

This work addresses various open questions in the theory of active learning for nonparametric classification. Our contributions are both statistical and algorithmic: - We establish new minimax-rates for active learning under common…

Machine Learning · Statistics 2017-03-20 Andrea Locatelli, Alexandra Carpentier, Samory Kpotufe

Bayes-optimal Learning of Deep Random Networks of Extensive-width

We consider the problem of learning a target function corresponding to a deep, extensive-width, non-linear neural network with random Gaussian weights. We consider the asymptotic limit where the number of samples, the input dimension and…

Machine Learning · Statistics 2023-09-07 Hugo Cui, Florent Krzakala, Lenka Zdeborová

Bayes-optimal learning of an extensive-width neural network from quadratically many samples

We consider the problem of learning a target function corresponding to a single hidden layer neural network, with a quadratic activation function after the first layer, and random weights. We consider the asymptotic limit where the input…

Machine Learning · Statistics 2025-02-10 Antoine Maillard, Emanuele Troiani, Simon Martin, Florent Krzakala, Lenka Zdeborová

Robust Active Learning: Sample-Efficient Training of Robust Deep Learning Models

Active learning is an established technique to reduce the labeling cost to build high-quality machine learning models. A core component of active learning is the acquisition function that determines which data should be selected to…

Machine Learning · Computer Science 2021-12-07 Yuejun Guo, Qiang Hu, Maxime Cordy, Mike Papadakis, Yves Le Traon

Querying Easily Flip-flopped Samples for Deep Active Learning

Active learning is a machine learning paradigm that aims to improve the performance of a model by strategically selecting and querying unlabeled data. One effective selection strategy is to base it on the model's predictive uncertainty,…

Machine Learning · Computer Science 2024-05-17 Seong Jin Cho, Gwangsu Kim, Junghyun Lee, Jinwoo Shin, Chang D. Yoo

Improved Active Learning via Dependent Leverage Score Sampling

We show how to obtain improved active learning methods in the agnostic (adversarial noise) setting by combining marginal leverage score sampling with non-independent sampling strategies that promote spatial coverage. In particular, we…

Machine Learning · Computer Science 2024-05-07 Atsushi Shimizu, Xiaoou Cheng, Christopher Musco, Jonathan Weare

Concise Logarithmic Loss Function for Robust Training of Anomaly Detection Model

Recently, deep learning-based algorithms have been widely adopted because they can establish anomaly detection models with little or no domain knowledge of the task. Instead, to train the artificial neural network more…

Machine Learning · Computer Science 2023-04-17 YeongHyeon Park

Meta-Learning with Generalized Ridge Regression: High-dimensional Asymptotics, Optimality and Hyper-covariance Estimation

Meta-learning involves training models on a variety of training tasks in a way that enables them to generalize well on new, unseen test tasks. In this work, we consider meta-learning within the framework of high-dimensional multivariate…

Statistics Theory · Mathematics 2024-04-01 Yanhao Jin, Krishnakumar Balasubramanian, Debashis Paul

Density-Aware Farthest Point Sampling

We focus on training machine learning regression models in scenarios where the availability of labeled training data is limited due to computational constraints or high labeling costs. Thus, selecting suitable training sets from unlabeled…

Machine Learning · Computer Science 2026-02-10 Paolo Climaco, Jochen Garcke

Worst-case Nonlinear Regression with Error Bounds

We propose an active-learning method for nonlinear minimax regression. Given a nonlinear function that can be arbitrarily evaluated over a compact set, we fit a surrogate model, such as a feedforward neural network, by minimizing the…

Systems and Control · Electrical Eng. & Systems 2026-04-24 Alberto Bemporad

Nonparametric adaptive active learning under local smoothness condition

Active learning is typically used to label data, when the labeling process is expensive. Several active learning algorithms have been theoretically proved to perform better than their passive counterpart. However, these algorithms rely on…

Machine Learning · Computer Science 2021-02-23 Boris Ndjia Njike, Xavier Siebert

Estimation of the density of regression errors

Estimation of the density of regression errors is a fundamental issue in regression analysis and it is typically explored via a parametric approach. This article uses a nonparametric approach with the mean integrated squared error (MISE)…

Statistics Theory · Mathematics 2007-06-13 Sam Efromovich

Chernoff Sampling for Active Testing and Extension to Active Regression

Active learning can reduce the number of samples needed to perform a hypothesis test and to estimate the parameters of a model. In this paper, we revisit the work of Chernoff that described an asymptotically optimal algorithm for performing…

Machine Learning · Statistics 2022-03-14 Subhojyoti Mukherjee, Ardhendu Tripathy, Robert Nowak

A new approach to locally adaptive polynomial regression

Adaptive bandwidth selection is a fundamental challenge in nonparametric regression. This paper introduces a new bandwidth selection procedure inspired by the optimality criteria for $\ell_0$-penalized regression. Although similar in spirit…

Machine Learning · Statistics 2025-05-21 Sabyasachi Chatterjee, Subhajit Goswami, Soumendu Sundar Mukherjee

K-NN active learning under local smoothness assumption

There is a large body of work on convergence rates either in passive or active learning. Here we first outline some of the main results that have been obtained, more specifically in a nonparametric setting under assumptions about the…

Machine Learning · Computer Science 2020-07-14 Boris Ndjia Njike, Xavier Siebert

Axiomatic Approach to Variable Kernel Density Estimation

Variable kernel density estimation allows the approximation of a probability density by the mean of differently stretched and rotated kernels centered at given sampling points $y_n\in\mathbb{R}^d,\ n=1,\dots,N$. Up to now, the choice of the…

Statistics Theory · Mathematics 2018-05-07 Ilja Klebanov

Deep regression learning with optimal loss function

In this paper, we develop a novel efficient and robust nonparametric regression estimator under a framework of feedforward neural network. There are several interesting characteristics for the proposed estimator. First, the loss function is…

Methodology · Statistics 2023-09-25 Xuancheng Wang, Ling Zhou, Huazhen Lin

Active Regression by Stratification

We propose a new active learning algorithm for parametric linear regression with random design. We provide finite sample convergence guarantees for general distributions in the misspecified model. This is the first active learner for this…

Machine Learning · Statistics 2018-11-21 Sivan Sabato, Remi Munos