Related papers: Optimal kernel selection for density estimation

Bandwidth selection in kernel density estimation: Oracle inequalities and adaptive minimax optimality

We address the problem of density estimation with $\mathbb{L}_s$-loss by selection of kernel estimators. We develop a selection procedure and derive corresponding $\mathbb{L}_s$-risk oracle inequalities. It is shown that the proposed…

Statistics Theory · Mathematics 2012-11-26 Alexander Goldenshluger , Oleg Lepski

Optimal model selection in density estimation

We build penalized least-squares estimators using the slope heuristic and resampling penalties. We prove oracle inequalities for the selected estimator with leading constant asymptotically equal to 1. We compare the practical performances…

Statistics Theory · Mathematics 2015-03-13 Matthieu Lerasle

Sparsity in multiple kernel learning

The problem of multiple kernel learning based on penalized empirical risk minimization is discussed. The complexity penalty is determined jointly by the empirical $L_2$ norms and the reproducing kernel Hilbert space (RKHS) norms induced by…

Statistics Theory · Mathematics 2012-11-14 Vladimir Koltchinskii , Ming Yuan

Adaptive optimal kernel density estimation for directional data

We focus on the nonparametric density estimation problem with directional data. We propose a new rule for bandwidth selection for kernel density estimation. Our procedure is automatic, fully data-driven and adaptive to the smoothness degree…

Statistics Theory · Mathematics 2018-08-08 Thanh Mai Pham Ngoc

Averaging of density kernel estimators

Averaging provides an alternative to bandwidth selection for density kernel estimation. We propose a procedure to combine linearly several kernel estimators of a density obtained from different, possibly data-driven, bandwidths. The method…

Statistics Theory · Mathematics 2019-11-05 O. Chernova , F. Lavancier , P. Rochet

Selecting Biomarkers for building optimal treatment selection rules using Kernel Machines

Optimal biomarker combinations for treatment-selection can be derived by minimizing total burden to the population caused by the targeted disease and its treatment. However, when multiple biomarkers are present, including all in the model…

Applications · Statistics 2019-06-07 Sayan Dasgupta , Ying Huang

Estimator selection: a new method with applications to kernel density estimation

Estimator selection has become a crucial issue in non parametric estimation. Two widely used methods are penalized empirical risk minimization (such as penalized log-likelihood estimation) or pairwise comparison (such as Lepski's method).…

Statistics Theory · Mathematics 2017-10-19 Claire Lacour , Pascal Massart , Vincent Rivoirard

Data-driven calibration of linear estimators with minimal penalties

This paper tackles the problem of selecting among several linear estimators in non-parametric regression; this includes model selection for linear regression, the choice of a regularization parameter in kernel ridge regression, spline…

Statistics Theory · Mathematics 2011-09-15 Sylvain Arlot , Francis Bach

Sparse Feature Selection in Kernel Discriminant Analysis via Optimal Scoring

We consider the two-group classification problem and propose a kernel classifier based on the optimal scoring framework. Unlike previous approaches, we provide theoretical guarantees on the expected risk consistency of the method. We also…

Machine Learning · Statistics 2021-04-01 Alexander F. Lapanowski , Irina Gaynanova

Bayesian Model Selection for Change Point Detection and Clustering

We address the new problem of estimating a piece-wise constant signal with the purpose of detecting its change points and the levels of clusters. Our approach is to model it as a nonparametric penalized least square model selection on a…

Machine Learning · Statistics 2019-12-04 Othmane Mazhar , Cristian R. Rojas , Carlo Fischione , Mohammad R. Hesamzadeh

Optimizing Kernel Discrepancies via Subset Selection

Kernel discrepancies are a powerful tool for analyzing worst-case errors in quasi-Monte Carlo (QMC) methods. Building on recent advances in optimizing such discrepancy measures, we extend the subset selection problem to the setting of…

Machine Learning · Statistics 2025-11-05 Deyao Chen , François Clément , Carola Doerr , Nathan Kirk

Sparse Multiple Kernel Learning: Alternating Best Response and Semidefinite Relaxations

We study Sparse Multiple Kernel Learning (SMKL), which is the problem of selecting a sparse convex combination of prespecified kernels for support vector binary classification. Unlike prevailing l1 regularized approaches that approximate a…

Machine Learning · Statistics 2025-12-03 Dimitris Bertsimas , Caio de Prospero Iglesias , Nicholas A. G. Johnson

Node-screening tests for L0-penalized least-squares problem with supplementary material

We present a novel screening methodology to safely discard irrelevant nodes within a generic branch-and-bound (BnB) algorithm solving the l0-penalized least-squares problem. Our contribution is a set of two simple tests to detect sets of…

Signal Processing · Electrical Eng. & Systems 2022-02-04 Théo Guyard , Cédric Herzet , Clément Elvira

Voted Kernel Regularization

This paper presents an algorithm, Voted Kernel Regularization , that provides the flexibility of using potentially very complex kernel functions such as predictors based on much higher-degree polynomial kernels, while benefitting from…

Machine Learning · Computer Science 2015-09-16 Corinna Cortes , Prasoon Goyal , Vitaly Kuznetsov , Mehryar Mohri

Towards the study of least squares estimators with convex penalty

Penalized least squares estimation is a popular technique in high-dimensional statistics. It includes such methods as the LASSO, the group LASSO, and the nuclear norm penalized least squares. The existing theory of these methods is not…

Statistics Theory · Mathematics 2017-07-10 Pierre C. Bellec , Guillaume Lecué , Alexandre B. Tsybakov

Condition Number Analysis of Kernel-based Density Ratio Estimation

The ratio of two probability densities can be used for solving various machine learning tasks such as covariate shift adaptation (importance sampling), outlier detection (likelihood-ratio test), and feature selection (mutual information).…

Machine Learning · Statistics 2009-12-16 Takafumi Kanamori , Taiji Suzuki , Masashi Sugiyama

Fast Exact Univariate Kernel Density Estimation

This paper presents new methodology for computationally efficient kernel density estimation. It is shown that a large class of kernels allows for exact evaluation of the density estimates using simple recursions. The same methodology can be…

Computation · Statistics 2019-11-12 David P. Hofmeyr

Oracle inequalities for computationally adaptive model selection

We analyze general model selection procedures using penalized empirical loss minimization under computational constraints. While classical model selection approaches do not consider computational aspects of performing model selection, we…

Machine Learning · Statistics 2012-08-02 Alekh Agarwal , Peter L. Bartlett , John C. Duchi

Randomized Kernel Methods for Least-Squares Support Vector Machines

The least-squares support vector machine is a frequently used kernel method for non-linear regression and classification tasks. Here we discuss several approximation algorithms for the least-squares support vector machine classifier. The…

Machine Learning · Computer Science 2017-03-24 M. Andrecut

The Optimality of Kernel Classifiers in Sobolev Space

Kernel methods are widely used in machine learning, especially for classification problems. However, the theoretical analysis of kernel classification is still limited. This paper investigates the statistical performances of kernel…

Statistics Theory · Mathematics 2024-02-05 Jianfa Lai , Zhifan Li , Dongming Huang , Qian Lin