Related papers: Revisiting Score Function Estimators for $k$-Subse…

SIMPLE: A Gradient Estimator for $k$-Subset Sampling

$k$-subset sampling is ubiquitous in machine learning, enabling regularization and interpretability through sparsity. The challenge lies in rendering $k$-subset sampling amenable to end-to-end learning. This has typically involved relaxing…

Machine Learning · Computer Science 2024-06-10 Kareem Ahmed , Zhe Zeng , Mathias Niepert , Guy Van den Broeck

A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation

Model performance evaluation is a critical and expensive task in machine learning and computer vision. Without clear guidelines, practitioners often estimate model accuracy using a one-time completely random selection of the data. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Riccardo Fogliato , Pratik Patil , Mathew Monfort , Pietro Perona

Classification via score-based generative modelling

In this work, we investigated the application of score-based gradient learning in discriminative and generative classification settings. Score function can be used to characterize data distribution as an alternative to density. It can be…

Machine Learning · Computer Science 2022-07-25 Yongchao Huang

Optimal subsampling for functional quantile regression

Subsampling is an efficient method to deal with massive data. In this paper, we investigate the optimal subsampling for linear quantile regression when the covariates are functions. The asymptotic distribution of the subsampling estimator…

Numerical Analysis · Mathematics 2022-05-06 Qian Yan , Hanyu Li , Chengmei Niu

Semi-parametric estimation of shifts

We observe a large number of functions differing from each other only by a translation parameter. While the main pattern is unknown, we propose to estimate the shift parameters using $M$-estimators. Fourier transform enables to transform…

Statistics Theory · Mathematics 2007-12-18 Fabrice Gamboa , Jean-Michel Loubes , Elie Maza

Optimal Sub-sampling with Influence Functions

Sub-sampling is a common and often effective method to deal with the computational challenges of large datasets. However, for most statistical models, there is no well-motivated approach for drawing a non-uniform subsample. We show that the…

Machine Learning · Statistics 2017-09-07 Daniel Ting , Eric Brochu

Variance Reduction for Score Functions Using Optimal Baselines

Many problems involve the use of models which learn probability distributions or incorporate randomness in some way. In such problems, because computing the true expected gradient may be intractable, a gradient estimator is used to update…

Machine Learning · Computer Science 2022-12-29 Ronan Keane , H. Oliver Gao

Gradient Estimation with Simultaneous Perturbation and Compressive Sensing

This paper aims at achieving a "good" estimator for the gradient of a function on a high-dimensional space. Often such functions are not sensitive in all coordinates and the gradient of the function is almost sparse. We propose a method for…

Machine Learning · Statistics 2016-07-27 Vivek S. Borkar , Vikranth R. Dwaracherla , Neeraja Sahasrabudhe

Learning Set Functions that are Sparse in Non-Orthogonal Fourier Bases

Many applications of machine learning on discrete domains, such as learning preference functions in recommender systems or auctions, can be reduced to estimating a set function that is sparse in the Fourier domain. In this work, we present…

Machine Learning · Computer Science 2021-05-11 Chris Wendler , Andisheh Amrollahi , Bastian Seifert , Andreas Krause , Markus Püschel

Adaptive Functional Thresholding for Sparse Covariance Function Estimation in High Dimensions

Covariance function estimation is a fundamental task in multivariate functional data analysis and arises in many applications. In this paper, we consider estimating sparse covariance functions for high-dimensional functional data, where the…

Statistics Theory · Mathematics 2022-07-15 Qin Fang , Shaojun Guo , Xinghao Qiao

Concentration study of M-estimators using the influence function

We present a new finite-sample analysis of M-estimators of locations in $\mathbb{R}^d$ using the tool of the influence function. In particular, we show that the deviations of an M-estimator can be controlled thanks to its influence function…

Statistics Theory · Mathematics 2022-08-23 Timothée Mathieu

Score Function Gradient Estimation to Widen the Applicability of Decision-Focused Learning

Many real-world optimization problems contain parameters that are unknown before deployment time, either due to stochasticity or to lack of information (e.g., demand or travel times in delivery problems). A common strategy in such cases is…

Machine Learning · Computer Science 2024-06-18 Mattia Silvestri , Senne Berden , Jayanta Mandi , Ali İrfan Mahmutoğulları , Brandon Amos , Tias Guns , Michele Lombardi

Improving the local scoring algorithm using gradient sampling

We adapt the gradient sampling algorithm to the local scoring algorithm to solve complex estimation problems based on an optimization of an objective function. This overcomes non-differentiability and non-smoothness of the objective…

Methodology · Statistics 2017-05-30 Marc-Olivier Boldi , Valérie Chavez-Demoulin

Nearest Neighbour Score Estimators for Diffusion Generative Models

Score function estimation is the cornerstone of both training and sampling from diffusion generative models. Despite this fact, the most commonly used estimators are either biased neural network approximations or high variance Monte Carlo…

Machine Learning · Computer Science 2024-07-18 Matthew Niedoba , Dylan Green , Saeid Naderiparizi , Vasileios Lioutas , Jonathan Wilder Lavington , Xiaoxuan Liang , Yunpeng Liu , Ke Zhang , Setareh Dabiri , Adam Ścibior , Berend Zwartsenberg , Frank Wood

Nonparametric Score Estimators

Estimating the score, i.e., the gradient of log density function, from a set of samples generated by an unknown distribution is a fundamental task in inference and learning of probabilistic models that involve flexible yet intractable…

Machine Learning · Statistics 2020-07-01 Yuhao Zhou , Jiaxin Shi , Jun Zhu

A Weighted K-Center Algorithm for Data Subset Selection

The success of deep learning hinges on enormous data and large models, which require labor-intensive annotations and heavy computation costs. Subset selection is a fundamental problem that can play a key role in identifying smaller portions…

Machine Learning · Computer Science 2023-12-19 Srikumar Ramalingam , Pranjal Awasthi , Sanjiv Kumar

Score-based diffusion models for accelerated MRI

Score-based diffusion models provide a powerful way to model images using the gradient of the data distribution. Leveraging the learned score function as a prior, here we introduce a way to sample data from a conditional distribution given…

Image and Video Processing · Electrical Eng. & Systems 2022-07-19 Hyungjin Chung , Jong Chul Ye

Prediction Error Reduction Function as a Variable Importance Score

This paper introduces and develops a novel variable importance score function in the context of ensemble learning and demonstrates its appeal both theoretically and empirically. Our proposed score function is simple and more straightforward…

Machine Learning · Statistics 2015-01-27 Ernest Fokoué

Beyond Quantile Methods: Improved Top-K Threshold Estimation for Traditional and Learned Sparse Indexes

Top-k threshold estimation is the problem of estimating the score of the k-th highest ranking result of a search query. A good estimate can be used to speed up many common top-k query processing algorithms, and thus a number of researchers…

Information Retrieval · Computer Science 2024-12-17 Jinrui Gou , Yifan Liu , Minghao Shao , Torsten Suel

Model-specific Data Subsampling with Influence Functions

Model selection requires repeatedly evaluating models on a given dataset and measuring their relative performances. In modern applications of machine learning, the models being considered are increasingly more expensive to evaluate and the…

Machine Learning · Computer Science 2020-10-21 Anant Raj , Cameron Musco , Lester Mackey , Nicolo Fusi