Related papers: Valid Inference for Machine Learning Model Paramet…

Learning To Simulate

Simulation is a useful tool in situations where training data for machine learning models is costly to annotate or even hard to acquire. In this work, we propose a reinforcement learning-based method for automatically adjusting the…

Machine Learning · Computer Science 2019-05-15 Nataniel Ruiz, Samuel Schulter, Manmohan Chandraker

Inference-time Unlearning Using Conformal Prediction

Machine unlearning is the process of efficiently removing specific information from a trained machine learning model without retraining from scratch. Existing unlearning methods, which often provide provable guarantees, typically involve…

Machine Learning · Computer Science 2026-02-04 Somnath Basu Roy Chowdhury, Rahul Kidambi, Avinava Dubey, David Wang, Gokhan Mergen, Amr Ahmed, Aranyak Mehta

Certain and Approximately Certain Models for Statistical Learning

Real-world data is often incomplete and contains missing values. To train accurate models over real-world datasets, users need to spend a substantial amount of time and resources imputing and finding proper values for missing data items. In…

Machine Learning · Statistics 2024-03-05 Cheng Zhen, Nischal Aryal, Arash Termehchy, Alireza Aghasi, Amandeep Singh Chabada

Machine Learning and Knowledge: Why Robustness Matters

Trusting machine learning algorithms requires having confidence in their outputs. Confidence is typically interpreted in terms of model reliability, where a model is reliable if it produces a high proportion of correct outputs. However,…

Machine Learning · Computer Science 2023-11-01 Jonathan Vandenburgh

Transfer learning of regression models from a sequence of datasets by penalized estimation

Transfer learning refers to the promising idea of initializing model fits based on pre-training on other data. We particularly consider regression modeling settings where parameter estimates from previous data can be used as anchoring…

Methodology · Statistics 2020-07-07 Wessel N. van Wieringen, Harald Binder

Optimal Learning via Moderate Deviations Theory

This paper proposes a statistically optimal approach for learning a function value using a confidence interval in a wide range of models, including general non-parametric estimation of an expected loss described as a stochastic programming…

Machine Learning · Statistics 2025-08-07 Arnab Ganguly, Tobias Sutter

ModelPred: A Framework for Predicting Trained Model from Training Data

In this work, we propose ModelPred, a framework that helps to understand the impact of changes in training data on a trained model. This is critical for building trust in various stages of a machine learning pipeline: from cleaning…

Machine Learning · Computer Science 2022-12-27 Yingyan Zeng, Jiachen T. Wang, Si Chen, Hoang Anh Just, Ran Jin, Ruoxi Jia

Learning from networked examples

Many machine learning algorithms are based on the assumption that training examples are drawn independently. However, this assumption does not hold anymore when learning from a networked sample because two or more training examples may…

Artificial Intelligence · Computer Science 2017-06-06 Yuyi Wang, Jan Ramon, Zheng-Chu Guo

Towards Understanding the Data Dependency of Mixup-style Training

In the Mixup training paradigm, a model is trained using convex combinations of data points and their associated labels. Despite seeing very few true data points during training, models trained using Mixup seem to still minimize the…

Machine Learning · Computer Science 2022-02-22 Muthu Chidambaram, Xiang Wang, Yuzheng Hu, Chenwei Wu, Rong Ge

Towards Sample-efficient Overparameterized Meta-learning

An overarching goal in machine learning is to build a generalizable model with few samples. To this end, overparameterization has been the subject of immense interest to explain the generalization ability of deep nets even when the size of…

Machine Learning · Computer Science 2022-01-19 Yue Sun, Adhyyan Narang, Halil Ibrahim Gulluk, Samet Oymak, Maryam Fazel

Confidence Is All You Need for MI Attacks

In this evolving era of machine learning security, membership inference attacks have emerged as a potent threat to the confidentiality of sensitive data. In this attack, adversaries aim to determine whether a particular point was used…

Machine Learning · Computer Science 2024-06-21 Abhishek Sinha, Himanshi Tibrewal, Mansi Gupta, Nikhar Waghela, Shivank Garg

Unknown Examples & Machine Learning Model Generalization

Over the past decades, researchers and ML practitioners have come up with better and better ways to build, understand and improve the quality of ML models, but mostly under the key assumption that the training data is distributed…

Machine Learning · Computer Science 2019-10-14 Yeounoh Chung, Peter J. Haas, Eli Upfal, Tim Kraska

Credal Learning Theory

Statistical learning theory is the foundation of machine learning, providing theoretical bounds for the risk of models learned from a (single) training set, assumed to issue from an unknown probability distribution. In actual deployment,…

Machine Learning · Computer Science 2024-10-25 Michele Caprio, Maryam Sultana, Eleni Elia, Fabio Cuzzolin

Transfer Learning for Linear Regression: a Statistical Test of Gain

Transfer learning, also referred to as knowledge transfer, aims at reusing knowledge from a source dataset to a similar target one. While many empirical studies illustrate the benefits of transfer learning, few theoretical results are…

Statistics Theory · Mathematics 2021-02-19 David Obst, Badih Ghattas, Jairo Cugliari, Georges Oppenheim, Sandra Claudel, Yannig Goude

Improving the Reliability for Confidence Estimation

Confidence estimation, a task that aims to evaluate the trustworthiness of the model's prediction output during deployment, has received lots of research attention recently, due to its importance for the safe deployment of deep models.…

Computer Vision and Pattern Recognition · Computer Science 2022-10-14 Haoxuan Qu, Yanchao Li, Lin Geng Foo, Jason Kuen, Jiuxiang Gu, Jun Liu

Robust Meta Learning for Image based tasks

A machine learning model that generalizes well should obtain low errors on unseen test examples. Thus, if we learn an optimal model in training data, it could have better generalization performance in testing tasks. However, learning such a…

Computer Vision and Pattern Recognition · Computer Science 2023-02-22 Penghao Jiang, Xin Ke, ZiFeng Wang, Chunxi Li

Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models

We consider the problem of estimating the expected value of information (the knowledge gradient) for Bayesian learning problems where the belief model is nonlinear in the parameters. Our goal is to maximize some metric, while simultaneously…

Machine Learning · Statistics 2016-11-23 Xinyu He, Warren B. Powell

Predicting Parameters in Deep Learning

We demonstrate that there is significant redundancy in the parameterization of several deep learning models. Given only a few weight values for each feature it is possible to accurately predict the remaining values. Moreover, we show that…

Machine Learning · Computer Science 2014-10-28 Misha Denil, Babak Shakibi, Laurent Dinh, Marc'Aurelio Ranzato, Nando de Freitas

Learning to Learn with Generative Models of Neural Network Checkpoints

We explore a data-driven approach for learning to optimize neural networks. We construct a dataset of neural network checkpoints and train a generative model on the parameters. In particular, our model is a conditional diffusion transformer…

Machine Learning · Computer Science 2022-09-27 William Peebles, Ilija Radosavovic, Tim Brooks, Alexei A. Efros, Jitendra Malik

How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design

Algorithms often have tunable parameters that impact performance metrics such as runtime and solution quality. For many algorithms used in practice, no parameter settings admit meaningful worst-case bounds, so the parameters are made…

Machine Learning · Computer Science 2021-04-27 Maria-Florina Balcan, Dan DeBlasio, Travis Dick, Carl Kingsford, Tuomas Sandholm, Ellen Vitercik