Related papers: Marginalizing Corrupted Features

Countering Overfitting with Counterfactual Examples

Overfitting is a well-known issue in machine learning that occurs when a model struggles to generalize its predictions to new, unseen data beyond the scope of its training set. Traditional techniques to mitigate overfitting include early…

Machine Learning · Computer Science 2025-12-09 Flavio Giorgi , Fabiano Veglianti , Fabrizio Silvestri , Gabriele Tolomei

Negotiated Representations to Prevent Overfitting in Machine Learning Applications

Overfitting is a phenomenon that occurs when a machine learning model is trained for too long and focused too much on the exact fitness of the training samples to the provided training labels and cannot keep track of the predictive rules…

Machine Learning · Computer Science 2025-09-22 Nuri Korhan , Samet Bayram

Quantifying Overfitting: Evaluating Neural Network Performance through Analysis of Null Space

Machine learning models that are overfitted/overtrained are more vulnerable to knowledge leakage, which poses a risk to privacy. Suppose we download or receive a model from a third-party collaborator without knowing its training accuracy.…

Machine Learning · Computer Science 2023-06-01 Hossein Rezaei , Mohammad Sabokrou

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory

In our era of enormous neural networks, empirical progress has been driven by the philosophy that more is better. Recent deep learning practice has found repeatedly that larger model size, more data, and more computation (resulting in lower…

Machine Learning · Computer Science 2024-05-17 James B. Simon , Dhruva Karkada , Nikhil Ghosh , Mikhail Belkin

Certified Robust Neural Networks: Generalization and Corruption Resistance

Recent work have demonstrated that robustness (to "corruption") can be at odds with generalization. Adversarial training, for instance, aims to reduce the problematic susceptibility of modern neural networks to small data perturbations.…

Machine Learning · Statistics 2023-05-19 Amine Bennouna , Ryan Lucas , Bart Van Parys

Keeping Deep Learning Models in Check: A History-Based Approach to Mitigate Overfitting

In software engineering, deep learning models are increasingly deployed for critical tasks such as bug detection and code review. However, overfitting remains a challenge that affects the quality, reliability, and trustworthiness of…

Software Engineering · Computer Science 2024-05-21 Hao Li , Gopi Krishnan Rajbahadur , Dayi Lin , Cor-Paul Bezemer , Zhen Ming , Jiang

Manipulating hidden-Markov-model inferences by corrupting batch data

Time-series models typically assume untainted and legitimate streams of data. However, a self-interested adversary may have incentive to corrupt this data, thereby altering a decision maker's inference. Within the broader field of…

Cryptography and Security · Computer Science 2024-02-22 William N. Caballero , Jose Manuel Camacho , Tahir Ekin , Roi Naveiro

Frequency-Based Vulnerability Analysis of Deep Learning Models against Image Corruptions

Deep learning models often face challenges when handling real-world image corruptions. In response, researchers have developed image corruption datasets to evaluate the performance of deep neural networks in handling such corruptions.…

Computer Vision and Pattern Recognition · Computer Science 2023-06-13 Harshitha Machiraju , Michael H. Herzog , Pascal Frossard

Towards Sample-efficient Overparameterized Meta-learning

An overarching goal in machine learning is to build a generalizable model with few samples. To this end, overparameterization has been the subject of immense interest to explain the generalization ability of deep nets even when the size of…

Machine Learning · Computer Science 2022-01-19 Yue Sun , Adhyyan Narang , Halil Ibrahim Gulluk , Samet Oymak , Maryam Fazel

Auto-weighted Robust Federated Learning with Corrupted Data Sources

Federated learning provides a communication-efficient and privacy-preserving training process by enabling learning statistical models with massive participants while keeping their data in local clients. However, standard federated learning…

Machine Learning · Computer Science 2022-07-15 Shenghui Li , Edith Ngai , Fanghua Ye , Thiemo Voigt

Robust Machine Learning by Transforming and Augmenting Imperfect Training Data

Machine Learning (ML) is an expressive framework for turning data into computer programs. Across many problem domains -- both in industry and policy settings -- the types of computer programs needed for accurate prediction or optimal…

Machine Learning · Computer Science 2023-12-21 Elliot Creager

Breaking Fair Binary Classification with Optimal Flipping Attacks

Minimizing risk with fairness constraints is one of the popular approaches to learning a fair classifier. Recent works showed that this approach yields an unfair classifier if the training set is corrupted. In this work, we study the…

Machine Learning · Computer Science 2022-05-10 Changhun Jo , Jy-yong Sohn , Kangwook Lee

Robust Training under Label Noise by Over-parameterization

Recently, over-parameterized deep networks, with increasingly more network parameters than training samples, have dominated the performances of modern machine learning. However, when the training data is corrupted, it has been well-known…

Machine Learning · Computer Science 2022-08-04 Sheng Liu , Zhihui Zhu , Qing Qu , Chong You

Learning Filter Functions in Regularisers by Minimising Quotients

Learning approaches have recently become very popular in the field of inverse problems. A large variety of methods has been established in recent years, ranging from bi-level learning to high-dimensional machine learning techniques. Most…

Optimization and Control · Mathematics 2017-04-05 Martin Benning , Guy Gilboa , Joana Sarah Grah , Carola-Bibiane Schönlieb

Performance Weighting for Robust Federated Learning Against Corrupted Sources

Federated Learning has emerged as a dominant computational paradigm for distributed machine learning. Its unique data privacy properties allow us to collaboratively train models while offering participating clients certain…

Machine Learning · Computer Science 2022-05-04 Dimitris Stripelis , Marcin Abram , Jose Luis Ambite

Overfitting Mechanism and Avoidance in Deep Neural Networks

Assisted by the availability of data and high performance computing, deep learning techniques have achieved breakthroughs and surpassed human performance empirically in difficult tasks, including object recognition, speech recognition, and…

Machine Learning · Computer Science 2019-01-23 Shaeke Salman , Xiuwen Liu

Mitigating Group Bias in Federated Learning: Beyond Local Fairness

The issue of group fairness in machine learning models, where certain sub-populations or groups are favored over others, has been recognized for some time. While many mitigation strategies have been proposed in centralized learning, many of…

Machine Learning · Computer Science 2023-05-18 Ganghua Wang , Ali Payani , Myungjin Lee , Ramana Kompella

Density Fixing: Simple yet Effective Regularization Method based on the Class Prior

Machine learning models suffer from overfitting, which is caused by a lack of labeled data. To tackle this problem, we proposed a framework of regularization methods, called density-fixing, that can be used commonly for supervised and…

Machine Learning · Computer Science 2020-09-08 Masanari Kimura , Ryohei Izawa

Training Uncertainty-Aware Classifiers with Conformalized Deep Learning

Deep neural networks are powerful tools to detect hidden patterns in data and leverage them to make predictions, but they are not designed to understand uncertainty and estimate reliable probabilities. In particular, they tend to be…

Machine Learning · Statistics 2022-11-10 Bat-Sheva Einbinder , Yaniv Romano , Matteo Sesia , Yanfei Zhou

Learning Deep Neural Networks under Agnostic Corrupted Supervision

Training deep neural models in the presence of corrupted supervision is challenging as the corrupted data points may significantly impact the generalization performance. To alleviate this problem, we present an efficient robust algorithm…

Machine Learning · Computer Science 2021-02-16 Boyang Liu , Mengying Sun , Ding Wang , Pang-Ning Tan , Jiayu Zhou