Related papers: Machine Learning on sWeighted Data

Machine Learning on data with sPlot background subtraction

Data analysis in high energy physics often deals with data samples consisting of a mixture of signal and background events. The sPlot technique is a common method to subtract the contribution of the background by assigning weights to…

Machine Learning · Computer Science 2019-12-03 Maxim Borisyak, Nikita Kazeev

Converting sWeights to Probabilities with Density Ratios

The use of machine learning approaches continues to have many benefits in experimental nuclear and particle physics. One common issue is generating training data which is sufficiently realistic to give reliable results. Here we advocate…

Data Analysis, Statistics and Probability · Physics 2025-08-11 D. I. Glazier, R. Tyson

What is the Machine Learning?

Applications of machine learning tools to problems of physical interest are often criticized for producing sensitivity at the expense of transparency. To address this concern, we explore a data planing procedure for identifying combinations…

High Energy Physics - Phenomenology · Physics 2018-03-29 Spencer Chang, Timothy Cohen, Bryan Ostdiek

Multi-Class Unlearning for Image Classification via Weight Filtering

Machine Unlearning is an emerging paradigm for selectively removing the impact of training datapoints from a network. Unlike existing methods that target a limited subset or a single class, our framework unlearns all classes in a single…

Computer Vision and Pattern Recognition · Computer Science 2024-06-11 Samuele Poppi, Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

Simple and efficient algorithms for training machine learning potentials to force data

Machine learning models, trained on data from ab initio quantum simulations, are yielding molecular dynamics potentials with unprecedented accuracy. One limiting factor is the quantity of available training data, which can be…

Computational Physics · Physics 2020-06-11 Justin S. Smith, Nicholas Lubbers, Aidan P. Thompson, Kipton Barros

Learning to Re-weight Examples with Optimal Transport for Imbalanced Classification

Imbalanced data pose challenges for deep learning based classification models. One of the most widely-used approaches for tackling imbalanced data is re-weighting, where training samples are associated with different weights in the loss…

Machine Learning · Computer Science 2022-08-08 Dandan Guo, Zhuo Li, Meixi Zheng, He Zhao, Mingyuan Zhou, Hongyuan Zha

Data Selection for Short Term load forecasting

Power load forecast with Machine Learning is a fairly mature application of artificial intelligence and it is indispensable in operation, control and planning. Data selection techniques have been hardly used in this application. However,…

Machine Learning · Statistics 2019-10-17 Nestor Pereira, Miguel Angel Hombrados Herrera, Vanessa Gómez-Verdejo, Andrea A. Mammoli, Manel Martínez-Ramón

Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning

Existing semi-supervised learning (SSL) algorithms use a single weight to balance the loss of labeled and unlabeled examples, i.e., all unlabeled examples are equally weighted. But not all unlabeled data are equal. In this paper we study…

Machine Learning · Computer Science 2020-10-30 Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing

sPlot: a statistical tool to unfold data distributions

The paper advocates the use of a statistical tool dedicated to the exploration of data samples populated by several sources of events. This new technique, called sPlot, is able to unfold the contributions of the different sources to the…

Data Analysis, Statistics and Probability · Physics 2008-11-26 Muriel Pivk, Francois R. Le Diberder

Metric-Optimized Example Weights

Real-world machine learning applications often have complex test metrics, and may have training and test data that are not identically distributed. Motivated by known connections between complex test metrics and cost-weighted learning, we…

Machine Learning · Statistics 2019-06-18 Sen Zhao, Mahdi Milani Fard, Harikrishna Narasimhan, Maya Gupta

Optimizing importance weighting in the presence of sub-population shifts

A distribution shift between the training and test data can severely harm performance of machine learning models. Importance weighting addresses this issue by assigning different weights to data points during training. We argue that…

Machine Learning · Statistics 2025-11-17 Floris Holstege, Bram Wouters, Noud van Giersbergen, Cees Diks

Adaptive Loss Weighting for Machine Learning Interatomic Potentials

Training machine learning interatomic potentials often requires optimizing a loss function composed of three variables: potential energies, forces, and stress. The contribution of each variable to the total loss is typically weighted using…

Computational Physics · Physics 2024-03-29 Daniel Ocampo, Daniela Posso, Reza Namakian, Wei Gao

Understanding new tasks through the lens of training data via exponential tilting

Deploying machine learning models to new tasks is a major challenge despite the large size of the modern training datasets. However, it is conceivable that the training data can be reweighted to be more representative of the new (target)…

Machine Learning · Computer Science 2023-02-22 Subha Maity, Mikhail Yurochkin, Moulinath Banerjee, Yuekai Sun

Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting

Current deep neural networks (DNNs) can easily overfit to biased training data with corrupted labels or class imbalance. Sample re-weighting strategy is commonly used to alleviate this issue by designing a weighting function mapping from…

Machine Learning · Computer Science 2019-09-30 Jun Shu, Qi Xie, Lixuan Yi, Qian Zhao, Sanping Zhou, Zongben Xu, Deyu Meng

Reweighting with Boosted Decision Trees

Machine learning tools are commonly used in modern high energy physics (HEP) experiments. Different models, such as boosted decision trees (BDT) and artificial neural networks (ANN), are widely used in analyses and even in the software…

Data Analysis, Statistics and Probability · Physics 2016-12-21 A. Rogozhnikov

Machine Unlearning Method Based On Projection Residual

Machine learning models (mainly neural networks) are used more and more in real life. Users feed their data to the model for training. But these processes are often one-way. Once trained, the model remembers the data. Even when data is…

Machine Learning · Computer Science 2022-10-03 Zihao Cao, Jianzong Wang, Shijing Si, Zhangcheng Huang, Jing Xiao

Compensation Learning

Weighting strategy prevails in machine learning. For example, a common approach in robust machine learning is to exert lower weights on samples which are likely to be noisy or quite hard. This study reveals another undiscovered strategy,…

Machine Learning · Computer Science 2022-01-05 Rujing Yao, Ou Wu

Deep Reinforcement Learning with Weighted Q-Learning

Reinforcement learning algorithms based on Q-learning are driving Deep Reinforcement Learning (DRL) research towards solving complex problems and achieving super-human performance on many of them. Nevertheless, Q-Learning is known to be…

Machine Learning · Computer Science 2022-06-14 Andrea Cini, Carlo D'Eramo, Jan Peters, Cesare Alippi

Leveraging Local Variation in Data: Sampling and Weighting Schemes for Supervised Deep Learning

In the context of supervised learning of a function by a neural network, we claim and empirically verify that the neural network yields better results when the distribution of the data set focuses on regions where the function to learn is…

Machine Learning · Statistics 2022-09-28 Paul Novello, Gaël Poëtte, David Lugato, Pietro Congedo

Learning where to learn: Training data distribution optimization for scientific machine learning

In scientific machine learning, models are routinely deployed with parameter values or boundary conditions far from those used in training. This paper studies the learning-where-to-learn problem of designing a training data distribution…

Machine Learning · Computer Science 2025-12-09 Nicolas Guerra, Nicholas H. Nelsen, Yunan Yang