Related papers: Adaptive Scaling

Be Wary of Your Time Series Preprocessing

Normalization and scaling are fundamental preprocessing steps in time series modeling, yet their role in Transformer-based models remains underexplored from a theoretical perspective. In this work, we present the first formal analysis of…

Machine Learning · Computer Science 2026-02-20 Sofiane Ennadir , Tianze Wang , Oleg Smirnov , Sahar Asadi , Lele Cao

Normalization: A Preprocessing Stage

As we know that the normalization is a pre-processing stage of any type problem statement. Especially normalization takes important role in the field of soft computing, cloud computing etc. for manipulation of data like scale down or scale…

Other Computer Science · Computer Science 2015-03-24 S. Gopal Krishna Patro , Kishore Kumar Sahu

The choice of scaling technique matters for classification performance

Dataset scaling, also known as normalization, is an essential preprocessing step in a machine learning pipeline. It is aimed at adjusting attributes scales in a way that they all vary within the same range. This transformation is known to…

Machine Learning · Computer Science 2022-12-26 Lucas B. V. de Amorim , George D. C. Cavalcanti , Rafael M. O. Cruz

Dynamic Feature Scaling for Online Learning of Binary Classifiers

Scaling feature values is an important step in numerous machine learning tasks. Different features can have different value ranges and some form of a feature scaling is often required in order to learn an accurate classifier. However,…

Machine Learning · Computer Science 2014-07-30 Danushka Bollegala

Generalized Linear Models via the Lasso: To Scale or Not to Scale?

The Lasso regression is a popular regularization method for feature selection in statistics. Prior to computing the Lasso estimator in both linear and generalized linear models, it is common to conduct a preliminary rescaling of the feature…

Methodology · Statistics 2023-11-21 Anant Mathur , Sarat Moka , Zdravko Botev

Big Data Regression Using Tree Based Segmentation

Scaling regression to large datasets is a common problem in many application areas. We propose a two step approach to scaling regression to large datasets. Using a regression tree (CART) to segment the large dataset constitutes the first…

Machine Learning · Statistics 2017-07-26 Rajiv Sambasivan , Sourish Das

Using Scaling Laws for Data Source Utility Estimation in Domain-Specific Pre-Training

We introduce a framework for optimizing domain-specific dataset construction in foundation model training. Specifically, we seek a cost-efficient way to estimate the quality of data sources (e.g. synthetically generated or filtered web…

Machine Learning · Computer Science 2025-07-31 Oleksiy Ostapenko , Charles Guille-Escuret , Luke Kumar , Max Tian , Denis Kocetkov , Gopeshh Subbaraj , Raymond Li , Joel Lamy-Poirier , Sebastien Paquet , Torsten Scholak

Dual scaling of rating data

When applied to contingency tables, dual scaling and correspondence are mathematically equivalent methods. For the analysis of rating data, however, the methods differ. To a large extent this is due to differences in preprocessing of the…

Methodology · Statistics 2023-02-10 Michel van de Velden , Patrick J. F. Groenen

DTization: A New Method for Supervised Feature Scaling

Artificial intelligence is currently a dominant force in shaping various aspects of the world. Machine learning is a sub-field in artificial intelligence. Feature scaling is one of the data pre-processing techniques that improves the…

Machine Learning · Computer Science 2024-04-30 Niful Islam

Scale Normalization

One of the difficulties of training deep neural networks is caused by improper scaling between layers. Scaling issues introduce exploding / gradient problems, and have typically been addressed by careful scale-preserving initialization. We…

Neural and Evolutionary Computing · Computer Science 2016-04-27 Henry Z. Lo , Kevin Amaral , Wei Ding

Starting Small -- Learning with Adaptive Sample Sizes

For many machine learning problems, data is abundant and it may be prohibitive to make multiple passes through the full training set. In this context, we investigate strategies for dynamically increasing the effective sample size, when…

Machine Learning · Computer Science 2016-10-10 Hadi Daneshmand , Aurelien Lucchi , Thomas Hofmann

Improving cluster recovery with feature rescaling factors

The data preprocessing stage is crucial in clustering. Features may describe entities using different scales. To rectify this, one usually applies feature normalisation aiming at rescaling features so that none of them overpowers the others…

Machine Learning · Computer Science 2021-04-26 Renato Cordeiro de Amorim , Vladimir Makarenkov

On the cross-validation bias due to unsupervised pre-processing

Cross-validation is the de facto standard for predictive model evaluation and selection. In proper use, it provides an unbiased estimate of a model's predictive performance. However, data sets often undergo various forms of data-dependent…

Methodology · Statistics 2023-01-18 Amit Moscovich , Saharon Rosset

Adaptive Optimization for Prediction with Missing Data

When training predictive models on data with missing entries, the most widely used and versatile approach is a pipeline technique where we first impute missing entries and then compute predictions. In this paper, we view prediction with…

Machine Learning · Computer Science 2025-02-25 Dimitris Bertsimas , Arthur Delarue , Jean Pauphilet

Weighted Scaling Approach for Metabolomics Data Analysis

Systematic variation is a common issue in metabolomics data analysis. Therefore, different scaling and normalization techniques are used to preprocess the data for metabolomics data analysis. Although several scaling methods are available…

Machine Learning · Statistics 2022-08-02 Biplab Biswas , Nishith Kumar , Md Aminul Hoque , Md Ashad Alam

Efficient adaptive designs with mid-course sample size adjustment in clinical trials

Adaptive designs have been proposed for clinical trials in which the nuisance parameters or alternative of interest are unknown or likely to be misspecified before the trial. Whereas most previous works on adaptive designs and mid-course…

Methodology · Statistics 2011-05-18 Jay Bartroff , Tze Leung Lai

Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws

The composition of pretraining data is a key determinant of foundation models' performance, but there is no standard guideline for allocating a limited computational budget across different data sources. Most current approaches either rely…

Machine Learning · Computer Science 2024-10-16 Yiding Jiang , Allan Zhou , Zhili Feng , Sadhika Malladi , J. Zico Kolter

Scaling Sequential Recommendation Models with Transformers

Modeling user preferences has been mainly addressed by looking at users' interaction history with the different elements available in the system. Tailoring content to individual preferences based on historical data is the main goal of…

Machine Learning · Computer Science 2024-12-11 Pablo Zivic , Hernan Vazquez , Jorge Sanchez

Coupled Training with Privileged Information and Unlabeled Data

In many prediction problems, we have extra information during training (for example, measurements that are expensive or slow to collect) that will not be available when the model is deployed. A common strategy is to first train a model that…

Machine Learning · Statistics 2026-05-25 Jiahao Shi , Omar Hagrass , Jason M. Klusowski

Optimal Two-Step Prediction in Regression

High-dimensional prediction typically comprises two steps: variable selection and subsequent least-squares refitting on the selected variables. However, the standard variable selection procedures, such as the lasso, hinge on tuning…

Methodology · Statistics 2017-06-07 Didier Chételat , Johannes Lederer , Joseph Salmon