Related papers: Model Stability with Continuous Data Updates

Understanding the Downstream Instability of Word Embeddings

Many industrial machine learning (ML) systems require frequent retraining to keep up-to-date with constantly changing data. This retraining exacerbates a large challenge facing ML systems today: model training is unstable, i.e., small…

Computation and Language · Computer Science 2020-03-12 Megan Leszczynski , Avner May , Jian Zhang , Sen Wu , Christopher R. Aberger , Christopher Ré

A Stability Analysis of Fine-Tuning a Pre-Trained Model

Fine-tuning a pre-trained model (such as BERT, ALBERT, RoBERTa, T5, GPT, etc.) has proven to be one of the most promising paradigms in recent NLP research. However, numerous recent works indicate that fine-tuning suffers from the…

Machine Learning · Computer Science 2023-12-08 Zihao Fu , Anthony Man-Cho So , Nigel Collier

Instability in clinical risk stratification models using deep learning

While it has been well known in the ML community that deep learning models suffer from instability, the consequences for healthcare deployments are under characterised. We study the stability of different model architectures trained on…

Machine Learning · Computer Science 2022-11-22 Daniel Lopez-Martinez , Alex Yakubovich , Martin Seneviratne , Adam D. Lelkes , Akshit Tyagi , Jonas Kemp , Ethan Steinberg , N. Lance Downing , Ron C. Li , Keith E. Morse , Nigam H. Shah , Ming-Jun Chen

Reducing Model Jitter: Stable Re-training of Semantic Parsers in Production Environments

Retraining modern deep learning systems can lead to variations in model performance even when trained using the same data and hyper-parameters by simply using different random seeds. We call this phenomenon model jitter. This issue is often…

Computation and Language · Computer Science 2022-09-26 Christopher Hidey , Fei Liu , Rahul Goel

Stability for the Training of Deep Neural Networks and Other Classifiers

We examine the stability of loss-minimizing training processes that are used for deep neural networks (DNN) and other classifiers. While a classifier is optimized during training through a so-called loss function, the performance of…

Analysis of PDEs · Mathematics 2020-10-05 Leonid Berlyand , Pierre-Emmanuel Jabin , C. Alex Safsten

Inference of Continuous Linear Systems from Data with Guaranteed Stability

Machine-learning technologies for learning dynamical systems from data play an important role in engineering design. This research focuses on learning continuous linear models from data. Stability, a key feature of dynamic systems, is…

Machine Learning · Computer Science 2023-01-25 Pawan Goyal , Igor Pontes Duff , Peter Benner

New metrics for analyzing continual learners

Deep neural networks have shown remarkable performance when trained on independent and identically distributed data from a fixed set of classes. However, in real-world scenarios, it can be desirable to train models on a continuous stream of…

Machine Learning · Computer Science 2023-09-04 Nicolas Michel , Giovanni Chierchia , Romain Negrel , Jean-François Bercher , Toshihiko Yamasaki

Word Embeddings: Stability and Semantic Change

Word embeddings are computed by a class of techniques within natural language processing (NLP), that create continuous vector representations of words in a language from a large text corpus. The stochastic nature of the training process of…

Computation and Language · Computer Science 2020-08-03 Lucas Rettenmeier

Measuring and Mitigating Local Instability in Deep Neural Networks

Deep Neural Networks (DNNs) are becoming integral components of real world services relied upon by millions of users. Unfortunately, architects of these systems can find it difficult to ensure reliable performance as irrelevant details like…

Machine Learning · Computer Science 2023-05-22 Arghya Datta , Subhrangshu Nandi , Jingcheng Xu , Greg Ver Steeg , He Xie , Anoop Kumar , Aram Galstyan

Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

We consider the problem of retraining machine learning (ML) models when new batches of data become available. Existing approaches greedily optimize for predictive power independently at each batch, without considering the stability of the…

Machine Learning · Computer Science 2025-02-05 Dimitris Bertsimas , Vassilis Digalakis , Yu Ma , Phevos Paschalidis

Improving the Robustness of Deep Neural Networks via Stability Training

In this paper we address the issue of output instability of deep neural networks: small perturbations in the visual input can significantly distort the feature embeddings and output of a neural network. Such instability affects many deep…

Computer Vision and Pattern Recognition · Computer Science 2016-04-18 Stephan Zheng , Yang Song , Thomas Leung , Ian Goodfellow

On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines

Fine-tuning pre-trained transformer-based language models such as BERT has become a common practice dominating leaderboards across various NLP benchmarks. Despite the strong empirical performance of fine-tuned models, fine-tuning is an…

Machine Learning · Computer Science 2021-03-26 Marius Mosbach , Maksym Andriushchenko , Dietrich Klakow

An Information-Geometric Framework for Stability Analysis of Large Language Models under Entropic Stress

As large language models (LLMs) are increasingly deployed in high-stakes and operational settings, evaluation strategies based solely on aggregate accuracy are often insucient to characterize system reliability. This study proposes a…

Artificial Intelligence · Computer Science 2026-05-06 Hikmat Karimov , Rahid Zahid Alekberli

Evaluating the Stability of Deep Learning Latent Feature Spaces

High-dimensional datasets present substantial challenges in statistical modeling across various disciplines, necessitating effective dimensionality reduction methods. Deep learning approaches, notable for their capacity to distill essential…

Machine Learning · Computer Science 2025-08-12 Ademide O. Mabadeje , Michael J. Pyrcz

Stabilizing Linear Prediction Models using Autoencoder

To date, the instability of prognostic predictors in a sparse high dimensional model, which hinders their clinical adoption, has received little attention. Stable prediction is often overlooked in favour of performance. Yet, stability…

Machine Learning · Statistics 2016-09-29 Shivapratap Gopakumar , Truyen Tran , Dinh Phung , Svetha Venkatesh

Preservation of Feature Stability in Machine Learning Under Data Uncertainty for Decision Support in Critical Domains

In a world where Machine Learning (ML) is increasingly deployed to support decision-making in critical domains, providing decision-makers with explainable, stable, and relevant inputs becomes fundamental. Understanding how machine learning…

Machine Learning · Computer Science 2024-08-07 Karol Capała , Paulina Tworek , Jose Sousa

Stability as a Liability:Systematic Breakdown of Linguistic Structure in LLMs

Training stability is typically regarded as a prerequisite for reliable optimization in large language models. In this work, we analyze how stabilizing training dynamics affects the induced generation distribution. We show that under…

Artificial Intelligence · Computer Science 2026-02-10 Xianzhe Meng , Qiangsheng Zeng , Ling Luo , Qinghan Yang , Jiarui Hao , Wenbo Wu , Qinyu Wang , Rui Yin , Lin Qi , Renzhi Lu

Uncertainty-Driven Reliability: Selective Prediction and Trustworthy Deployment in Modern Machine Learning

Machine learning (ML) systems are increasingly deployed in high-stakes domains where reliability is paramount. This thesis investigates how uncertainty estimation can enhance the safety and trustworthiness of ML, focusing on selective…

Machine Learning · Computer Science 2025-09-09 Stephan Rabanser

To be or not to be stable, that is the question: understanding neural networks for inverse problems

The solution of linear inverse problems arising, for example, in signal and image processing is a challenging problem since the ill-conditioning amplifies, in the solution, the noise present in the data. Recently introduced algorithms based…

Numerical Analysis · Mathematics 2024-02-08 Davide Evangelista , James Nagy , Elena Morotti , Elena Loli Piccolomini

Interpreting the Robustness of Neural NLP Models to Textual Perturbations

Modern Natural Language Processing (NLP) models are known to be sensitive to input perturbations and their performance can decrease when applied to real-world, noisy data. However, it is still unclear why models are less robust to some…

Computation and Language · Computer Science 2022-03-21 Yunxiang Zhang , Liangming Pan , Samson Tan , Min-Yen Kan