Related papers: Model Stability with Continuous Data Updates

200 papers

Many industrial machine learning (ML) systems require frequent retraining to keep up-to-date with constantly changing data. This retraining exacerbates a large challenge facing ML systems today: model training is unstable, i.e., small…

Computation and Language · Computer Science 2020-03-12 Megan Leszczynski, Avner May, Jian Zhang, Sen Wu, Christopher R. Aberger, Christopher Ré

Fine-tuning a pre-trained model (such as BERT, ALBERT, RoBERTa, T5, GPT, etc.) has proven to be one of the most promising paradigms in recent NLP research. However, numerous recent works indicate that fine-tuning suffers from the…

Machine Learning · Computer Science 2023-12-08 Zihao Fu, Anthony Man-Cho So, Nigel Collier

While it has been well known in the ML community that deep learning models suffer from instability, the consequences for healthcare deployments are under characterised. We study the stability of different model architectures trained on…

Retraining modern deep learning systems can lead to variations in model performance even when trained using the same data and hyper-parameters by simply using different random seeds. We call this phenomenon model jitter. This issue is often…

Computation and Language · Computer Science 2022-09-26 Christopher Hidey, Fei Liu, Rahul Goel

We examine the stability of loss-minimizing training processes that are used for deep neural networks (DNN) and other classifiers. While a classifier is optimized during training through a so-called loss function, the performance of…

Analysis of PDEs · Mathematics 2020-10-05 Leonid Berlyand, Pierre-Emmanuel Jabin, C. Alex Safsten

Machine-learning technologies for learning dynamical systems from data play an important role in engineering design. This research focuses on learning continuous linear models from data. Stability, a key feature of dynamic systems, is…

Machine Learning · Computer Science 2023-01-25 Pawan Goyal, Igor Pontes Duff, Peter Benner

Deep neural networks have shown remarkable performance when trained on independent and identically distributed data from a fixed set of classes. However, in real-world scenarios, it can be desirable to train models on a continuous stream of…

Machine Learning · Computer Science 2023-09-04 Nicolas Michel, Giovanni Chierchia, Romain Negrel, Jean-François Bercher, Toshihiko Yamasaki

Word embeddings are computed by a class of techniques within natural language processing (NLP), that create continuous vector representations of words in a language from a large text corpus. The stochastic nature of the training process of…

Computation and Language · Computer Science 2020-08-03 Lucas Rettenmeier

Deep Neural Networks (DNNs) are becoming integral components of real world services relied upon by millions of users. Unfortunately, architects of these systems can find it difficult to ensure reliable performance as irrelevant details like…

Machine Learning · Computer Science 2023-05-22 Arghya Datta, Subhrangshu Nandi, Jingcheng Xu, Greg Ver Steeg, He Xie, Anoop Kumar, Aram Galstyan

We consider the problem of retraining machine learning (ML) models when new batches of data become available. Existing approaches greedily optimize for predictive power independently at each batch, without considering the stability of the…

Machine Learning · Computer Science 2025-02-05 Dimitris Bertsimas, Vassilis Digalakis, Yu Ma, Phevos Paschalidis

In this paper we address the issue of output instability of deep neural networks: small perturbations in the visual input can significantly distort the feature embeddings and output of a neural network. Such instability affects many deep…

Computer Vision and Pattern Recognition · Computer Science 2016-04-18 Stephan Zheng, Yang Song, Thomas Leung, Ian Goodfellow

Fine-tuning pre-trained transformer-based language models such as BERT has become a common practice dominating leaderboards across various NLP benchmarks. Despite the strong empirical performance of fine-tuned models, fine-tuning is an…

Machine Learning · Computer Science 2021-03-26 Marius Mosbach, Maksym Andriushchenko, Dietrich Klakow

As large language models (LLMs) are increasingly deployed in high-stakes and operational settings, evaluation strategies based solely on aggregate accuracy are often insufficient to characterize system reliability. This study proposes a…

Artificial Intelligence · Computer Science 2026-05-06 Hikmat Karimov, Rahid Zahid Alekberli

High-dimensional datasets present substantial challenges in statistical modeling across various disciplines, necessitating effective dimensionality reduction methods. Deep learning approaches, notable for their capacity to distill essential…

Machine Learning · Computer Science 2025-08-12 Ademide O. Mabadeje, Michael J. Pyrcz

To date, the instability of prognostic predictors in a sparse high dimensional model, which hinders their clinical adoption, has received little attention. Stable prediction is often overlooked in favour of performance. Yet, stability…

Machine Learning · Statistics 2016-09-29 Shivapratap Gopakumar, Truyen Tran, Dinh Phung, Svetha Venkatesh

In a world where Machine Learning (ML) is increasingly deployed to support decision-making in critical domains, providing decision-makers with explainable, stable, and relevant inputs becomes fundamental. Understanding how machine learning…

Machine Learning · Computer Science 2024-08-07 Karol Capała, Paulina Tworek, Jose Sousa

Training stability is typically regarded as a prerequisite for reliable optimization in large language models. In this work, we analyze how stabilizing training dynamics affects the induced generation distribution. We show that under…

Artificial Intelligence · Computer Science 2026-02-10 Xianzhe Meng, Qiangsheng Zeng, Ling Luo, Qinghan Yang, Jiarui Hao, Wenbo Wu, Qinyu Wang, Rui Yin, Lin Qi, Renzhi Lu

Machine learning (ML) systems are increasingly deployed in high-stakes domains where reliability is paramount. This thesis investigates how uncertainty estimation can enhance the safety and trustworthiness of ML, focusing on selective…

Machine Learning · Computer Science 2025-09-09 Stephan Rabanser

The solution of linear inverse problems arising, for example, in signal and image processing is a challenging problem since the ill-conditioning amplifies, in the solution, the noise present in the data. Recently introduced algorithms based…

Numerical Analysis · Mathematics 2024-02-08 Davide Evangelista, James Nagy, Elena Morotti, Elena Loli Piccolomini

Modern Natural Language Processing (NLP) models are known to be sensitive to input perturbations and their performance can decrease when applied to real-world, noisy data. However, it is still unclear why models are less robust to some…

Computation and Language · Computer Science 2022-03-21 Yunxiang Zhang, Liangming Pan, Samson Tan, Min-Yen Kan