Related papers: Adjusting Pretrained Backbones for Performativity

Continual Learning with Pretrained Backbones by Tuning in the Input Space

The intrinsic difficulty in adapting deep learning models to non-stationary environments limits the applicability of neural networks to real-world tasks. This issue is critical in practical supervised learning settings, such as the ones in…

Machine Learning · Computer Science 2023-06-09 Simone Marullo , Matteo Tiezzi , Marco Gori , Stefano Melacci , Tinne Tuytelaars

FaultFormer: Pretraining Transformers for Adaptable Bearing Fault Classification

The growth of global consumption has motivated important applications of deep learning to smart manufacturing and machine health monitoring. In particular, analyzing vibration data offers great potential to extract meaningful insights into…

Machine Learning · Computer Science 2024-05-30 Anthony Zhou , Amir Barati Farimani

MemFlow: A Lightweight Forward Memorizing Framework for Quick Domain Adaptive Feature Mapping

Deploying pretrained visual models in real-world environments often suffers from significant performance degradation due to the diversity of testing scenarios. Continuous adaptation of learning models on edge devices via unlabeled data…

Neural and Evolutionary Computing · Computer Science 2026-05-08 Jianming Lv , Chengjun Wang , Depin Liang , Qianli Ma , Wei Chen , Xueqi Cheng

Adaptive Policy Backbone via Shared Network

Reinforcement learning (RL) has achieved impressive results across domains, yet learning an optimal policy typically requires extensive interaction data, limiting practical deployment. A common remedy is to leverage priors, such as…

Machine Learning · Computer Science 2025-09-29 Bumgeun Park , Donghwan Lee

Robust Transfer Learning with Pretrained Language Models through Adapters

Transfer learning with large pretrained transformer-based language models like BERT has become a dominating approach for most NLP tasks. Simply fine-tuning those large language models on downstream tasks or combining it with task-specific…

Computation and Language · Computer Science 2021-08-06 Wenjuan Han , Bo Pang , Yingnian Wu

Optimal Classification under Performative Distribution Shift

Performative learning addresses the increasingly pervasive situations in which algorithmic decisions may induce changes in the data distribution as a consequence of their public deployment. We propose a novel view in which these…

Machine Learning · Computer Science 2024-11-05 Edwige Cyffers , Muni Sreenivas Pydi , Jamal Atif , Olivier Cappé

Active Foundational Models for Fault Diagnosis of Electrical Motors

Fault detection and diagnosis of electrical motors are of utmost importance in ensuring the safe and reliable operation of several industrial systems. Detection and diagnosis of faults at the incipient stage allows corrective actions to be…

Systems and Control · Electrical Eng. & Systems 2023-11-28 Sriram Anbalagan , Sai Shashank GP , Deepesh Agarwal , Balasubramaniam Natarajan , Babji Srinivasan

What Happens During Finetuning of Vision Transformers: An Invariance Based Investigation

The pretrain-finetune paradigm usually improves downstream performance over training a model from scratch on the same task, becoming commonplace across many areas of machine learning. While pretraining is empirically observed to be…

Computer Vision and Pattern Recognition · Computer Science 2023-07-13 Gabriele Merlin , Vedant Nanda , Ruchit Rawal , Mariya Toneva

Adaptive Deep Learning for Entity Resolution by Risk Analysis

The state-of-the-art performance on entity resolution (ER) has been achieved by deep learning. However, deep models are usually trained on large quantities of accurately labeled training data, and can not be easily tuned towards a target…

Machine Learning · Computer Science 2022-04-12 Zhaoqiang Chen , Qun Chen , Youcef Nafa , Tianyi Duan , Wei Pan , Lijun Zhang , Zhanhuai Li

Neural Priming for Sample-Efficient Adaptation

We propose Neural Priming, a technique for adapting large pretrained models to distribution shifts and downstream tasks given few or no labeled examples. Presented with class names or unlabeled test samples, Neural Priming enables the model…

Machine Learning · Computer Science 2023-12-06 Matthew Wallingford , Vivek Ramanujan , Alex Fang , Aditya Kusupati , Roozbeh Mottaghi , Aniruddha Kembhavi , Ludwig Schmidt , Ali Farhadi

These Are Not All the Features You Are Looking For: A Fundamental Bottleneck in Supervised Pretraining

Transfer learning is widely used to adapt large pretrained models to new tasks with only a small amount of new data. However, a challenge persists -- the features from the original task often do not fully cover what is needed for unseen…

Machine Learning · Computer Science 2026-02-10 Xingyu Alice Yang , Jianyu Zhang , Léon Bottou

Overwriting Pretrained Bias with Finetuning Data

Transfer learning is beneficial by allowing the expressive features of models pretrained on large-scale datasets to be finetuned for the target task of smaller, more domain-specific datasets. However, there is a concern that these…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Angelina Wang , Olga Russakovsky

Efficiently Robustify Pre-trained Models

A recent trend in deep learning algorithms has been towards training large scale models, having high parameter count and trained on big dataset. However, robustness of such large scale models towards real-world settings is still a…

Computer Vision and Pattern Recognition · Computer Science 2023-09-15 Nishant Jain , Harkirat Behl , Yogesh Singh Rawat , Vibhav Vineet

Accelerating Vision Foundation Models with Drop-in Depthwise Convolution

Pretrained vision foundation models deliver strong performance across tasks with limited fine-tuning. However, their Vision Transformer (ViT) backbones impose high inference costs, limiting deployment on resource-constrained devices. In…

Computer Vision and Pattern Recognition · Computer Science 2026-05-22 Carmelo Scribano , Mohammad Mahdi , Nedyalko Prisadnikov , Yuqian Fu , Giorgia Franchini , Danda Pani Paudel , Marko Bertogna , Luc Van Gool

Transfer of Pretrained Model Weights Substantially Improves Semi-Supervised Image Classification

Deep neural networks produce state-of-the-art results when trained on a large number of labeled examples but tend to overfit when small amounts of labeled examples are used for training. Creating a large number of labeled examples requires…

Computer Vision and Pattern Recognition · Computer Science 2021-09-13 Attaullah Sahito , Eibe Frank , Bernhard Pfahringer

Plug-in Performative Optimization

When predictions are performative, the choice of which predictor to deploy influences the distribution of future observations. The overarching goal in learning under performativity is to find a predictor that has low \emph{performative…

Machine Learning · Computer Science 2024-05-29 Licong Lin , Tijana Zrnic

Hidden State Variability of Pretrained Language Models Can Guide Computation Reduction for Transfer Learning

While transferring a pretrained language model, common approaches conventionally attach their task-specific classifiers to the top layer and adapt all the pretrained layers. We investigate whether one could make a task-specific selection on…

Computation and Language · Computer Science 2022-10-20 Shuo Xie , Jiahao Qiu , Ankita Pasad , Li Du , Qing Qu , Hongyuan Mei

Mitigating Label Shift in Tabular In-Context Learning via Test-Time Posterior Adjustment

TabPFN has recently gained attention as a foundation model for tabular datasets, achieving strong performance by leveraging in-context learning on synthetic data. However, we find that TabPFN is vulnerable to label shift, often overfitting…

Machine Learning · Computer Science 2026-05-26 Seunghan Lee

Adaptive Real-Time Multi-Loss Function Optimization Using Dynamic Memory Fusion Framework: A Case Study on Breast Cancer Segmentation

Deep learning has proven to be a highly effective tool for a wide range of applications, significantly when leveraging the power of multi-loss functions to optimize performance on multiple criteria simultaneously. However, optimal selection…

Computer Vision and Pattern Recognition · Computer Science 2025-07-29 Amin Golnari , Mostafa Diba

Cost-Effective Training of Deep CNNs with Active Model Adaptation

Deep convolutional neural networks have achieved great success in various applications. However, training an effective DNN model for a specific task is rather challenging because it requires a prior knowledge or experience to design the…

Machine Learning · Computer Science 2018-06-06 Sheng-Jun Huang , Jia-Wei Zhao , Zhao-Yang Liu