Related papers: Transferred Discrepancy: Quantifying the Differenc…

Measuring Representational Shifts in Continual Learning: A Linear Transformation Perspective

In continual learning scenarios, catastrophic forgetting of previously learned tasks is a critical issue, making it essential to effectively measure such forgetting. Recently, there has been growing interest in focusing on representation…

Machine Learning · Computer Science 2025-06-13 Joonkyu Kim , Yejin Kim , Jy-yong Sohn

Features are fate: a theory of transfer learning in high-dimensional regression

With the emergence of large-scale pre-trained neural networks, methods to adapt such "foundation" models to data-limited downstream tasks have become a necessity. Fine-tuning, preference optimization, and transfer learning have all been…

Machine Learning · Statistics 2025-07-09 Javan Tahir , Surya Ganguli , Grant M. Rotskoff

Rethinking Two Consensuses of the Transferability in Deep Learning

Deep transfer learning (DTL) has formed a long-term quest toward enabling deep neural networks (DNNs) to reuse historical experiences as efficiently as humans. This ability is named knowledge transferability. A commonly used paradigm for…

Computer Vision and Pattern Recognition · Computer Science 2022-12-02 Yixiong Chen , Jingxian Li , Chris Ding , Li Liu

These Are Not All the Features You Are Looking For: A Fundamental Bottleneck in Supervised Pretraining

Transfer learning is widely used to adapt large pretrained models to new tasks with only a small amount of new data. However, a challenge persists -- the features from the original task often do not fully cover what is needed for unseen…

Machine Learning · Computer Science 2026-02-10 Xingyu Alice Yang , Jianyu Zhang , Léon Bottou

Learning unbiased features

A key element in transfer learning is representation learning; if representations can be developed that expose the relevant factors underlying the data, then new tasks and domains can be learned readily based on mappings of these salient…

Machine Learning · Computer Science 2014-12-18 Yujia Li , Kevin Swersky , Richard Zemel

Deep Metric Learning for Computer Vision: A Brief Overview

Objective functions that optimize deep neural networks play a vital role in creating an enhanced feature representation of the input data. Although cross-entropy-based loss formulations have been extensively used in a variety of supervised…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Deen Dayal Mohan , Bhavin Jawade , Srirangaraj Setlur , Venu Govindaraj

Representation Transfer by Optimal Transport

Learning generic representations with deep networks requires massive training samples and significant computer resources. To learn a new specific task, an important issue is to transfer the generic teacher's representation to a student…

Machine Learning · Computer Science 2021-03-01 Xuhong Li , Yves Grandvalet , Rémi Flamary , Nicolas Courty , Dejing Dou

Generalization Performance of Transfer Learning: Overparameterized and Underparameterized Regimes

Transfer learning is a useful technique for achieving improved performance and reducing training costs by leveraging the knowledge gained from source tasks and applying it to target tasks. Assessing the effectiveness of transfer learning…

Machine Learning · Computer Science 2023-06-12 Peizhong Ju , Sen Lin , Mark S. Squillante , Yingbin Liang , Ness B. Shroff

Towards Understanding the Transferability of Deep Representations

Deep neural networks trained on a wide range of datasets demonstrate impressive transferability. Deep features appear general in that they are applicable to many datasets and tasks. Such property is in prevalent use in real-world…

Machine Learning · Computer Science 2019-09-27 Hong Liu , Mingsheng Long , Jianmin Wang , Michael I. Jordan

Understanding the Transferability of Representations via Task-Relatedness

The growing popularity of transfer learning, due to the availability of models pre-trained on vast amounts of data, makes it imperative to understand when the knowledge of these pre-trained models can be transferred to obtain…

Machine Learning · Computer Science 2024-10-30 Akshay Mehra , Yunbei Zhang , Jihun Hamm

A Taxonomy of Similarity Metrics for Markov Decision Processes

Although the notion of task similarity is potentially interesting in a wide range of areas such as curriculum learning or automated planning, it has mostly been tied to transfer learning. Transfer is based on the idea of reusing the…

Machine Learning · Computer Science 2021-03-09 Álvaro Visús , Javier García , Fernando Fernández

A Practitioners' Guide to Transfer Learning for Text Classification using Convolutional Neural Networks

Transfer Learning (TL) plays a crucial role when a given dataset has insufficient labeled examples to train an accurate model. In such scenarios, the knowledge accumulated within a model pre-trained on a source dataset can be transferred to…

Computation and Language · Computer Science 2018-01-22 Tushar Semwal , Gaurav Mathur , Promod Yenigalla , Shivashankar B. Nair

Transfer Learning for Analysis of Collective and Non-Collective Thomson Scattering Spectra

Thomson scattering (TS) diagnostics provide reliable, minimally perturbative measurements of fundamental plasma parameters, such as electron density ($n_e$) and electron temperature ($T_e$). Deep neural networks can provide accurate…

Plasma Physics · Physics 2025-12-23 T. Van Hoomissen , J. Alhuthali , A. M. Ortiz , D. A. Mariscal , R. S. Dorst , S. Eisenbach , H. Zhang , J. J. Pilgram , C. G. Constantin , L. Rovige , C. Niemann , D. B. Schaeffer

Sharing Matters for Generalization in Deep Metric Learning

Learning the similarity between images constitutes the foundation for numerous vision tasks. The common paradigm is discriminative metric learning, which seeks an embedding that separates different training classes. However, the main…

Computer Vision and Pattern Recognition · Computer Science 2021-09-10 Timo Milbich , Karsten Roth , Biagio Brattoli , Björn Ommer

Deep Task-specific Bottom Representation Network for Multi-Task Recommendation

Neural-based multi-task learning (MTL) has gained significant improvement, and it has been successfully applied to recommendation system (RS). Recent deep MTL methods for RS (e.g. MMoE, PLE) focus on designing soft gating-based…

Artificial Intelligence · Computer Science 2023-08-21 Qi Liu , Zhilong Zhou , Gangwei Jiang , Tiezheng Ge , Defu Lian

Nested Learning For Multi-Granular Tasks

Standard deep neural networks (DNNs) are commonly trained in an end-to-end fashion for specific tasks such as object recognition, face identification, or character recognition, among many examples. This specificity often leads to…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Raphaël Achddou , J. Matias di Martino , Guillermo Sapiro

Diffused Redundancy in Pre-trained Representations

Representations learned by pre-training a neural network on a large dataset are increasingly used successfully to perform a variety of downstream tasks. In this work, we take a closer look at how features are encoded in such pre-trained…

Machine Learning · Computer Science 2023-11-15 Vedant Nanda , Till Speicher , John P. Dickerson , Soheil Feizi , Krishna P. Gummadi , Adrian Weller

Transfer Learning with Deep Tabular Models

Recent work on deep learning for tabular data demonstrates the strong performance of deep tabular models, often bridging the gap between gradient boosted decision trees and neural networks. Accuracy aside, a major advantage of neural models…

Machine Learning · Computer Science 2023-08-08 Roman Levin , Valeriia Cherepanova , Avi Schwarzschild , Arpit Bansal , C. Bayan Bruss , Tom Goldstein , Andrew Gordon Wilson , Micah Goldblum

Deep Reinforcement Learning and The Tale of Two Temporal Difference Errors

The temporal difference (TD) error was first formalized in Sutton (1988), where it was first characterized as the difference between temporally successive predictions, and later, in that same work, formulated as the difference between a…

Machine Learning · Computer Science 2026-03-24 Juan Sebastian Rojas , Chi-Guhn Lee

Do Adversarially Robust ImageNet Models Transfer Better?

Transfer learning is a widely-used paradigm in deep learning, where models pre-trained on standard datasets can be efficiently adapted to downstream tasks. Typically, better pre-trained models yield better transfer results, suggesting that…

Computer Vision and Pattern Recognition · Computer Science 2020-12-09 Hadi Salman , Andrew Ilyas , Logan Engstrom , Ashish Kapoor , Aleksander Madry