English
Related papers

Related papers: Weighted Training for Cross-Task Learning

200 papers

Meta reinforcement learning aims to develop policies that generalize to unseen tasks sampled from a task distribution. While context-based meta-RL methods improve task representation using task latents, they often struggle with…

Machine Learning · Computer Science 2026-05-21 Jeongmo Kim , Yisak Park , Minung Kim , Seungyul Han

Weight averaging is a widely used technique for accelerating training and improving the generalization of deep neural networks (DNNs). While existing approaches like stochastic weight averaging (SWA) rely on pre-set weighting schemes, they…

Machine Learning · Computer Science 2025-02-11 Tao Li , Zhehao Huang , Yingwen Wu , Zhengbao He , Qinghua Tao , Xiaolin Huang , Chih-Jen Lin

Large-scale general domain pretraining followed by downstream-specific finetuning has become a predominant paradigm in machine learning. However, discrepancies between the pretraining and target domains can still lead to performance…

Machine Learning · Computer Science 2024-10-15 Ruiyi Zhang , Sai Ashish Somayajula , Pengtao Xie

Meta-learning leverages related source tasks to learn an initialization that can be quickly fine-tuned to a target task with limited labeled examples. However, many popular meta-learning algorithms, such as model-agnostic meta-learning…

Machine Learning · Statistics 2020-03-24 Diana Cai , Rishit Sheth , Lester Mackey , Nicolo Fusi

Developing meta-learning algorithms that are un-biased toward a subset of training tasks often requires hand-designed criteria to weight tasks, potentially resulting in sub-optimal solutions. In this paper, we introduce a new principled and…

Machine Learning · Computer Science 2023-01-05 Cuong Nguyen , Thanh-Toan Do , Gustavo Carneiro

Meta-learning has been proposed as a framework to address the challenging few-shot learning setting. The key idea is to leverage a large number of similar few-shot tasks in order to learn how to adapt a base-learner to a new task for which…

Computer Vision and Pattern Recognition · Computer Science 2019-10-10 Qianru Sun , Yaoyao Liu , Zhaozheng Chen , Tat-Seng Chua , Bernt Schiele

Meta-learning has emerged as a trending technique to tackle few-shot text classification and achieve state-of-the-art performance. However, the performance of existing approaches heavily depends on the inter-class variance of the support…

Computation and Language · Computer Science 2023-06-12 Shuo Lei , Xuchao Zhang , Jianfeng He , Fanglan Chen , Chang-Tien Lu

Language models (LMs) pretrained on a large text corpus and fine-tuned on a downstream text corpus and fine-tuned on a downstream task becomes a de facto training strategy for several natural language processing (NLP) tasks. Recently, an…

Computation and Language · Computer Science 2021-07-23 Junghoon Lee , Jounghee Kim , Pilsung Kang

Prompt tuning offers a parameter-efficient way to adapt large pre-trained language models to new tasks, but most existing approaches are designed for single-task settings, failing to share knowledge across related tasks. We propose…

Computation and Language · Computer Science 2025-09-19 Ahmad Pouramini , Hesham Faili

Prior multi-task triplet loss methods relied on static weights to balance supervision between various types of annotation. However, static weighting requires tuning and does not account for how tasks interact when shaping a shared…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Meilun Zhou , Alina Zare

Deep learning models have demonstrated exceptional performance across a wide range of computer vision tasks. However, their performance often degrades significantly when faced with distribution shifts, such as domain or dataset changes.…

Computer Vision and Pattern Recognition · Computer Science 2025-07-09 Samuel Barbeau , Pedram Fekri , David Osowiechi , Ali Bahri , Moslem Yazdanpanah , Masih Aminbeidokhti , Christian Desrosiers

Transfer learning is an emerging paradigm for leveraging multiple sources to improve the statistical inference on a single target. In this paper, we propose a novel approach named residual importance weighted transfer learning (RIW-TL) for…

Methodology · Statistics 2024-01-04 Junlong Zhao , Shengbin Zheng , Chenlei Leng

Leveraging large amounts of unlabeled data using Transformer-like architectures, like BERT, has gained popularity in recent times owing to their effectiveness in learning general representations that can then be further fine-tuned for…

Computation and Language · Computer Science 2020-10-28 Kasturi Bhattacharjee , Miguel Ballesteros , Rishita Anubhai , Smaranda Muresan , Jie Ma , Faisal Ladhak , Yaser Al-Onaizan

Multi-Task Learning (MTL) has achieved success in various fields. However, how to balance different tasks to achieve good performance is a key problem. To achieve the task balancing, there are many works to carefully design dynamical…

Machine Learning · Computer Science 2022-07-28 Baijiong Lin , Feiyang Ye , Yu Zhang , Ivor W. Tsang

We present a novel reinforcement learning based algorithm for multi-robot task allocation problem in warehouse environments. We formulate it as a Markov Decision Process and solve via a novel deep multi-agent reinforcement learning method…

Robotics · Computer Science 2023-02-28 Aakriti Agrawal , Amrit Singh Bedi , Dinesh Manocha

We propose a novel adaptive transfer learning framework, learning to transfer learn (L2TL), to improve performance on a target dataset by careful extraction of the related information from a source dataset. Our framework considers…

Machine Learning · Computer Science 2020-07-17 Linchao Zhu , Sercan O. Arik , Yi Yang , Tomas Pfister

Transfer Learning enables Convolutional Neural Networks (CNN) to acquire knowledge from a source domain and transfer it to a target domain, where collecting large-scale annotated examples is time-consuming and expensive. Conventionally,…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 S. H. Shabbeer Basha , Debapriya Tula , Sravan Kumar Vinakota , Shiv Ram Dubey

Although considerable progress has been obtained in neural network quantization for efficient inference, existing methods are not scalable to heterogeneous devices as one dedicated model needs to be trained, transmitted, and stored for one…

Machine Learning · Computer Science 2022-12-13 Hai Wu , Ruifei He , Haoru Tan , Xiaojuan Qi , Kaibin Huang

Transfer learning can boost the performance on the targettask by leveraging the knowledge of the source domain. Recent worksin neural architecture search (NAS), especially one-shot NAS, can aidtransfer learning by establishing sufficient…

Computer Vision and Pattern Recognition · Computer Science 2021-05-20 Ming Sun , Haoxuan Dou , Junjie Yan

We present a simple yet effective Targeted Adversarial Training (TAT) algorithm to improve adversarial training for natural language understanding. The key idea is to introspect current mistakes and prioritize adversarial training steps to…

Computation and Language · Computer Science 2021-04-14 Lis Pereira , Xiaodong Liu , Hao Cheng , Hoifung Poon , Jianfeng Gao , Ichiro Kobayashi
‹ Prev 1 2 3 10 Next ›