English
Related papers

Related papers: Using Multi-task and Transfer Learning to Solve Wo…

200 papers

Learning distributed sentence representations is one of the key challenges in natural language processing. Previous work demonstrated that a recurrent neural network (RNNs) based sentence encoder trained on a large collection of annotated…

Computation and Language · Computer Science 2018-08-20 Wasi Uddin Ahmad , Xueying Bai , Zhechao Huang , Chao Jiang , Nanyun Peng , Kai-Wei Chang

Recent breakthroughs in deep learning often rely on representation learning and knowledge transfer. In recent years, unsupervised and self-supervised techniques for learning speech representation were developed to foster automatic speech…

Computation and Language · Computer Science 2021-12-15 Pierre Beckmann , Mikolaj Kegler , Milos Cernak

Deep learning models require extensive architecture design exploration and hyperparameter optimization to perform well on a given task. The exploration of the model design space is often made by a human expert, and optimized using a…

Artificial Intelligence · Computer Science 2017-10-31 Catherine Wong , Andrea Gesmundo

Data reconstruction is a widely used pre-training task to learn the generalized features for many downstream tasks. Although reconstruction tasks have been applied to neural signal completion and denoising, neural signal reconstruction is…

Neurons and Cognition · Quantitative Biology 2024-07-02 Youzhi Qu , Junfeng Xia , Xinyao Jian , Wendu Li , Kaining Peng , Zhichao Liang , Haiyan Wu , Quanying Liu

Data-efficient neural decoding is a central challenge for speech brain-computer interfaces. We present the first demonstration of transfer learning and cross-task decoding for MEG-based speech models spanning perception and production. We…

Machine Learning · Computer Science 2026-02-23 Xabier de Zuazo , Vincenzo Verbeni , Eva Navas , Ibon Saratxaga , Mathieu Bourguignon , Nicola Molinaro

Electroencephalography (EEG) decoding is a challenging task due to the limited availability of labelled data. While transfer learning is a promising technique to address this challenge, it assumes that transferable data domains and task are…

Signal Processing · Electrical Eng. & Systems 2023-08-07 Bruno Aristimunha , Raphael Y. de Camargo , Walter H. Lopez Pinaya , Sylvain Chevallier , Alexandre Gramfort , Cedric Rommel

Neural network based models have achieved impressive results on various specific tasks. However, in previous works, most models are learned separately based on single-task supervised objectives, which often suffer from insufficient training…

Computation and Language · Computer Science 2016-09-26 Pengfei Liu , Xipeng Qiu , Xuanjing Huang

Recently, quite a few novel neural architectures were derived to solve math word problems by predicting expression trees. These architectures varied from seq2seq models, including encoders leveraging graph relationships combined with tree…

Computation and Language · Computer Science 2022-06-06 Keyur Faldu , Amit Sheth , Prashant Kikani , Darshan Patel

Neural Architecture Search (NAS) methods have been shown to outperform hand-designed models and help to democratize AI. However, NAS methods often start from scratch with each new task, making them computationally expensive and limiting…

Machine Learning · Computer Science 2025-07-15 Prabhant Singh , Joaquin Vanschoren

Missing input sequences are common in medical imaging data, posing a challenge for deep learning models reliant on complete input data. In this work, inspired by MultiMAE [2], we develop a masked autoencoder (MAE) paradigm for multi-modal,…

Computer Vision and Pattern Recognition · Computer Science 2026-02-04 Ayhan Can Erdur , Christian Beischl , Daniel Scholz , Jiazhen Pan , Benedikt Wiestler , Daniel Rueckert , Jan C Peeken

Neural processes have recently emerged as a class of powerful neural latent variable models that combine the strengths of neural networks and stochastic processes. As they can encode contextual data in the network's function space, they…

Machine Learning · Computer Science 2021-12-03 Jiayi Shen , Xiantong Zhen , Marcel Worring , Ling Shao

Despite the growing interest in unsupervised learning, extracting meaningful knowledge from unlabelled audio remains an open challenge. To take a step in this direction, we recently proposed a problem-agnostic speech encoder (PASE), that…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-21 Mirco Ravanelli , Jianyuan Zhong , Santiago Pascual , Pawel Swietojanski , Joao Monteiro , Jan Trmal , Yoshua Bengio

Recent work has demonstrated that large-scale, multi-animal models are powerful tools for characterizing the relationship between neural activity and behavior. Current large-scale approaches, however, focus exclusively on either predicting…

A mainstream type of current self-supervised learning methods pursues a general-purpose representation that can be well transferred to downstream tasks, typically by optimizing on a given pretext task such as instance discrimination. In…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Xin Liu , Zhongdao Wang , Yali Li , Shengjin Wang

Intermediate task transfer learning can greatly improve model performance. If, for example, one has little training data for emotion detection, first fine-tuning a language model on a sentiment classification dataset may improve performance…

Computation and Language · Computer Science 2024-10-22 David Schulte , Felix Hamborg , Alan Akbik

Masked Autoencoder~(MAE) is a prevailing self-supervised learning method that achieves promising results in model pre-training. However, when the various downstream tasks have data distributions different from the pre-training data, the…

Computer Vision and Pattern Recognition · Computer Science 2024-02-09 Zhili Liu , Kai Chen , Jianhua Han , Lanqing Hong , Hang Xu , Zhenguo Li , James T. Kwok

We propose a pre-training strategy called Multi-modal Multi-task Masked Autoencoders (MultiMAE). It differs from standard Masked Autoencoding in two key aspects: I) it can optionally accept additional modalities of information in the input…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Roman Bachmann , David Mizrahi , Andrei Atanov , Amir Zamir

We propose a multi-resolution convolutional autoencoder (MrCAE) architecture that integrates and leverages three highly successful mathematical architectures: (i) multigrid methods, (ii) convolutional autoencoders and (iii) transfer…

Machine Learning · Computer Science 2020-04-13 Yuying Liu , Colin Ponce , Steven L. Brunton , J. Nathan Kutz

Self-supervised representation learning of Multivariate Time Series (MTS) is a challenging task and attracts increasing research interests in recent years. Many previous works focus on the pretext task of self-supervised learning and…

Machine Learning · Computer Science 2022-03-10 Yijiang Chen , Xiangdong Zhou , Zhen Xing , Zhidan Liu , Minyang Xu

This paper investigates an under-explored but important problem: given a collection of pre-trained neural networks, predicting their performance on each multi-modal task without fine-tuning them, such as image recognition, referring,…

Machine Learning · Computer Science 2023-08-14 Fanqing Meng , Wenqi Shao , Zhanglin Peng , Chonghe Jiang , Kaipeng Zhang , Yu Qiao , Ping Luo
‹ Prev 1 2 3 10 Next ›