Related papers: HyperGrid: Efficient Multi-Task Transformers with …

200 papers

Prompt-Tuning is a new paradigm for finetuning pre-trained language models in a parameter-efficient way. Here, we explore the use of HyperNetworks to generate hyper-prompts: we propose HyperPrompt, a novel architecture for prompt-based…

Computation and Language · Computer Science 2022-06-16 Yun He, Huaixiu Steven Zheng, Yi Tay, Jai Gupta, Yu Du, Vamsi Aribandi, Zhe Zhao, YaGuang Li, Zhao Chen, Donald Metzler, Heng-Tze Cheng, Ed H. Chi

Although pretraining has garnered significant attention and popularity in recent years, its application in graph-based recommender systems is relatively limited. It is challenging to exploit prior knowledge by pretraining in widely used…

Information Retrieval · Computer Science 2023-10-23 Mingdai Yang, Zhiwei Liu, Liangwei Yang, Xiaolong Liu, Chen Wang, Hao Peng, Philip S. Yu

Deep learning based techniques achieve state-of-the-art results in a wide range of image reconstruction tasks like compressed sensing. These methods almost always have hyperparameters, such as the weight coefficients that balance the…

Computer Vision and Pattern Recognition · Computer Science 2022-06-10 Alan Q. Wang, Adrian V. Dalca, Mert R. Sabuncu

Brain-inspired Hyperdimensional (HD) computing is an emerging technique for cognitive tasks in the field of low-power design. As a fast-learning and energy-efficient computational paradigm, HD computing has shown great success in many…

Signal Processing · Electrical Eng. & Systems 2020-04-30 Cheng-Yang Chang, Yu-Chuan Chuang, An-Yeu Wu

Hypergraphs play a pivotal role in the modelling of data featuring higher-order relations involving more than two entities. Hypergraph neural networks emerge as a powerful tool for processing hypergraph-structured data, delivering…

Machine Learning · Computer Science 2024-06-04 Zexi Liu, Bohan Tang, Ziyuan Ye, Xiaowen Dong, Siheng Chen, Yanfeng Wang

We present a novel methodology to jointly perform multi-task learning and infer intrinsic relationship among tasks by an interpretable and sparse graph. Unlike existing multi-task learning methodologies, the graph structure is not assumed…

Machine Learning · Computer Science 2020-09-15 Shujian Yu, Francesco Alesiani, Ammar Shaker, Wenzhe Yin

Recent works have shown that Large Language Models (LLMs) can facilitate the grounding of instructions for robotic task planning. Despite this progress, most existing works have primarily focused on utilizing raw images to aid LLMs in…

Robotics · Computer Science 2024-03-12 Zhe Ni, Xiaoxin Deng, Cong Tai, Xinyue Zhu, Qinghongbing Xie, Weihang Huang, Xiang Wu, Long Zeng

In this work we propose a HyperTransformer, a Transformer-based model for supervised and semi-supervised few-shot learning that generates weights of a convolutional neural network (CNN) directly from support samples. Since the dependence of…

Machine Learning · Computer Science 2022-07-15 Andrey Zhmoginov, Mark Sandler, Max Vladymyrov

The workflow of pretraining and fine-tuning has emerged as a popular paradigm for solving various NLP and V&L (Vision-and-Language) downstream tasks. With the capacity of pretrained models growing rapidly, how to perform parameter-efficient…

Computation and Language · Computer Science 2022-03-09 Zhengkun Zhang, Wenya Guo, Xiaojun Meng, Yasheng Wang, Yadao Wang, Xin Jiang, Qun Liu, Zhenglu Yang

We propose a novel graph-driven generative model, that unifies multiple heterogeneous learning tasks into the same framework. The proposed model is based on the fact that heterogeneous learning tasks, which correspond to different…

Machine Learning · Computer Science 2019-11-21 Wenlin Wang, Hongteng Xu, Zhe Gan, Bai Li, Guoyin Wang, Liqun Chen, Qian Yang, Wenqi Wang, Lawrence Carin

Massively multilingual models are promising for transfer learning across tasks and languages. However, existing methods are unable to fully leverage training data when it is available in different task-language combinations. To exploit such…

Computation and Language · Computer Science 2022-10-26 Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord, Sebastian Ruder

We present a method for learning multiple scene representations given a small labeled set, by exploiting the relationships between such representations in the form of a multi-task hypergraph. We also show how we can use the hypergraph to…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Alina Marcu, Mihai Pirvu, Dragos Costea, Emanuela Haller, Emil Slusanschi, Ahmed Nabil Belbachir, Rahul Sukthankar, Marius Leordeanu

This paper proposes a novel heterogeneous grid convolution that builds a graph-based image representation by exploiting heterogeneity in the image content, enabling adaptive, efficient, and controllable computations in a convolutional…

Computer Vision and Pattern Recognition · Computer Science 2021-04-23 Ryuhei Hamaguchi, Yasutaka Furukawa, Masaki Onishi, Ken Sakurada

Graph Neural Networks (GNNs) are a framework for graph representation learning, where a model learns to generate low dimensional node embeddings that encapsulate structural and feature-related information. GNNs are usually trained in an…

Machine Learning · Computer Science 2020-12-15 Davide Buffelli, Fabio Vandin

State-of-the-art parameter-efficient fine-tuning methods rely on introducing adapter modules between the layers of a pretrained language model. However, such modules are trained separately for each task and thus do not enable sharing…

Computation and Language · Computer Science 2021-06-09 Rabeeh Karimi Mahabadi, Sebastian Ruder, Mostafa Dehghani, James Henderson

Self-supervised learning is currently gaining a lot of attention, as it allows neural networks to learn robust representations from large quantities of unlabeled data. Additionally, multi-task learning can further improve representation…

Machine Learning · Computer Science 2020-12-07 Franco Manessi, Alessandro Rozza

Neural network based methods have obtained great progress on a variety of natural language processing tasks. However, in most previous works, the models are learned based on single-task supervised objectives, which often suffer from…

Computation and Language · Computer Science 2016-05-18 Pengfei Liu, Xipeng Qiu, Xuanjing Huang

There are many ways of interpreting the world and they are highly interdependent. We exploit such complex dependencies and introduce a powerful multi-task hypergraph, in which every node is a task and different paths through the hypergraph…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Mihai Pirvu, Alina Marcu, Alexandra Dobrescu, Nabil Belbachir, Marius Leordeanu

With the advent of deep learning, many dense prediction tasks, i.e. tasks that produce pixel-level predictions, have seen significant performance improvements. The typical approach is to learn these tasks in isolation, that is, a separate…

Computer Vision and Pattern Recognition · Computer Science 2021-01-26 Simon Vandenhende, Stamatios Georgoulis, Wouter Van Gansbeke, Marc Proesmans, Dengxin Dai, Luc Van Gool

Multitask algorithms typically use task similarity information as a bias to speed up and improve the performance of learning processes. Tasks are learned jointly, sharing information across them, in order to construct models more accurate…

Machine Learning · Computer Science 2019-04-11 Marco Frasca, Giuliano Grossi, Giorgio Valentini