Related papers: Efficient Multitask Learning on Resource-Constrain…

Energy-efficient Task Adaptation for NLP Edge Inference Leveraging Heterogeneous Memory Architectures

Executing machine learning inference tasks on resource-constrained edge devices requires careful hardware-software co-design optimizations. Recent examples have shown how transformer-based deep neural network models such as ALBERT can be…

Machine Learning · Computer Science · 2023-04-14 · Zirui Fu, Aleksandre Avaliani, Marco Donato

AdaMTL: Adaptive Input-dependent Inference for Efficient Multi-Task Learning

Modern Augmented reality applications require performing multiple tasks on each input frame simultaneously. Multi-task learning (MTL) represents an effective approach where multiple tasks share an encoder to extract representative features…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-19 · Marina Neseem, Ahmed Agiza, Sherief Reda

Multi-Task Learning with Group-Specific Feature Space Sharing

When faced with learning a set of inter-related tasks from a limited amount of usable data, learning each task independently may lead to poor generalization performance. Multi-Task Learning (MTL) exploits the latent relations between tasks…

Machine Learning · Computer Science · 2015-08-14 · Niloofar Yousefi, Michael Georgiopoulos, Georgios C. Anagnostopoulos

AdapLeR: Speeding up Inference by Adaptive Length Reduction

Pre-trained language models have shown stellar performance in various downstream tasks. But, this usually comes at the cost of high latency and computation, hindering their usage in resource-limited settings. In this work, we propose a…

Computation and Language · Computer Science · 2022-03-18 · Ali Modarressi, Hosein Mohebbi, Mohammad Taher Pilehvar

Joint Optimization of Video-based AI Inference Tasks in MEC-assisted Augmented Reality Systems

The high computational complexity and energy consumption of artificial intelligence (AI) algorithms hinder their application in augmented reality (AR) systems. However, mobile edge computing (MEC) makes it possible to solve this problem.…

Networking and Internet Architecture · Computer Science · 2023-01-04 · Guangjin Pan, Heng Zhang, Shugong Xu, Shunqing Zhang, Xiaojing Chen

Energy and Time Efficient Scheduling of Tasks with Dependencies on Asymmetric Multiprocessors

In this work we study the problem of scheduling tasks with dependencies in multiprocessor architectures where processors have different speeds. We present the preemptive algorithm "Save-Energy" that given a schedule of tasks it post…

Distributed, Parallel, and Cluster Computing · Computer Science · 2008-06-09 · Ioannis Chatzigiannakis, Georgios Giannoulis, Paul G. Spirakis

Ensemble Prediction of Task Affinity for Efficient Multi-Task Learning

A fundamental problem in multi-task learning (MTL) is identifying groups of tasks that should be learned together. Since training MTL models for all possible combinations of tasks is prohibitively expensive for large task sets, a crucial…

Machine Learning · Computer Science · 2026-02-24 · Afiya Ayman, Ayan Mukhopadhyay, Aron Laszka

A Tensor Low-Rank Approximation for Value Functions in Multi-Task Reinforcement Learning

In pursuit of reinforcement learning systems that could train in physical environments, we investigate multi-task approaches as a means to alleviate the need for massive data acquisition. In a tabular scenario where the Q-functions are…

Machine Learning · Computer Science · 2025-01-22 · Sergio Rozada, Santiago Paternain, Juan Andres Bazerque, Antonio G. Marques

Learning from Similar Linear Representations: Adaptivity, Minimaxity, and Robustness

Representation multi-task learning (MTL) has achieved tremendous success in practice. However, the theoretical understanding of these methods is still lacking. Most existing theoretical works focus on cases where all tasks share the same…

Machine Learning · Statistics · 2025-07-08 · Ye Tian, Yuqi Gu, Yang Feng

Improving tasks throughput on accelerators using OpenCL command concurrency

A heterogeneous architecture composed by a host and an accelerator must frequently deal with situations where several independent tasks are available to be offloaded onto the accelerator. These tasks can be generated by concurrent…

Distributed, Parallel, and Cluster Computing · Computer Science · 2018-07-03 · A. J. Lázaro-Muñoz, J. M. González-Linares, J. Gómez-Luna, N. Guil

Energy-Aware DNN Graph Optimization

Unlike existing work in deep neural network (DNN) graphs optimization for inference performance, we explore DNN graph optimization for energy awareness and savings for power- and resource-constrained machine learning devices. We present a…

Machine Learning · Computer Science · 2026-01-27 · Yu Wang, Rong Ge, Shuang Qiu

Learning an Interpretable Graph Structure in Multi-Task Learning

We present a novel methodology to jointly perform multi-task learning and infer intrinsic relationship among tasks by an interpretable and sparse graph. Unlike existing multi-task learning methodologies, the graph structure is not assumed…

Machine Learning · Computer Science · 2020-09-15 · Shujian Yu, Francesco Alesiani, Ammar Shaker, Wenzhe Yin

Optimizing Tensor Network Partitioning using Simulated Annealing

Tensor networks have proven to be a valuable tool, for instance, in the classical simulation of (strongly correlated) quantum systems. As the size of the systems increases, contracting larger tensor networks becomes computationally…

Quantum Physics · Physics · 2025-07-29 · Manuel Geiger, Qunsheng Huang, Christian B. Mendl

InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning

Jointly learning multiple tasks with a unified model can improve accuracy and data efficiency, but it faces the challenge of task interference, where optimizing one task objective may inadvertently compromise the performance of another. A…

Machine Learning · Computer Science · 2024-02-27 · Babak Ehteshami Bejnordi, Gaurav Kumar, Amelie Royer, Christos Louizos, Tijmen Blankevoort, Mohsen Ghafoorian

Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models

Large Language Models (LLMs) with the Mixture-of-Experts (MoE) architectures have shown promising performance on various tasks. However, due to the huge model sizes, running them in resource-constrained environments where the GPU memory is…

Machine Learning · Computer Science · 2025-05-02 · Keisuke Kamahori, Tian Tang, Yile Gu, Kan Zhu, Baris Kasikci

Efficient Task Grouping Through Samplewise Optimisation Landscape Analysis

Shared training approaches, such as multi-task learning (MTL) and gradient-based meta-learning, are widely used in various machine learning applications, but they often suffer from negative transfer, leading to performance degradation in…

Machine Learning · Computer Science · 2024-12-10 · Anshul Thakur, Yichen Huang, Soheila Molaei, Yujiang Wang, David A. Clifton

Efficient Multitask Feature and Relationship Learning

We consider a multitask learning problem, in which several predictors are learned jointly. Prior research has shown that learning the relations between tasks, and between the input features, together with the predictor, can lead to better…

Machine Learning · Computer Science · 2019-07-11 · Han Zhao, Otilia Stretcu, Alex Smola, Geoff Gordon

Efficient Controllable Multi-Task Architectures

We aim to train a multi-task model such that users can adjust the desired compute budget and relative importance of task performances after deployment, without retraining. This enables optimizing performance for dynamically varying user…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-24 · Abhishek Aich, Samuel Schulter, Amit K. Roy-Chowdhury, Manmohan Chandraker, Yumin Suh

Optimal Task Assignment to Heterogeneous Federated Learning Devices

Federated Learning provides new opportunities for training machine learning models while respecting data privacy. This technique is based on heterogeneous devices that work together to iteratively train a model while never sharing their own…

Artificial Intelligence · Computer Science · 2020-10-02 · Laércio Lima Pilla

ATA: Adaptive Task Allocation for Efficient Resource Management in Distributed Machine Learning

Asynchronous methods are fundamental for parallelizing computations in distributed machine learning. They aim to accelerate training by fully utilizing all available resources. However, their greedy approach can lead to inefficiencies using…

Machine Learning · Computer Science · 2025-05-23 · Artavazd Maranjyan, El Mehdi Saad, Peter Richtárik, Francesco Orabona