Related papers: Dynamic Layer Tying for Parameter-Efficient Transf…

Replacement Learning: Training Vision Tasks with Fewer Learnable Parameters

Traditional end-to-end deep learning models often enhance feature representation and overall performance by increasing the depth and complexity of the network during training. However, this approach inevitably introduces issues of parameter…

Computer Vision and Pattern Recognition · Computer Science 2024-10-03 Yuming Zhang, Peizhe Wang, Shouxin Zhang, Dongzhi Guan, Jiabin Liu, Junhao Su

Fractional Transfer Learning for Deep Model-Based Reinforcement Learning

Reinforcement learning (RL) is well known for requiring large amounts of data in order for RL agents to learn to perform complex tasks. Recent progress in model-based RL allows agents to be much more data-efficient, as it enables them to…

Machine Learning · Computer Science 2021-08-17 Remo Sasso, Matthia Sabatelli, Marco A. Wiering

Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning

Reinforcement Learning (RL) enables an intelligent agent to optimise its performance in a task by continuously taking actions from an observed state and receiving feedback from the environment in the form of rewards. RL typically uses tables…

Artificial Intelligence · Computer Science 2025-01-28 Alberto Castagna

On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers

State-of-the-art rehearsal-free continual learning methods exploit the peculiarities of Vision Transformers to learn task-specific prompts, drastically reducing catastrophic forgetting. However, there is a tradeoff between the number of…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Thomas De Min, Massimiliano Mancini, Karteek Alahari, Xavier Alameda-Pineda, Elisa Ricci

Learning Distinguishable Representations in Deep Q-Networks for Linear Transfer

Deep Reinforcement Learning (RL) has demonstrated success in solving complex sequential decision-making problems by integrating neural networks with the RL framework. However, training deep RL models poses several challenges, such as the…

Machine Learning · Computer Science 2025-09-30 Sooraj Sathish, Keshav Goyal, Raghuram Bharadwaj Diddigi

Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing

Sharing parameters in multi-agent deep reinforcement learning has played an essential role in allowing algorithms to scale to a large number of agents. Parameter sharing between agents significantly decreases the number of trainable…

Multiagent Systems · Computer Science 2021-06-15 Filippos Christianos, Georgios Papoudakis, Arrasy Rahman, Stefano V. Albrecht

Dynamic Sparse Training for Deep Reinforcement Learning

Deep reinforcement learning (DRL) agents are trained through trial-and-error interactions with the environment. This leads to a long training time for dense neural networks to achieve good performance. Hence, prohibitive computation and…

Machine Learning · Computer Science 2022-05-09 Ghada Sokar, Elena Mocanu, Decebal Constantin Mocanu, Mykola Pechenizkiy, Peter Stone

Functional Regularization for Reinforcement Learning via Learned Fourier Features

We propose a simple architecture for deep reinforcement learning by embedding inputs into a learned Fourier basis and show that it improves the sample efficiency of both state-based and image-based RL. We perform infinite-width analysis of…

Machine Learning · Computer Science 2021-12-07 Alexander C. Li, Deepak Pathak

Learning to reinforcement learn

In recent years deep reinforcement learning (RL) systems have attained superhuman performance in a number of challenging task domains. However, a major limitation of such applications is their demand for massive amounts of training data. A…

Machine Learning · Computer Science 2017-01-24 Jane X Wang, Zeb Kurth-Nelson, Dhruva Tirumala, Hubert Soyer, Joel Z Leibo, Remi Munos, Charles Blundell, Dharshan Kumaran, Matt Botvinick

Efficient Parameter Sampling for Neural Network Construction

The customizable nature of deep learning models has allowed them to be successful predictors in various disciplines. These models are often trained with respect to thousands or millions of instances for complicated problems, but the…

Machine Learning · Computer Science 2019-12-24 Drimik Roy Chowdhury, Muhammad Firmansyah Kasim

Compression-aware Training of Deep Networks

In recent years, great progress has been made in a variety of application domains thanks to the development of increasingly deeper neural networks. Unfortunately, the huge number of units of these networks makes them expensive both…

Computer Vision and Pattern Recognition · Computer Science 2018-10-12 Jose M. Alvarez, Mathieu Salzmann

Hyperparameter Tuning for Deep Reinforcement Learning Applications

Reinforcement learning (RL) applications, where an agent can simply learn optimal behaviors by interacting with the environment, are quickly gaining tremendous success in a wide variety of applications from controlling simple pendulums to…

Machine Learning · Computer Science 2022-01-28 Mariam Kiran, Melis Ozyildirim

Deep Learning in Target Space

Deep learning uses neural networks which are parameterised by their weights. The neural networks are usually trained by tuning the weights to directly minimise a given loss function. In this paper we propose to re-parameterise the weights…

Neural and Evolutionary Computing · Computer Science 2022-03-14 Michael Fairbank, Spyridon Samothrakis, Luca Citi

Using Deep Reinforcement Learning for the Continuous Control of Robotic Arms

Deep reinforcement learning enables algorithms to learn complex behavior, deal with continuous action spaces and find good strategies in environments with high dimensional state spaces. With deep reinforcement learning being an active area…

Machine Learning · Computer Science 2018-10-17 Winfried Lötzsch

Deep Q-Networks for Accelerating the Training of Deep Neural Networks

In this paper, we propose a principled deep reinforcement learning (RL) approach that is able to accelerate the convergence rate of general deep neural networks (DNNs). With our approach, a deep RL agent (synonym for optimizer in this work)…

Machine Learning · Computer Science 2017-07-14 Jie Fu

Dynamic Rank Adjustment for Accurate and Efficient Neural Network Training

Low-rank training methods reduce the number of trainable parameters by re-parameterizing the weights with matrix decompositions (e.g., singular value decomposition). However, enforcing a fixed low-rank structure caps the rank of the weight…

Machine Learning · Computer Science 2025-10-16 Hyuntak Shin, Aecheon Jung, Sungeun Hong, Sunwoo Lee

Learning Data Manipulation for Augmentation and Weighting

Manipulating data, such as weighting data examples or augmenting with new instances, has been increasingly used to improve model training. Previous work has studied various rule- or learning-based approaches designed for specific types of…

Machine Learning · Computer Science 2019-10-29 Zhiting Hu, Bowen Tan, Ruslan Salakhutdinov, Tom Mitchell, Eric P. Xing

Efficient Compression of Overparameterized Deep Models through Low-Dimensional Learning Dynamics

Overparameterized models have proven to be powerful tools for solving various machine learning tasks. However, overparameterization often leads to a substantial increase in computational and memory costs, which in turn requires extensive…

Machine Learning · Computer Science 2024-03-13 Soo Min Kwon, Zekai Zhang, Dogyoon Song, Laura Balzano, Qing Qu

Distributed Training and Optimization Of Neural Networks

Deep learning models are yielding increasingly better performances thanks to multiple factors. To be successful, a model may have a large number of parameters or a complex architecture and be trained on a large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant, Junqi Yin

Layerwise Noise Maximisation to Train Low-Energy Deep Neural Networks

Deep neural networks (DNNs) depend on the storage of a large number of parameters, which consumes an important portion of the energy used during inference. This paper considers the case where the energy usage of memory elements can be…

Machine Learning · Computer Science 2019-12-24 Sébastien Henwood, François Leduc-Primeau, Yvon Savaria