Related papers: Compositional Models: Multi-Task Learning and Know…

Modular Networks: Learning to Decompose Neural Computation

Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number…

Machine Learning · Computer Science 2018-11-14 Louis Kirsch , Julius Kunze , David Barber

Routing Networks and the Challenges of Modular and Compositional Computation

Compositionality is a key strategy for addressing combinatorial complexity and the curse of dimensionality. Recent work has shown that compositional solutions can be learned and offer substantial gains across a variety of domains, including…

Machine Learning · Computer Science 2019-04-30 Clemens Rosenbaum , Ignacio Cases , Matthew Riemer , Tim Klinger

Fast and Slow Learning of Recurrent Independent Mechanisms

Decomposing knowledge into interchangeable pieces promises a generalization advantage when there are changes in distribution. A learning agent interacting with its environment is likely to be faced with situations requiring novel…

Machine Learning · Computer Science 2021-05-20 Kanika Madan , Nan Rosemary Ke , Anirudh Goyal , Bernhard Schölkopf , Yoshua Bengio

Flexible Multi-task Networks by Learning Parameter Allocation

This paper proposes a novel learning method for multi-task applications. Multi-task neural networks can learn to transfer knowledge across different tasks by using parameter sharing. However, sharing parameters between unrelated tasks can…

Machine Learning · Computer Science 2020-07-21 Krzysztof Maziarz , Efi Kokiopoulou , Andrea Gesmundo , Luciano Sbaiz , Gabor Bartok , Jesse Berent

Compositional meta-learning through probabilistic task inference

To solve a new task from minimal experience, it is essential to effectively reuse knowledge from previous tasks, a problem known as meta-learning. Compositional solutions, where common elements of computation are flexibly recombined into…

Machine Learning · Computer Science 2025-10-03 Jacob J. W. Bakermans , Pablo Tano , Reidar Riveland , Charles Findling , Alexandre Pouget

Recurrent Neural Network for Text Classification with Multi-Task Learning

Neural network based methods have obtained great progress on a variety of natural language processing tasks. However, in most previous works, the models are learned based on single-task supervised objectives, which often suffer from…

Computation and Language · Computer Science 2016-05-18 Pengfei Liu , Xipeng Qiu , Xuanjing Huang

Explainable Neural Computation via Stack Neural Module Networks

In complex inferential tasks like question answering, machine learning models must confront two challenges: the need to implement a compositional reasoning process, and, in many applications, the need for this reasoning process to be…

Computer Vision and Pattern Recognition · Computer Science 2019-03-08 Ronghang Hu , Jacob Andreas , Trevor Darrell , Kate Saenko

Modular meta-learning

Many prediction problems, such as those that arise in the context of robotics, have a simplifying underlying structure that, if known, could accelerate learning. In this paper, we present a strategy for learning a set of neural network…

Machine Learning · Computer Science 2019-05-06 Ferran Alet , Tomás Lozano-Pérez , Leslie P. Kaelbling

Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data

Multi-Task Learning (MTL) networks have emerged as a promising method for transferring learned knowledge across different tasks. However, MTL must deal with challenges such as: overfitting to low resource tasks, catastrophic forgetting, and…

Machine Learning · Computer Science 2022-04-22 Jonathan Pilault , Amine Elhattami , Christopher Pal

Modular Blended Attention Network for Video Question Answering

In multimodal machine learning tasks, it is due to the complexity of the assignments that the network structure, in most cases, is assembled in a sophisticated way. The holistic architecture can be separated into several logical parts…

Computer Vision and Pattern Recognition · Computer Science 2023-11-23 Mingjie Zhou

Modular Deep Learning

Transfer learning has recently become the dominant paradigm of machine learning. Pre-trained models fine-tuned for downstream tasks achieve better performance with fewer labelled examples. Nonetheless, it remains unclear how to develop…

Machine Learning · Computer Science 2024-01-30 Jonas Pfeiffer , Sebastian Ruder , Ivan Vulić , Edoardo Maria Ponti

Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks

Neural networks (NNs) whose subnetworks implement reusable functions are expected to offer numerous advantages, including compositionality through efficient recombination of functional building blocks, interpretability, preventing…

Neural and Evolutionary Computing · Computer Science 2021-03-09 Róbert Csordás , Sjoerd van Steenkiste , Jürgen Schmidhuber

MaskConnect: Connectivity Learning by Gradient Descent

Although deep networks have recently emerged as the model of choice for many computer vision problems, in order to yield good results they often require time-consuming architecture search. To combat the complexity of design choices, prior…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Karim Ahmed , Lorenzo Torresani

Independent Modular Networks

Monolithic neural networks that make use of a single set of weights to learn useful representations for downstream tasks explicitly dismiss the compositional nature of data generation processes. This characteristic exists in data where…

Computer Vision and Pattern Recognition · Computer Science 2023-06-05 Hamed Damirchi , Forest Agostinelli , Pooyan Jamshidi

StackNet: Stacking Parameters for Continual learning

Training a neural network for a classification task typically assumes that the data to train are given from the beginning. However, in the real world, additional data accumulate gradually and the model requires additional training without…

Machine Learning · Computer Science 2020-04-22 Jangho Kim , Jeesoo Kim , Nojun Kwak

Towards Reusable Network Components by Learning Compatible Representations

This paper proposes to make a first step towards compatible and hence reusable network components. Rather than training networks for different tasks independently, we adapt the training process to produce network components that are…

Machine Learning · Computer Science 2020-12-17 Michael Gygli , Jasper Uijlings , Vittorio Ferrari

Network Transplanting (extended abstract)

This paper focuses on a new task, i.e., transplanting a category-and-task-specific neural network to a generic, modular network without strong supervision. We design a functionally interpretable structure for the generic network. Like…

Machine Learning · Computer Science 2019-01-23 Quanshi Zhang , Yu Yang , Qian Yu , Ying Nian Wu

Transfer Learning with Reconstruction Loss

In most applications of utilizing neural networks for mathematical optimization, a dedicated model is trained for each specific optimization objective. However, in many scenarios, several distinct yet correlated objectives or tasks often…

Machine Learning · Computer Science 2024-04-15 Wei Cui , Wei Yu

Break It Down: Evidence for Structural Compositionality in Neural Networks

Though modern neural networks have achieved impressive performance in both vision and language tasks, we know little about the functions that they implement. One possibility is that neural networks implicitly break down complex tasks into…

Computation and Language · Computer Science 2023-11-08 Michael A. Lepori , Thomas Serre , Ellie Pavlick

Network Transplanting

This paper focuses on a new task, i.e., transplanting a category-and-task-specific neural network to a generic, modular network without strong supervision. We design an functionally interpretable structure for the generic network. Like…

Machine Learning · Computer Science 2018-12-19 Quanshi Zhang , Yu Yang , Qian Yu , Ying Nian Wu