Related papers: Modular Deep Learning

Modularity in Deep Learning: A Survey

Modularity is a general principle present in many fields. It offers attractive advantages, including, among others, ease of conceptualization, interpretability, scalability, module combinability, and module reusability. The deep learning…

Machine Learning · Computer Science 2023-10-03 Haozhe Sun , Isabelle Guyon

Modular Networks: Learning to Decompose Neural Computation

Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number…

Machine Learning · Computer Science 2018-11-14 Louis Kirsch , Julius Kunze , David Barber

Transferability in Deep Learning: A Survey

The success of deep learning algorithms generally depends on large-scale data, while humans appear to have inherent ability of knowledge transfer, by recognizing and applying relevant knowledge from previous learning experiences when…

Machine Learning · Computer Science 2022-01-19 Junguang Jiang , Yang Shu , Jianmin Wang , Mingsheng Long

Is a Modular Architecture Enough?

Inspired from human cognition, machine learning systems are gradually revealing advantages of sparser and more modular architectures. Recent work demonstrates that not only do some modular architectures generalize well, but they also lead…

Machine Learning · Computer Science 2022-06-07 Sarthak Mittal , Yoshua Bengio , Guillaume Lajoie

Design Perspectives of Multitask Deep Learning Models and Applications

In recent years, multi-task learning has turned out to be of great success in various applications. Though single model training has promised great results throughout these years, it ignores valuable information that might help us estimate…

Machine Learning · Computer Science 2022-09-28 Yeshwant Singh , Anupam Biswas , Angshuman Bora , Debashish Malakar , Subham Chakraborty , Suman Bera

Breaking Neural Network Scaling Laws with Modularity

Modular neural networks outperform nonmodular neural networks on tasks ranging from visual question answering to robotics. These performance improvements are thought to be due to modular networks' superior ability to model the compositional…

Machine Learning · Computer Science 2025-03-12 Akhilan Boopathy , Sunshine Jiang , William Yue , Jaedong Hwang , Abhiram Iyer , Ila Fiete

Modular Universal Reparameterization: Deep Multi-task Learning Across Diverse Domains

As deep learning applications continue to become more diverse, an interesting question arises: Can general problem solving arise from jointly learning several such diverse tasks? To approach this question, deep multi-task learning is…

Machine Learning · Computer Science 2019-10-29 Elliot Meyerson , Risto Miikkulainen

Deep Modulation Embedding

Deep neural network has recently shown very promising applications in different research directions and attracted the industry attention as well. Although the idea was introduced in the past but just recently the main limitation of using…

Signal Processing · Electrical Eng. & Systems 2019-04-16 Amin Abbasloo , Alan Salari

Modularizing Deep Learning via Pairwise Learning With Kernels

By redefining the conventional notions of layers, we present an alternative view on finitely wide, fully trainable deep neural networks as stacked linear models in feature spaces, leading to a kernel machine interpretation. Based on this…

Machine Learning · Statistics 2020-12-02 Shiyu Duan , Shujian Yu , Jose Principe

A Survey on State-of-the-art Deep Learning Applications and Challenges

Deep learning, a branch of artificial intelligence, is a data-driven method that uses multiple layers of interconnected units or neurons to learn intricate patterns and representations directly from raw input data. Empowered by this…

Machine Learning · Computer Science 2025-07-28 Mohd Halim Mohd Noor , Ayokunle Olalekan Ige

A Selective Overview of Deep Learning

Deep learning has arguably achieved tremendous success in recent years. In simple words, deep learning uses the composition of many nonlinear functions to model the complex dependency between input features and labels. While neural networks…

Machine Learning · Statistics 2019-04-16 Jianqing Fan , Cong Ma , Yiqiao Zhong

Systems for Parallel and Distributed Large-Model Deep Learning Training

Deep learning (DL) has transformed applications in a variety of domains, including computer vision, natural language processing, and tabular data analysis. The search for improved DL model accuracy has led practitioners to explore…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-10 Kabir Nagrecha

A Combinatorial Perspective on Transfer Learning

Human intelligence is characterized not only by the capacity to learn complex skills, but the ability to rapidly adapt and acquire new skills within an ever-changing environment. In this work we study how the learning of modular solutions…

Machine Learning · Computer Science 2020-10-26 Jianan Wang , Eren Sezener , David Budden , Marcus Hutter , Joel Veness

Transfer Learning with Deep Tabular Models

Recent work on deep learning for tabular data demonstrates the strong performance of deep tabular models, often bridging the gap between gradient boosted decision trees and neural networks. Accuracy aside, a major advantage of neural models…

Machine Learning · Computer Science 2023-08-08 Roman Levin , Valeriia Cherepanova , Avi Schwarzschild , Arpit Bansal , C. Bayan Bruss , Tom Goldstein , Andrew Gordon Wilson , Micah Goldblum

A Survey From Distributed Machine Learning to Distributed Deep Learning

Artificial intelligence has made remarkable progress in handling complex tasks, thanks to advances in hardware acceleration and machine learning algorithms. However, to acquire more accurate outcomes and solve more complex issues,…

Machine Learning · Computer Science 2023-09-12 Mohammad Dehghani , Zahra Yazdanparast

muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems

Most uses of machine learning today involve training a model from scratch for a particular task, or sometimes starting with a model pretrained on a related task and then fine-tuning on a downstream task. Both approaches offer limited…

Machine Learning · Computer Science 2022-05-26 Andrea Gesmundo , Jeff Dean

Clustering units in neural networks: upstream vs downstream information

It has been hypothesized that some form of "modular" structure in artificial neural networks should be useful for learning, compositionality, and generalization. However, defining and quantifying modularity remains an open problem. We cast…

Machine Learning · Computer Science 2022-06-23 Richard D. Lange , David S. Rolnick , Konrad P. Kording

Model-Based Deep Learning

Signal processing traditionally relies on classical statistical modeling techniques. Such model-based methods utilize mathematical formulations that represent the underlying physics, prior information and additional domain knowledge. Simple…

Signal Processing · Electrical Eng. & Systems 2023-06-08 Nir Shlezinger , Yonina C. Eldar

A Perspective on Deep Learning for Molecular Modeling and Simulations

Deep learning is transforming many areas in science, and it has great potential in modeling molecular systems. However, unlike the mature deployment of deep learning in computer vision and natural language processing, its development in…

Computational Physics · Physics 2021-03-19 Jun Zhang , Yao-Kun Lei , Zhen Zhang , Junhan Chang , Maodong Li , Xu Han , Lijiang Yang , Yi Isaac Yang , Yi Qin Gao

On The Specialization of Neural Modules

A number of machine learning models have been proposed with the goal of achieving systematic generalization: the ability to reason about new situations by combining aspects of previous experiences. These models leverage compositional…

Machine Learning · Computer Science 2024-09-24 Devon Jarvis , Richard Klein , Benjamin Rosman , Andrew M. Saxe