Related papers: Multi-Objective Optimization for Sparse Deep Multi…

Efficient and Sparse Neural Networks by Pruning Weights in a Multiobjective Learning Approach

Overparameterization and overfitting are common concerns when designing and training deep neural networks, that are often counteracted by pruning and regularization strategies. However, these strategies remain secondary to most learning…

Machine Learning · Computer Science 2020-09-01 Malena Reiners , Kathrin Klamroth , Michael Stiglmayr

A Survey on Multi-Objective based Parameter Optimization for Deep Learning

Deep learning models form one of the most powerful machine learning models for the extraction of important features. Most of the designs of deep neural models, i.e., the initialization of parameters, are still manually tuned. Hence,…

Machine Learning · Computer Science 2023-05-18 Mrittika Chakraborty , Wreetbhas Pal , Sanghamitra Bandyopadhyay , Ujjwal Maulik

Multi-Task Learning as Multi-Objective Optimization

In multi-task learning, multiple tasks are solved jointly, sharing inductive bias between them. Multi-task learning is inherently a multi-objective problem because different tasks may conflict, necessitating a trade-off. A common compromise…

Machine Learning · Computer Science 2019-01-14 Ozan Sener , Vladlen Koltun

Heterogeneous Resource Allocation with Multi-task Learning for Wireless Networks

The optimal solution to an optimization problem depends on the problem's objective function, constraints, and size. While deep neural networks (DNNs) have proven effective in solving optimization problems, changes in the problem's size,…

Machine Learning · Computer Science 2025-02-17 Nikos A. Mitsiou , Pavlos S. Bouzinis , Panagiotis G. Sarigiannidis , George K. Karagiannidis

Nonconvex sparse regularization for deep neural networks and its optimality

Recent theoretical studies proved that deep neural network (DNN) estimators obtained by minimizing empirical risk with a certain sparsity constraint can attain optimal convergence rates for regression and classification problems. However,…

Statistics Theory · Mathematics 2021-08-10 Ilsang Ohn , Yongdai Kim

Meta-Sparsity: Learning Optimal Sparse Structures in Multi-task Networks through Meta-learning

This paper presents meta-sparsity, a framework for learning model sparsity, basically learning the parameter that controls the degree of sparsity, that allows deep neural networks (DNNs) to inherently generate optimal sparse shared…

Machine Learning · Computer Science 2025-01-22 Richa Upadhyay , Ronald Phlypo , Rajkumar Saini , Marcus Liwicki

A Multi-Objective Optimization framework for Decentralized Learning with coordination constraints

This article introduces a generalized framework for Decentralized Learning formulated as a Multi-Objective Optimization problem, in which both distributed agents and a central coordinator contribute independent, potentially conflicting…

Optimization and Control · Mathematics 2025-07-21 Roberto Morales , Umberto Biccari

A multiobjective continuation method to compute the regularization path of deep neural networks

Sparsity is a highly desired feature in deep neural networks (DNNs) since it ensures numerical efficiency, improves the interpretability of models (due to the smaller number of relevant features), and robustness. For linear models, it is…

Machine Learning · Computer Science 2024-04-01 Augustina C. Amakor , Konstantin Sonntag , Sebastian Peitz

Multi-Task Learning with Multi-Task Optimization

Multi-task learning solves multiple correlated tasks. However, conflicts may exist between them. In such circumstances, a single solution can rarely optimize all the tasks, leading to performance trade-offs. To arrive at a set of optimized…

Artificial Intelligence · Computer Science 2024-03-26 Lu Bai , Abhishek Gupta , Yew-Soon Ong

Leveraging Multiplexed Metasurfaces for Multi-Task Learning with All-Optical Diffractive Processors

Diffractive Neural Networks (DNNs) leverage the power of light to enhance computational performance in machine learning, offering a pathway to high-speed, low-energy, and large-scale neural information processing. However, most existing DNN…

Optics · Physics 2024-11-21 Sahar Behroozinia , Qing Gu

DCNNs on a Diet: Sampling Strategies for Reducing the Training Set Size

Large-scale supervised classification algorithms, especially those based on deep convolutional neural networks (DCNNs), require vast amounts of training data to achieve state-of-the-art performance. Decreasing this data requirement would…

Computer Vision and Pattern Recognition · Computer Science 2016-06-15 Maya Kabkab , Azadeh Alavi , Rama Chellappa

A Unified Framework for Training Neural Networks

The lack of mathematical tractability of Deep Neural Networks (DNNs) has hindered progress towards having a unified convergence analysis of training algorithms, in the general setting. We propose a unified optimization framework for…

Machine Learning · Computer Science 2018-05-24 Hadi Ghauch , Hossein Shokri-Ghadikolaei , Carlo Fischione , Mikael Skoglund

Dynamic Sparse Training via Balancing the Exploration-Exploitation Trade-off

Over-parameterization of deep neural networks (DNNs) has shown high prediction accuracy for many applications. Although effective, the large number of parameters hinders its popularity on resource-limited devices and has an outsize…

Machine Learning · Computer Science 2023-04-25 Shaoyi Huang , Bowen Lei , Dongkuan Xu , Hongwu Peng , Yue Sun , Mimi Xie , Caiwen Ding

Partitioning sparse deep neural networks for scalable training and inference

The state-of-the-art deep neural networks (DNNs) have significant computational and data management requirements. The size of both training data and models continue to increase. Sparsification and pruning methods are shown to be effective…

Machine Learning · Computer Science 2021-04-27 Gunduz Vehbi Demirci , Hakan Ferhatosmanoglu

Aligning Distributionally Robust Optimization with Practical Deep Learning Needs

While traditional Deep Learning (DL) optimization methods treat all training samples equally, Distributionally Robust Optimization (DRO) adaptively assigns importance weights to different samples. However, a significant gap exists between…

Machine Learning · Computer Science 2025-09-26 Dmitrii Feoktistov , Igor Ignashin , Andrey Veprikov , Nikita Borovko , Alexander Bogdanov , Savelii Chezhegov , Aleksandr Beznosikov

WANCO: Weak Adversarial Networks for Constrained Optimization problems

This paper focuses on integrating the networks and adversarial training into constrained optimization problems to develop a framework algorithm for constrained optimization problems. For such problems, we first transform them into minimax…

Optimization and Control · Mathematics 2024-07-08 Gang Bao , Dong Wang , Boyi Zou

Re-Weighted Learning for Sparsifying Deep Neural Networks

This paper addresses the topic of sparsifying deep neural networks (DNN's). While DNN's are powerful models that achieve state-of-the-art performance on a large number of tasks, the large number of model parameters poses serious storage and…

Machine Learning · Computer Science 2018-02-07 Igor Fedorov , Bhaskar D. Rao

Towards A Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms

With the rapid development of Deep Learning, more and more applications on the cloud and edge tend to utilize large DNN (Deep Neural Network) models for improved task execution efficiency as well as decision-making quality. Due to memory…

Machine Learning · Computer Science 2024-07-02 Jingran Shen , Nikos Tziritas , Georgios Theodoropoulos

Smooth Tchebycheff Scalarization for Multi-Objective Optimization

Multi-objective optimization problems can be found in many real-world applications, where the objectives often conflict each other and cannot be optimized by a single solution. In the past few decades, numerous methods have been proposed to…

Machine Learning · Computer Science 2024-07-24 Xi Lin , Xiaoyuan Zhang , Zhiyuan Yang , Fei Liu , Zhenkun Wang , Qingfu Zhang

Dynamic Optimization of Neural Network Structures Using Probabilistic Modeling

Deep neural networks (DNNs) are powerful machine learning models and have succeeded in various artificial intelligence tasks. Although various architectures and modules for the DNNs have been proposed, selecting and designing the…

Neural and Evolutionary Computing · Computer Science 2018-01-24 Shinichi Shirakawa , Yasushi Iwata , Youhei Akimoto