Related papers: A multi-stage deep learning based algorithm for mu…

Deep Multiscale Model Learning

The objective of this paper is to design novel multi-layer neural network architectures for multiscale simulations of flows taking into account the observed data and physical modeling concepts. Our approaches use deep learning concepts…

Numerical Analysis · Mathematics 2018-06-14 Yating Wang , Siu Wun Cheung , Eric T. Chung , Yalchin Efendiev , Min Wang

Multilevel Initialization for Layer-Parallel Deep Neural Network Training

This paper investigates multilevel initialization strategies for training very deep neural networks with a layer-parallel multigrid solver. The scheme is based on the continuous interpretation of the training problem as a problem of optimal…

Machine Learning · Computer Science 2019-12-20 Eric C. Cyr , Stefanie Günther , Jacob B. Schroder

Sequential training algorithm for neural networks

A sequential training method for large-scale feedforward neural networks is presented. Each layer of the neural network is decoupled and trained separately. After the training is completed for each layer, they are combined together. The…

Machine Learning · Computer Science 2019-05-21 Jongrae Kim

Scalable Nested Optimization for Deep Learning

Gradient-based optimization has been critical to the success of machine learning, updating a single set of parameters to minimize a single loss. A growing number of applications rely on a generalization of this, where we have a bilevel or…

Machine Learning · Computer Science 2024-07-02 Jonathan Lorraine

Towards Differentiable Multilevel Optimization: A Gradient-Based Approach

Multilevel optimization has gained renewed interest in machine learning due to its promise in applications such as hyperparameter tuning and continual learning. However, existing methods struggle with the inherent difficulty of efficiently…

Machine Learning · Computer Science 2024-10-16 Yuntian Gu , Xuzheng Chen

Multi-stage Neural Networks: Function Approximator of Machine Precision

Deep learning techniques are increasingly applied to scientific problems, where the precision of networks is crucial. Despite being deemed as universal function approximators, neural networks, in practice, struggle to reduce the prediction…

Machine Learning · Computer Science 2023-07-19 Yongji Wang , Ching-Yao Lai

Multi-Grade Deep Learning

The current deep learning model is of a single-grade, that is, it learns a deep neural network by solving a single nonconvex optimization problem. When the layer number of the neural network is large, it is computationally challenging to…

Machine Learning · Computer Science 2023-02-02 Yuesheng Xu

Improving the Reconstruction of Disentangled Representation Learners via Multi-Stage Modeling

Current autoencoder-based disentangled representation learning methods achieve disentanglement by penalizing the (aggregate) posterior to encourage statistical independence of the latent factors. This approach introduces a trade-off between…

Machine Learning · Statistics 2024-11-05 Akash Srivastava , Yamini Bansal , Yukun Ding , Cole Lincoln Hurwitz , Kai Xu , Bernhard Egger , Prasanna Sattigeri , Joshua B. Tenenbaum , Agus Sudjianto , Phuong Le , Arun Prakash R , Nengfeng Zhou , Joel Vaughan , Yaqun Wang , Anwesha Bhattacharyya , Kristjan Greenewald , David D. Cox , Dan Gutfreund

Optimizing ML Training with Metagradient Descent

A major challenge in training large-scale machine learning models is configuring the training process to maximize model performance, i.e., finding the best training setup from a vast design space. In this work, we unlock a gradient-based…

Machine Learning · Statistics 2025-03-19 Logan Engstrom , Andrew Ilyas , Benjamin Chen , Axel Feldmann , William Moses , Aleksander Madry

Stage-based Hyper-parameter Optimization for Deep Learning

As deep learning techniques advance more than ever, hyper-parameter optimization is the new major workload in deep learning clusters. Although hyper-parameter optimization is crucial in training deep learning models for high model…

Machine Learning · Computer Science 2019-11-26 Ahnjae Shin , Dong-Jin Shin , Sungwoo Cho , Do Yoon Kim , Eunji Jeong , Gyeong-In Yu , Byung-Gon Chun

Distributed Training and Optimization Of Neural Networks

Deep learning models are yielding increasingly better performances thanks to multiple factors. To be successful, model may have large number of parameters or complex architectures and be trained on large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant , Junqi Yin

Model-Based Deep Learning: On the Intersection of Deep Learning and Optimization

Decision making algorithms are used in a multitude of different applications. Conventional approaches for designing decision algorithms employ principled and simplified modelling, based on which one can determine decisions via tractable…

Signal Processing · Electrical Eng. & Systems 2022-06-23 Nir Shlezinger , Yonina C. Eldar , Stephen P. Boyd

Scaling Matters in Deep Structured-Prediction Models

Deep structured-prediction energy-based models combine the expressive power of learned representations and the ability of embedding knowledge about the task at hand into the system. A common way to learn parameters of such models consists…

Machine Learning · Computer Science 2019-03-01 Aleksandr Shevchenko , Anton Osokin

Real-time topology optimization via learnable mappings

In traditional topology optimization, the computing time required to iteratively update the material distribution within a design domain strongly depends on the complexity or size of the problem, limiting its application in real engineering…

Computational Engineering, Finance, and Science · Computer Science 2024-05-14 Gabriel Garayalde , Matteo Torzoni , Matteo Bruggi , Alberto Corigliano

Dynamic Sampling for Deep Metric Learning

Deep metric learning maps visually similar images onto nearby locations and visually dissimilar images apart from each other in an embedding manifold. The learning process is mainly based on the supplied image negative and positive training…

Computer Vision and Pattern Recognition · Computer Science 2020-09-14 Chang-Hui Liang , Wan-Lei Zhao , Run-Qing Chen

Gradient Descent based Optimization Algorithms for Deep Learning Models Training

In this paper, we aim at providing an introduction to the gradient descent based optimization algorithms for learning deep neural network models. Deep learning models involving multiple nonlinear projection layers are very challenging to…

Machine Learning · Computer Science 2019-03-12 Jiawei Zhang

Fast Multilevel Support Vector Machines

Solving different types of optimization models (including parameters fitting) for support vector machines on large-scale training data is often an expensive computational task. This paper proposes a multilevel algorithmic framework that…

Machine Learning · Statistics 2014-10-14 Talayeh Razzaghi , Ilya Safro

Distributed learning of deep neural network over multiple agents

In domains such as health care and finance, shortage of labeled data and computational resources is a critical issue while developing machine learning algorithms. To address the issue of labeled data scarcity in training and deployment of…

Machine Learning · Computer Science 2018-10-16 Otkrist Gupta , Ramesh Raskar

Open Problems in Applied Deep Learning

This work formulates the machine learning mechanism as a bi-level optimization problem. The inner level optimization loop entails minimizing a properly chosen loss function evaluated on the training data. This is nothing but the…

Machine Learning · Computer Science 2023-01-27 Maziar Raissi

A reinforced learning approach to optimal design under model uncertainty

Optimal designs are usually model-dependent and likely to be sub-optimal if the postulated model is not correctly specified. In practice, it is common that a researcher has a list of candidate models at hand and a design has to be found…

Statistics Theory · Mathematics 2023-03-29 Mingyao Ai , Holger Dette , Zhengfu Liu , Jun Yu