English
Related papers

Related papers: A multi-stage deep learning based algorithm for mu…

200 papers

The objective of this paper is to design novel multi-layer neural network architectures for multiscale simulations of flows taking into account the observed data and physical modeling concepts. Our approaches use deep learning concepts…

Numerical Analysis · Mathematics 2018-06-14 Yating Wang , Siu Wun Cheung , Eric T. Chung , Yalchin Efendiev , Min Wang

This paper investigates multilevel initialization strategies for training very deep neural networks with a layer-parallel multigrid solver. The scheme is based on the continuous interpretation of the training problem as a problem of optimal…

Machine Learning · Computer Science 2019-12-20 Eric C. Cyr , Stefanie Günther , Jacob B. Schroder

A sequential training method for large-scale feedforward neural networks is presented. Each layer of the neural network is decoupled and trained separately. After the training is completed for each layer, they are combined together. The…

Machine Learning · Computer Science 2019-05-21 Jongrae Kim

Gradient-based optimization has been critical to the success of machine learning, updating a single set of parameters to minimize a single loss. A growing number of applications rely on a generalization of this, where we have a bilevel or…

Machine Learning · Computer Science 2024-07-02 Jonathan Lorraine

Multilevel optimization has gained renewed interest in machine learning due to its promise in applications such as hyperparameter tuning and continual learning. However, existing methods struggle with the inherent difficulty of efficiently…

Machine Learning · Computer Science 2024-10-16 Yuntian Gu , Xuzheng Chen

Deep learning techniques are increasingly applied to scientific problems, where the precision of networks is crucial. Despite being deemed as universal function approximators, neural networks, in practice, struggle to reduce the prediction…

Machine Learning · Computer Science 2023-07-19 Yongji Wang , Ching-Yao Lai

The current deep learning model is of a single-grade, that is, it learns a deep neural network by solving a single nonconvex optimization problem. When the layer number of the neural network is large, it is computationally challenging to…

Machine Learning · Computer Science 2023-02-02 Yuesheng Xu

Current autoencoder-based disentangled representation learning methods achieve disentanglement by penalizing the (aggregate) posterior to encourage statistical independence of the latent factors. This approach introduces a trade-off between…

A major challenge in training large-scale machine learning models is configuring the training process to maximize model performance, i.e., finding the best training setup from a vast design space. In this work, we unlock a gradient-based…

Machine Learning · Statistics 2025-03-19 Logan Engstrom , Andrew Ilyas , Benjamin Chen , Axel Feldmann , William Moses , Aleksander Madry

As deep learning techniques advance more than ever, hyper-parameter optimization is the new major workload in deep learning clusters. Although hyper-parameter optimization is crucial in training deep learning models for high model…

Machine Learning · Computer Science 2019-11-26 Ahnjae Shin , Dong-Jin Shin , Sungwoo Cho , Do Yoon Kim , Eunji Jeong , Gyeong-In Yu , Byung-Gon Chun

Deep learning models are yielding increasingly better performances thanks to multiple factors. To be successful, model may have large number of parameters or complex architectures and be trained on large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant , Junqi Yin

Decision making algorithms are used in a multitude of different applications. Conventional approaches for designing decision algorithms employ principled and simplified modelling, based on which one can determine decisions via tractable…

Signal Processing · Electrical Eng. & Systems 2022-06-23 Nir Shlezinger , Yonina C. Eldar , Stephen P. Boyd

Deep structured-prediction energy-based models combine the expressive power of learned representations and the ability of embedding knowledge about the task at hand into the system. A common way to learn parameters of such models consists…

Machine Learning · Computer Science 2019-03-01 Aleksandr Shevchenko , Anton Osokin

In traditional topology optimization, the computing time required to iteratively update the material distribution within a design domain strongly depends on the complexity or size of the problem, limiting its application in real engineering…

Computational Engineering, Finance, and Science · Computer Science 2024-05-14 Gabriel Garayalde , Matteo Torzoni , Matteo Bruggi , Alberto Corigliano

Deep metric learning maps visually similar images onto nearby locations and visually dissimilar images apart from each other in an embedding manifold. The learning process is mainly based on the supplied image negative and positive training…

Computer Vision and Pattern Recognition · Computer Science 2020-09-14 Chang-Hui Liang , Wan-Lei Zhao , Run-Qing Chen

In this paper, we aim at providing an introduction to the gradient descent based optimization algorithms for learning deep neural network models. Deep learning models involving multiple nonlinear projection layers are very challenging to…

Machine Learning · Computer Science 2019-03-12 Jiawei Zhang

Solving different types of optimization models (including parameters fitting) for support vector machines on large-scale training data is often an expensive computational task. This paper proposes a multilevel algorithmic framework that…

Machine Learning · Statistics 2014-10-14 Talayeh Razzaghi , Ilya Safro

In domains such as health care and finance, shortage of labeled data and computational resources is a critical issue while developing machine learning algorithms. To address the issue of labeled data scarcity in training and deployment of…

Machine Learning · Computer Science 2018-10-16 Otkrist Gupta , Ramesh Raskar

This work formulates the machine learning mechanism as a bi-level optimization problem. The inner level optimization loop entails minimizing a properly chosen loss function evaluated on the training data. This is nothing but the…

Machine Learning · Computer Science 2023-01-27 Maziar Raissi

Optimal designs are usually model-dependent and likely to be sub-optimal if the postulated model is not correctly specified. In practice, it is common that a researcher has a list of candidate models at hand and a design has to be found…

Statistics Theory · Mathematics 2023-03-29 Mingyao Ai , Holger Dette , Zhengfu Liu , Jun Yu
‹ Prev 1 2 3 10 Next ›