Related papers: Accelerating Flash Calculation through Deep Learni…

Acceleration of the NVT-flash calculation for multicomponent mixtures using deep neural network models

Phase equilibrium calculation, also known as flash calculation, has been extensively applied in petroleum engineering, not only as a standalone application for separation process but also an integral component of compositional reservoir…

Computational Physics · Physics 2019-01-29 Yiteng Li, Tao Zhang, Shuyu Sun

Fast Calculation of Probabilistic Power Flow: A Model-based Deep Learning Approach

Probabilistic power flow (PPF) plays a critical role in power system analysis. However, the high computational burden makes it challenging for the practical implementation of PPF. This paper proposes a model-based deep learning approach to…

Signal Processing · Electrical Eng. & Systems 2019-09-17 Yan Yang, Zhifang Yang, Juan Yu, Baosen Zhang

Look-ups are not (yet) all you need for deep learning inference

Fast approximations to matrix multiplication have the potential to dramatically reduce the cost of neural network inference. Recent work on approximate matrix multiplication proposed to replace costly multiplications with table-lookups by…

Machine Learning · Computer Science 2022-07-14 Calvin McCarter, Nicholas Dronen

FastFlow: Accelerating The Generative Flow Matching Models with Bandit Inference

Flow-matching models deliver state-of-the-art fidelity in image and video generation, but the inherent sequential denoising process renders them slower. Existing acceleration methods like distillation, trajectory truncation, and consistency…

Computer Vision and Pattern Recognition · Computer Science 2026-02-12 Divya Jyoti Bajpai, Dhruv Bhardwaj, Soumya Roy, Tejas Duseja, Harsh Agarwal, Aashay Sandansing, Manjesh Kumar Hanawal

Distributed Deep Learning using Stochastic Gradient Staleness

Despite the notable success of deep neural networks (DNNs) in solving complex tasks, the training process still remains considerable challenges. A primary obstacle is the substantial time required for training, particularly as high…

Machine Learning · Computer Science 2025-09-09 Viet Hoang Pham, Hyo-Sung Ahn

A fast neural hybrid Newton solver adapted to implicit methods for nonlinear dynamics

The use of implicit time-stepping schemes for the numerical approximation of solutions to stiff nonlinear time-evolution equations brings well-known advantages including, typically, better stability behaviour and corresponding support of…

Numerical Analysis · Mathematics 2025-02-14 Tianyu Jin, Georg Maierhofer, Katharina Schratz, Yang Xiang

Speeding Up Mixed-Integer Programming Solvers with Sparse Learning for Branching

Machine learning is increasingly used to improve decisions within branch-and-bound algorithms for mixed-integer programming. Many existing approaches rely on deep learning, which often requires very large training datasets and substantial…

Machine Learning · Computer Science 2026-04-02 Selin Bayramoğlu, George L Nemhauser, Nikolaos V Sahinidis

Principled Acceleration of Iterative Numerical Methods Using Machine Learning

Iterative methods are ubiquitous in large-scale scientific computing applications, and a number of approaches based on meta-learning have been recently proposed to accelerate them. However, a systematic study of these approaches and how…

Numerical Analysis · Mathematics 2023-01-31 Sohei Arisaka, Qianxiao Li

Accelerated Forward-Backward Optimization using Deep Learning

We propose several deep-learning accelerated optimization solvers with convergence guarantees. We use ideas from the analysis of accelerated forward-backward schemes like FISTA, but instead of the classical approach of proving convergence…

Optimization and Control · Mathematics 2021-05-12 Sebastian Banert, Jevgenija Rudzusika, Ozan Öktem, Jonas Adler

Accelerating Deep Learning with Dynamic Data Pruning

Deep learning's success has been attributed to the training of large, overparameterized models on massive amounts of data. As this trend continues, model training has become prohibitively costly, requiring access to powerful computing…

Machine Learning · Computer Science 2021-11-25 Ravi S Raju, Kyle Daruwalla, Mikko Lipasti

Learning Instance-wise Sparsity for Accelerating Deep Models

Exploring deep convolutional neural networks of high efficiency and low memory usage is very essential for a wide variety of machine learning tasks. Most of existing approaches used to accelerate deep models by manipulating parameters or…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Chuanjian Liu, Yunhe Wang, Kai Han, Chunjing Xu, Chang Xu

A Statistical Theory of Deep Learning via Proximal Splitting

In this paper we develop a statistical theory and an implementation of deep learning models. We show that an elegant variable splitting scheme for the alternating direction method of multipliers optimises a deep learning objective. We allow…

Machine Learning · Statistics 2015-09-22 Nicholas G. Polson, Brandon T. Willard, Massoud Heidari

A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks

The field of deep learning has witnessed a remarkable shift towards extremely compute- and memory-intensive neural networks. These newer larger models have enabled researchers to advance state-of-the-art tools across a variety of fields.…

Machine Learning · Computer Science 2022-07-04 Daniel Nichols, Siddharth Singh, Shu-Huai Lin, Abhinav Bhatele

Intelligent gradient amplification for deep neural networks

Deep learning models offer superior performance compared to other machine learning techniques for a variety of tasks and domains, but pose their own challenges. In particular, deep learning models require larger training times as the depth…

Machine Learning · Computer Science 2023-05-31 Sunitha Basodi, Krishna Pusuluri, Xueli Xiao, Yi Pan

Improving Continual Learning Performance and Efficiency with Auxiliary Classifiers

Continual learning is crucial for applying machine learning in challenging, dynamic, and often resource-constrained environments. However, catastrophic forgetting - overwriting previously learned knowledge when new information is acquired -…

Machine Learning · Computer Science 2025-05-30 Filip Szatkowski, Yaoyue Zheng, Fei Yang, Bartłomiej Twardowski, Tomasz Trzciński, Joost van de Weijer

Balance is Essence: Accelerating Sparse Training via Adaptive Gradient Correction

Despite impressive performance, deep neural networks require significant memory and computation costs, prohibiting their application in resource-constrained scenarios. Sparse training is one of the most common techniques to reduce these…

Machine Learning · Computer Science 2023-12-06 Bowen Lei, Dongkuan Xu, Ruqi Zhang, Shuren He, Bani K. Mallick

Growing Deep Neural Network Considering with Similarity between Neurons

Deep learning has excelled in image recognition tasks through neural networks inspired by the human brain. However, the necessity for large models to improve prediction accuracy introduces significant computational demands and extended…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Taigo Sakai, Kazuhiro Hotta

Efficient Neural Network Training via Forward and Backward Propagation Sparsification

Sparse training is a natural idea to accelerate the training speed of deep neural networks and save the memory usage, especially since large modern neural networks are significantly over-parameterized. However, most of the existing methods…

Machine Learning · Computer Science 2021-11-11 Xiao Zhou, Weizhong Zhang, Zonghao Chen, Shizhe Diao, Tong Zhang

Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference

The ever-increasing sizes of large language models necessitate distributed solutions for fast inference that exploit multi-dimensional parallelism, where computational loads are split across various accelerators such as GPU clusters.…

Artificial Intelligence · Computer Science 2024-12-12 Qingyuan Li, Bo Zhang, Liang Ye, Yifan Zhang, Wei Wu, Yerui Sun, Lin Ma, Yuchen Xie

Phasor-Driven Acceleration for FFT-based CNNs

Recent research in deep learning (DL) has investigated the use of the Fast Fourier Transform (FFT) to accelerate the computations involved in Convolutional Neural Networks (CNNs) by replacing spatial convolution with element-wise…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Eduardo Reis, Thangarajah Akilan, Mohammed Khalid