Related papers: High-level python abstractions for optimal checkpo…

Combining Checkpointing and Data Compression to Accelerate Adjoint-Based Optimization Problems

Seismic inversion and imaging are adjoint-based optimization problems that process up to terabytes of data, regularly exceeding the memory capacity of available computers. Data compression is an effective strategy to reduce this memory…

Computational Engineering, Finance, and Science · Computer Science 2021-09-21 Navjot Kukreja , Jan Hueckelheim , Mathias Louboutin , Fabio Luporini , Paul Hovland , Gerard Gorman

Optimal Checkpointing for Adjoint Multistage Time-Stepping Schemes

We consider checkpointing strategies that minimize the number of recomputations needed when performing discrete adjoint computations using multistage time-stepping schemes, which requires computing several substeps within one complete time…

Mathematical Software · Computer Science 2022-04-29 Hong Zhang , Emil Constantinescu

Profiling checkpointing schedules in adjoint ST-AD

Checkpointing is a cornerstone of data-flow reversal in adjoint algorithmic differentiation. Checkpointing is a storage/recomputation trade-off that can be applied at different levels, one of which being the call tree. We are looking for…

Computation and Language · Computer Science 2024-09-13 Laurent Hascoët , Jean-Luc Bouchot , Shreyas Sunil Gaikwad , Sri Hari Krishna Narayanan , Jan Hückelheim

An autoencoder compression approach for accelerating large-scale inverse problems

PDE-constrained inverse problems are some of the most challenging and computationally demanding problems in computational science today. Fine meshes that are required to accurately compute the PDE solution introduce an enormous number of…

Numerical Analysis · Mathematics 2023-04-12 Jonathan Wittmer , Jacob Badger , Hari Sundar , Tan Bui-Thanh

Enabling user-driven Checkpointing strategies in Reverse-mode Automatic Differentiation

This paper presents a new functionality of the Automatic Differentiation (AD) tool Tapenade. Tapenade generates adjoint codes which are widely used for optimization or inverse problems. Unfortunately, for large applications the adjoint code…

Data Structures and Algorithms · Computer Science 2007-05-23 Laurent Hascoet , Mauricio Araya-Polo

Step-based checkpointing with high-level algorithmic differentiation

Automated code generation allows for a separation between the development of a model, expressed via a domain specific language, and lower level implementation details. Algorithmic differentiation can be applied symbolically at the level of…

Programming Languages · Computer Science 2024-09-27 James R. Maddison

Perturbation Resilience and Superiorization of Iterative Algorithms

Iterative algorithms aimed at solving some problems are discussed. For certain problems, such as finding a common point in the intersection of a finite number of convex sets, there often exist iterative algorithms that impose very little…

Optimization and Control · Mathematics 2010-09-28 Y. Censor , R. Davidi , G. T. Herman

High Performance Data Persistence in Non-Volatile Memory for Resilient High Performance Computing

Resilience is a major design goal for HPC. Checkpoint is the most common method to enable resilient HPC. Checkpoint periodically saves critical data objects to non-volatile storage to enable data persistence. However, using checkpoint, we…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-05-03 Yingchao Huang , Kai Wu , Dong Li

Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory

This paper introduces a new activation checkpointing method which allows to significantly decrease memory usage when training Deep Neural Networks with the back-propagation algorithm. Similarly to checkpoint-ing techniques coming from the…

Machine Learning · Computer Science 2019-12-02 Julien Herrmann , Olivier Beaumont , Lionel Eyraud-Dubois , Julien Hermann , Alexis Joly , Alena Shilova

Analysis of Recent Checkpointing Techniques for Mobile Computing Systems

Recovery from transient failures is one of the prime issues in the context of distributed systems. These systems demand to have transparent yet efficient techniques to achieve the same. Checkpoint is defined as a designated place in a…

Networking and Internet Architecture · Computer Science 2011-09-01 Ruchi Tuli , Parveen Kumar

From Reversible Computation to Checkpoint-Based Rollback Recovery for Message-Passing Concurrent Programs

The reliability of concurrent and distributed systems often depends on some well-known techniques for fault tolerance. One such technique is based on checkpointing and rollback recovery. Checkpointing involves processes to take snapshots of…

Programming Languages · Computer Science 2023-11-15 Germán Vidal

Convex optimization over intersection of simple sets: improved convergence rate guarantees via an exact penalty approach

We consider the problem of minimizing a convex function over the intersection of finitely many simple sets which are easy to project onto. This is an important problem arising in various domains such as machine learning. The main difficulty…

Optimization and Control · Mathematics 2017-10-19 Achintya Kundu , Francis Bach , Chiranjib Bhattacharyya

Improved penalty algorithm for Mixed Integer PDE Constrained Optimization Problems

Optimal control problems including partial differential equation (PDE) as well as integer constraints merge the combinatorial difficulties of integer programming and the challenges related to large-scale systems resulting from discretized…

Numerical Analysis · Mathematics 2021-09-09 Dominik Garmatter , Margherita Porcelli , Francesco Rinaldi , Martin Stoll

Algorithms and software for projections onto intersections of convex and non-convex sets with applications to inverse problems

We propose algorithms and software for computing projections onto the intersection of multiple convex and non-convex constraint sets. The software package, called SetIntersectionProjection, is intended for the regularization of inverse…

Mathematical Software · Computer Science 2019-03-08 Bas Peters , Felix J. Herrmann

Optimal Checkpoint Interval with Availability as an Objective Function

We present a simplified derivation of the optimal checkpoint interval in Young_1974 [1]. The optimal checkpoint interval derivation in [1] is based on minimizing the total lost time as an objective-function. Lost time is a function of…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-25 Nirmal Raj Saxena , Saurabh Hukerikar , Mikolaj Blaz , Swapna Raj

A memory-efficient neural ODE framework based on high-level adjoint differentiation

Neural ordinary differential equations (neural ODEs) have emerged as a novel network architecture that bridges dynamical systems and deep learning. However, the gradient obtained with the continuous adjoint method in the vanilla neural ODE…

Machine Learning · Computer Science 2023-06-12 Hong Zhang , Wenjun Zhao

High-dimensional changepoint estimation via sparse projection

Changepoints are a very common feature of Big Data that arrive in the form of a data stream. In this paper, we study high-dimensional time series in which, at certain time points, the mean structure changes in a sparse subset of the…

Methodology · Statistics 2017-03-21 Tengyao Wang , Richard J. Samworth

Ordering for Non-Replacement SGD

One approach for reducing run time and improving efficiency of machine learning is to reduce the convergence rate of the optimization algorithm used. Shuffling is an algorithm technique that is widely used in machine learning, but it only…

Machine Learning · Computer Science 2023-06-29 Yuetong Xu , Baharan Mirzasoleiman

Projection-Based Iterative Mode Scheduling for Switched Systems

This paper describes a method for scheduling the events of a switched system to achieve an optimal performance. The approach has guarantees on convergence and computational complexity that parallel derivative-based iterative optimization…

Optimization and Control · Mathematics 2017-09-11 Timothy Caldwell , Todd Murphey

Interpolation Constraints for Computing Worst-Case Bounds in Performance Estimation Problems

The Performance Estimation Problem (PEP) approach consists in computing worst-case performance bounds on optimization algorithms by solving an optimization problem: one maximizes an error criterion over all initial conditions allowed and…

Optimization and Control · Mathematics 2024-02-13 Anne Rubbens , Nizar Bousselmi , Sebastien Colla , Julien M. Hendrickx