Related papers: Learning to Stop While Learning to Predict

Being Patient and Persistent: Optimizing An Early Stopping Strategy for Deep Learning in Profiled Attacks

The absence of an algorithm that effectively monitors deep learning models used in side-channel attacks increases the difficulty of evaluation. If the attack is unsuccessful, the question is if we are dealing with a resistant implementation…

Cryptography and Security · Computer Science 2021-11-30 Servio Paguada, Lejla Batina, Ileana Buhan, Igor Armendariz

Understanding Deep Architectures with Reasoning Layer

Recently, there has been a surge of interest in combining deep learning models with reasoning in order to handle more sophisticated learning tasks. In many cases, a reasoning task can be solved by an iterative algorithm. This algorithm is…

Machine Learning · Computer Science 2020-11-02 Xinshi Chen, Yufei Zhang, Christoph Reisinger, Le Song

Deep Learning for Predictive Analytics in Reversible Steganography

Deep learning is regarded as a promising solution for reversible steganography. There is an accelerating trend of representing a reversible stego-system by monolithic neural networks, which bypass intermediate operations in traditional…

Multimedia · Computer Science 2023-03-08 Ching-Chun Chang, Xu Wang, Sisheng Chen, Isao Echizen, Victor Sanchez, Chang-Tsun Li

Deep Learning for the Multiple Optimal Stopping Problem

This paper presents a novel deep learning framework for solving multiple optimal stopping problems in high dimensions. While deep learning has recently shown promise for single stopping problems, the multiple exercise case involves complex…

Optimization and Control · Mathematics 2025-12-30 Mathieu Laurière, Mehdi Talbi

Learning with Differentiable Algorithms

Classic algorithms and machine learning systems like neural networks are both abundant in everyday life. While classic computer science algorithms are suitable for precise execution of exactly defined tasks such as finding the shortest path…

Machine Learning · Computer Science 2022-09-02 Felix Petersen

A multi-stage deep learning based algorithm for multiscale model reduction

In this work, we propose a multi-stage training strategy for the development of deep learning algorithms applied to problems with multiscale features. Each stage of the proposed strategy shares an (almost) identical network structure and…

Numerical Analysis · Mathematics 2020-09-25 Eric Chung, Wing Tat Leung, Sai-Mang Pun, Zecheng Zhang

Architecture Matters in Continual Learning

A large body of research in continual learning is devoted to overcoming the catastrophic forgetting of neural networks by designing new algorithms that are robust to the distribution shifts. However, the majority of these works are strictly…

Machine Learning · Computer Science 2022-02-02 Seyed Iman Mirzadeh, Arslan Chaudhry, Dong Yin, Timothy Nguyen, Razvan Pascanu, Dilan Gorur, Mehrdad Farajtabar

Adaptive Neural Architectures for Recommender Systems

Deep learning has proved an effective means to capture the non-linear associations of user preferences. However, the main drawback of existing deep learning architectures is that they follow a fixed recommendation strategy, ignoring users'…

Information Retrieval · Computer Science 2020-12-02 Dimitrios Rafailidis, Stefanos Antaris

Step by Step Network

Scaling up network depth is a fundamental pursuit in neural architecture design, as theory suggests that deeper models offer exponentially greater capability. Benefiting from the residual connections, modern neural networks can scale up to…

Computer Vision and Pattern Recognition · Computer Science 2025-11-19 Dongchen Han, Tianzhu Ye, Zhuofan Xia, Kaiyi Chen, Yulin Wang, Hanting Chen, Gao Huang

Layer-wise learning of deep generative models

When using deep, multi-layered architectures to build generative models of data, it is difficult to train all layers at once. We propose a layer-wise training procedure admitting a performance guarantee compared to the global optimum. It is…

Neural and Evolutionary Computing · Computer Science 2013-02-19 Ludovic Arnold, Yann Ollivier

Bayesian Structure Adaptation for Continual Learning

Continual Learning is a learning paradigm where learning systems are trained with sequential or streaming tasks. Two notable directions among the recent advances in continual learning with neural networks are ($i$) variational Bayes based…

Machine Learning · Computer Science 2020-02-24 Abhishek Kumar, Sunabha Chatterjee, Piyush Rai

Training Very Deep Networks

Theoretical and empirical evidence indicates that the depth of neural networks is crucial for their success. However, training becomes more difficult as depth increases, and training of very deep networks remains an open problem. Here we…

Machine Learning · Computer Science 2015-11-24 Rupesh Kumar Srivastava, Klaus Greff, Jürgen Schmidhuber

Latent Space Policies for Hierarchical Reinforcement Learning

We address the problem of learning hierarchical deep neural network policies for reinforcement learning. In contrast to methods that explicitly restrict or cripple lower layers of a hierarchy to force them to use higher-level modulating…

Machine Learning · Computer Science 2018-09-05 Tuomas Haarnoja, Kristian Hartikainen, Pieter Abbeel, Sergey Levine

A Review of Meta-level Learning in the Context of Multi-component, Multi-level Evolving Prediction Systems

The exponential growth of volume, variety and velocity of data is raising the need for investigations of automated or semi-automated ways to extract useful patterns from the data. It requires deep expert knowledge and extensive…

Machine Learning · Computer Science 2020-07-22 Abbas Raza Ali, Marcin Budka, Bogdan Gabrys

Understanding Learning Dynamics Through Structured Representations

While modern deep networks have demonstrated remarkable versatility, their training dynamics remain poorly understood--often driven more by empirical tweaks than architectural insight. This paper investigates how internal structural choices…

Machine Learning · Computer Science 2025-08-26 Saleh Nikooroo, Thomas Engel

Deep Multiscale Model Learning

The objective of this paper is to design novel multi-layer neural network architectures for multiscale simulations of flows taking into account the observed data and physical modeling concepts. Our approaches use deep learning concepts…

Numerical Analysis · Mathematics 2018-06-14 Yating Wang, Siu Wun Cheung, Eric T. Chung, Yalchin Efendiev, Min Wang

Learning Deep Stochastic Optimal Control Policies using Forward-Backward SDEs

In this paper we propose a new methodology for decision-making under uncertainty using recent advancements in the areas of nonlinear stochastic optimal control theory, applied mathematics, and machine learning. Grounded on the fundamental…

Robotics · Computer Science 2021-07-12 Marcus Pereira, Ziyi Wang, Ioannis Exarchos, Evangelos A. Theodorou

Exploring the design space of deep-learning-based weather forecasting systems

Despite tremendous progress in developing deep-learning-based weather forecasting systems, their design space, including the impact of different design choices, is yet to be well understood. This paper aims to fill this knowledge gap by…

Machine Learning · Computer Science 2024-10-11 Shoaib Ahmed Siddiqui, Jean Kossaifi, Boris Bonev, Christopher Choy, Jan Kautz, David Krueger, Kamyar Azizzadenesheli

Structured Memory based Deep Model to Detect as well as Characterize Novel Inputs

While deep learning has pushed the boundaries in various machine learning tasks, the current models are still far away from replicating many functions that a normal human brain can do. Explicit memorization based deep architecture have been…

Computer Vision and Pattern Recognition · Computer Science 2018-01-31 Pratik Prabhanjan Brahma, Qiuyuan Huang, Dapeng Wu

Depth-Adaptive Neural Networks from the Optimal Control viewpoint

In recent years, deep learning has been connected with optimal control as a way to define a notion of a continuous underlying learning problem. In this view, neural networks can be interpreted as a discretization of a parametric Ordinary…

Optimization and Control · Mathematics 2020-07-07 Joubine Aghili, Olga Mula