Related papers: Exploiting Problem Structure in Deep Declarative N…

Training Deep Networks with Structured Layers by Matrix Backpropagation

Deep neural network architectures have recently produced excellent results in a variety of areas in artificial intelligence and visual recognition, well surpassing traditional shallow architectures trained using hand-designed features. The…

Computer Vision and Pattern Recognition · Computer Science 2016-04-15 Catalin Ionescu , Orestis Vantzos , Cristian Sminchisescu

Decentralized Implicit Differentiation

The ability to differentiate through optimization problems has unlocked numerous applications, from optimization-based layers in machine learning models to complex design problems formulated as bilevel programs. It has been shown that…

Optimization and Control · Mathematics 2024-03-05 Lucas Fuentes Valenzuela , Robin Brown , Marco Pavone

Deep Declarative Networks: A New Hope

We explore a new class of end-to-end learnable models wherein data processing nodes (or network layers) are defined in terms of desired behavior rather than an explicit forward function. Specifically, the forward function is implicitly…

Machine Learning · Computer Science 2021-08-20 Stephen Gould , Richard Hartley , Dylan Campbell

On the performance of deep learning for numerical optimization: an application to protein structure prediction

Deep neural networks have recently drawn considerable attention to build and evaluate artificial learning models for perceptual tasks. Here, we present a study on the performance of the deep learning models to deal with global optimization…

Neural and Evolutionary Computing · Computer Science 2020-12-18 Hojjat Rakhshani , Lhassane Idoumghar , Soheila Ghambari , Julien Lepagnot , Mathieu Brévilliers

Computational issues in Optimization for Deep networks

The paper aims to investigate relevant computational issues of deep neural network architectures with an eye to the interaction between the optimization algorithm and the classification performance. In particular, we aim to analyze the…

Optimization and Control · Mathematics 2024-05-06 Corrado Coppola , Lorenzo Papa , Marco Boresta , Irene Amerini , Laura Palagi

Deep Learning for Explicitly Modeling Optimization Landscapes

In all but the most trivial optimization problems, the structure of the solutions exhibit complex interdependencies between the input parameters. Decades of research with stochastic search techniques has shown the benefit of explicitly…

Neural and Evolutionary Computing · Computer Science 2017-03-23 Shumeet Baluja

Reframing Neural Networks: Deep Structure in Overcomplete Representations

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock , George Cazenavette , Simon Lucey

DeepMapping: Learned Data Mapping for Lossless Compression and Efficient Lookup

Storing tabular data to balance storage and query efficiency is a long-standing research question in the database community. In this work, we argue and show that a novel DeepMapping abstraction, which relies on the impressive memorization…

Databases · Computer Science 2024-09-27 Lixi Zhou , K. Selçuk Candan , Jia Zou

Profile-guided memory optimization for deep neural networks

Recent years have seen deep neural networks (DNNs) becoming wider and deeper to achieve better performance in many applications of AI. Such DNNs however require huge amounts of memory to store weights and intermediate results (e.g.,…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-27 Taro Sekiyama , Takashi Imamichi , Haruki Imai , Rudy Raymond

Large Scale Structure of Neural Network Loss Landscapes

There are many surprising and perhaps counter-intuitive properties of optimization of deep neural networks. We propose and experimentally verify a unified phenomenological model of the loss landscape that incorporates many of them. High…

Machine Learning · Computer Science 2019-06-12 Stanislav Fort , Stanislaw Jastrzebski

Exact Structure Learning of Bayesian Networks by Optimal Path Extension

Bayesian networks are probabilistic graphical models often used in big data analytics. The problem of exact structure learning is to find a network structure that is optimal under certain scoring criteria. The problem is known to be NP-hard…

Artificial Intelligence · Computer Science 2017-03-22 Subhadeep Karan , Jaroslaw Zola

Structure Probing Neural Network Deflation

Deep learning is a powerful tool for solving nonlinear differential equations, but usually, only the solution corresponding to the flattest local minimizer can be found due to the implicit regularization of stochastic gradient descent. This…

Numerical Analysis · Mathematics 2021-03-17 Yiqi Gu , Chunmei Wang , Haizhao Yang

Structure preserving deep learning

Over the past few years, deep learning has risen to the foreground as a topic of massive interest, mainly as a result of successes obtained in solving large-scale image processing tasks. There are multiple challenging mathematical problems…

Machine Learning · Computer Science 2020-06-08 Elena Celledoni , Matthias J. Ehrhardt , Christian Etmann , Robert I McLachlan , Brynjulf Owren , Carola-Bibiane Schönlieb , Ferdia Sherry

Deep Reinforcement Learning for Dynamic Urban Transportation Problems

We explore the use of deep learning and deep reinforcement learning for optimization problems in transportation. Many transportation system analysis tasks are formulated as an optimization problem - such as optimal control problems in…

Machine Learning · Statistics 2018-06-15 Laura Schultz , Vadim Sokolov

Primal-dual residual networks

In this work, we propose a deep neural network architecture motivated by primal-dual splitting methods from convex optimization. We show theoretically that there exists a close relation between the derived architecture and residual…

Machine Learning · Statistics 2018-06-18 Christoph Brauer , Dirk Lorenz

Scalable and Sustainable Deep Learning via Randomized Hashing

Current deep learning architectures are growing larger in order to learn from complex datasets. These architectures require giant matrix multiplication operations to train millions of parameters. Conversely, there is another growing trend…

Machine Learning · Statistics 2016-12-06 Ryan Spring , Anshumali Shrivastava

Identifying and Exploiting Structures for Reliable Deep Learning

Deep learning research has recently witnessed an impressively fast-paced progress in a wide range of tasks including computer vision, natural language processing, and reinforcement learning. The extraordinary performance of these systems…

Machine Learning · Computer Science 2021-08-17 Amartya Sanyal

End-to-End Differentiable Proving

We introduce neural networks for end-to-end differentiable proving of queries to knowledge bases by operating on dense vector representations of symbols. These neural networks are constructed recursively by taking inspiration from the…

Neural and Evolutionary Computing · Computer Science 2017-12-05 Tim Rocktäschel , Sebastian Riedel

Backpropagation of Unrolled Solvers with Folded Optimization

The integration of constrained optimization models as components in deep networks has led to promising advances on many specialized learning tasks. A central challenge in this setting is backpropagation through the solution of an…

Machine Learning · Computer Science 2023-09-06 James Kotary , My H. Dinh , Ferdinando Fioretto

InverseNet: Solving Inverse Problems with Splitting Networks

We propose a new method that uses deep learning techniques to solve the inverse problems. The inverse problem is cast in the form of learning an end-to-end mapping from observed data to the ground-truth. Inspired by the splitting strategy…

Computer Vision and Pattern Recognition · Computer Science 2017-12-04 Kai Fan , Qi Wei , Wenlin Wang , Amit Chakraborty , Katherine Heller