Related papers: Layer Flexible Adaptive Computational Time

Variable Computation in Recurrent Neural Networks

Recurrent neural networks (RNNs) have been used extensively and with increasing success to model various types of sequential data. Much of this progress has been achieved through devising recurrent units and architectures with the…

Machine Learning · Statistics 2017-03-06 Yacine Jernite , Edouard Grave , Armand Joulin , Tomas Mikolov

Comparing Fixed and Adaptive Computation Time for Recurrent Neural Networks

Adaptive Computation Time for Recurrent Neural Networks (ACT) is one of the most promising architectures for variable computation. ACT adapts to the input sequence by being able to look at each sample more than once, and learn how many…

Neural and Evolutionary Computing · Computer Science 2018-03-23 Daniel Fojo , Víctor Campos , Xavier Giro-i-Nieto

Adaptive Recurrent Neural Network Based on Mixture Layer

Although Recurrent Neural Network (RNN) has been a powerful tool for modeling sequential data, its performance is inadequate when processing sequences with multiple patterns. In this paper, we address this challenge by introducing a novel…

Machine Learning · Computer Science 2019-02-28 Kui Zhao , Yuechuan Li , Chi Zhang , Cheng Yang , Huan Xu

Adaptive Neural Networks for Efficient Inference

We present an approach to adaptively utilize deep neural networks in order to reduce the evaluation time on new examples without loss of accuracy. Rather than attempting to redesign or approximate existing networks, we propose two schemes…

Machine Learning · Computer Science 2017-09-20 Tolga Bolukbasi , Joseph Wang , Ofer Dekel , Venkatesh Saligrama

Learning to Adaptively Scale Recurrent Neural Networks

Recent advancements in recurrent neural network (RNN) research have demonstrated the superiority of utilizing multiscale structures in learning temporal representations of time series. Currently, most of multiscale RNNs use fixed scales,…

Machine Learning · Computer Science 2019-02-18 Hao Hu , Liqiang Wang , Guo-Jun Qi

Spatially Adaptive Computation Time for Residual Networks

This paper proposes a deep learning architecture based on Residual Network that dynamically adjusts the number of executed layers for the regions of the image. This architecture is end-to-end trainable, deterministic and problem-agnostic.…

Computer Vision and Pattern Recognition · Computer Science 2017-07-04 Michael Figurnov , Maxwell D. Collins , Yukun Zhu , Li Zhang , Jonathan Huang , Dmitry Vetrov , Ruslan Salakhutdinov

Adaptive Growth: Real-time CNN Layer Expansion

Deep Neural Networks (DNNs) have shown unparalleled achievements in numerous applications, reflecting their proficiency in managing vast data sets. Yet, their static structure limits their adaptability in ever-changing environments. This…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Yunjie Zhu , Yunhao Chen

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

Models based on deep convolutional networks have dominated recent image interpretation tasks; we investigate whether models which are also recurrent, or "temporally deep", are effective for tasks involving sequences, visual and otherwise.…

Computer Vision and Pattern Recognition · Computer Science 2016-06-02 Jeff Donahue , Lisa Anne Hendricks , Marcus Rohrbach , Subhashini Venugopalan , Sergio Guadarrama , Kate Saenko , Trevor Darrell

Dynamic Neural Networks: A Survey

Dynamic neural network is an emerging research topic in deep learning. Compared to static models which have fixed computational graphs and parameters at the inference stage, dynamic networks can adapt their structures or parameters to…

Computer Vision and Pattern Recognition · Computer Science 2021-12-03 Yizeng Han , Gao Huang , Shiji Song , Le Yang , Honghui Wang , Yulin Wang

Adaptive control of recurrent neural networks using conceptors

Recurrent Neural Networks excel at predicting and generating complex high-dimensional temporal patterns. Due to their inherent nonlinear dynamics and memory, they can learn unbounded temporal dependencies from data. In a Machine Learning…

Machine Learning · Computer Science 2024-05-14 Guillaume Pourcel , Mirko Goldmann , Ingo Fischer , Miguel C. Soriano

Feed-forward approximations to dynamic recurrent network architectures

Recurrent neural network architectures can have useful computational properties, with complex temporal dynamics and input-sensitive attractor states. However, evaluation of recurrent dynamic architectures requires solution of systems of…

Neural and Evolutionary Computing · Computer Science 2019-11-18 Dylan Richard Muir

Recurrent Models of Visual Attention

Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is capable of…

Machine Learning · Computer Science 2014-06-25 Volodymyr Mnih , Nicolas Heess , Alex Graves , Koray Kavukcuoglu

Adaptive Computation Time for Recurrent Neural Networks

This paper introduces Adaptive Computation Time (ACT), an algorithm that allows recurrent neural networks to learn how many computational steps to take between receiving an input and emitting an output. ACT requires minimal changes to the…

Neural and Evolutionary Computing · Computer Science 2017-02-22 Alex Graves

Deep Recurrent Neural Networks for Time Series Prediction

Ability of deep networks to extract high level features and of recurrent networks to perform time-series inference have been studied. In view of universality of one hidden layer network at approximating functions under weak constraints, the…

Neural and Evolutionary Computing · Computer Science 2014-12-19 Sharat C. Prasad , Piyush Prasad

Flexible Deep Neural Network Processing

The recent success of Deep Neural Networks (DNNs) has drastically improved the state of the art for many application domains. While achieving high accuracy performance, deploying state-of-the-art DNNs is a challenge since they typically…

Neural and Evolutionary Computing · Computer Science 2018-01-24 Hokchhay Tann , Soheil Hashemi , Sherief Reda

Learning The Sequential Temporal Information with Recurrent Neural Networks

Recurrent Networks are one of the most powerful and promising artificial neural network algorithms to processing the sequential data such as natural languages, sound, time series data. Unlike traditional feed-forward network, Recurrent…

Machine Learning · Computer Science 2018-07-11 Pushparaja Murugan

Liquid Time-constant Networks

We introduce a new class of time-continuous recurrent neural network models. Instead of declaring a learning system's dynamics by implicit nonlinearities, we construct networks of linear first-order dynamical systems modulated via nonlinear…

Machine Learning · Computer Science 2020-12-16 Ramin Hasani , Mathias Lechner , Alexander Amini , Daniela Rus , Radu Grosu

Depth-Adaptive Transformer

State of the art sequence-to-sequence models for large scale tasks perform a fixed number of computations for each input sequence regardless of whether it is easy or hard to process. In this paper, we train Transformer models which can make…

Computation and Language · Computer Science 2020-02-18 Maha Elbayad , Jiatao Gu , Edouard Grave , Michael Auli

Recurrent Ladder Networks

We propose a recurrent extension of the Ladder networks whose structure is motivated by the inference required in hierarchical latent variable models. We demonstrate that the recurrent Ladder is able to handle a wide variety of complex…

Neural and Evolutionary Computing · Computer Science 2017-12-19 Isabeau Prémont-Schwarz , Alexander Ilin , Tele Hotloo Hao , Antti Rasmus , Rinu Boney , Harri Valpola

Efficient Time-Series Approximation with Linear Recurrent Neural Networks: Architecture Learning and Predictive Power

Recurrent neural networks are a powerful means to cope with time series. We show how autoregressive linear, i.e., linearly activated recurrent neural networks (LRNNs) can approximate any time-dependent function f(t). The approximation can…

Machine Learning · Computer Science 2025-10-01 Frieder Stolzenburg , Sandra Litz , Olivia Michael , Oliver Obst