Related papers: Rectified Factor Networks

Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces

In order to integrate uncertainty estimates into deep time-series modelling, Kalman Filters (KFs) (Kalman et al., 1960) have been integrated with deep learning models, however, such approaches typically rely on approximate inference…

Machine Learning · Computer Science 2019-05-20 Philipp Becker , Harit Pandya , Gregor Gebhardt , Cheng Zhao , James Taylor , Gerhard Neumann

Hidden-Fold Networks: Random Recurrent Residuals Using Sparse Supermasks

Deep neural networks (DNNs) are so over-parametrized that recent research has found them to already contain a subnetwork with high accuracy at their randomly initialized state. Finding these subnetworks is a viable alternative training…

Computer Vision and Pattern Recognition · Computer Science 2021-11-25 Ángel López García-Arias , Masanori Hashimoto , Masato Motomura , Jaehoon Yu

A Fully Tensorized Recurrent Neural Network

Recurrent neural networks (RNNs) are powerful tools for sequential modeling, but typically require significant overparameterization and regularization to achieve optimal performance. This leads to difficulties in the deployment of large…

Machine Learning · Computer Science 2021-11-11 Charles C. Onu , Jacob E. Miller , Doina Precup

Recurrent Neural Networks With Limited Numerical Precision

Recurrent Neural Networks (RNNs) produce state-of-art performance on many machine learning tasks but their demand on resources in terms of memory and computational power are often high. Therefore, there is a great interest in optimizing the…

Neural and Evolutionary Computing · Computer Science 2017-02-28 Joachim Ott , Zhouhan Lin , Ying Zhang , Shih-Chii Liu , Yoshua Bengio

Low Rank Factorizations are Indirect Encodings for Deep Neuroevolution

Deep neuroevolution is a highly scalable alternative to reinforcement learning due to its unique ability to encode network updates in a small number of bytes. Recent insights from traditional deep learning indicate high-dimensional models…

Neural and Evolutionary Computing · Computer Science 2025-04-07 Jack Garbus , Jordan Pollack

Regularizing linear inverse problems with convolutional neural networks

Deep convolutional neural networks trained on large datsets have emerged as an intriguing alternative for compressing images and solving inverse problems such as denoising and compressive sensing. However, it has only recently been realized…

Machine Learning · Computer Science 2019-07-09 Reinhard Heckel

Adaptive Low-Rank Factorization to regularize shallow and deep neural networks

The overfitting is one of the cursing subjects in the deep learning field. To solve this challenge, many approaches were proposed to regularize the learning models. They add some hyper-parameters to the model to extend the generalization;…

Machine Learning · Computer Science 2020-05-06 Mohammad Mahdi Bejani , Mehdi Ghatee

Inference of Sparse Networks with Unobserved Variables. Application to Gene Regulatory Networks

Networks are a unifying framework for modeling complex systems and network inference problems are frequently encountered in many fields. Here, I develop and apply a generative approach to network inference (RCweb) for the case when the…

Machine Learning · Statistics 2014-07-01 Nikolai Slavov

Feature Flow Regularization: Improving Structured Sparsity in Deep Neural Networks

Pruning is a model compression method that removes redundant parameters in deep neural networks (DNNs) while maintaining accuracy. Most available filter pruning methods require complex treatments such as iterative pruning, features…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Yue Wu , Yuan Lan , Luchan Zhang , Yang Xiang

Robust Learning with Frequency Domain Regularization

Convolution neural networks have achieved remarkable performance in many tasks of computing vision. However, CNN tends to bias to low frequency components. They prioritize capturing low frequency patterns which lead them fail when suffering…

Machine Learning · Computer Science 2020-07-08 Weiyu Guo , Yidong Ouyang

Deep Neural Networks pruning via the Structured Perspective Regularization

In Machine Learning, Artificial Neural Networks (ANNs) are a very powerful tool, broadly used in many applications. Often, the selected (deep) architectures include many layers, and therefore a large amount of parameters, which makes…

Machine Learning · Computer Science 2022-06-29 Matteo Cacciola , Antonio Frangioni , Xinlin Li , Andrea Lodi

Variance-Reducing Couplings for Random Features

Random features (RFs) are a popular technique to scale up kernel methods in machine learning, replacing exact kernel evaluations with stochastic Monte Carlo estimates. They underpin models as diverse as efficient transformers (by…

Machine Learning · Statistics 2024-10-04 Isaac Reid , Stratis Markou , Krzysztof Choromanski , Richard E. Turner , Adrian Weller

RFFNet: Large-Scale Interpretable Kernel Methods via Random Fourier Features

Kernel methods provide a flexible and theoretically grounded approach to nonlinear and nonparametric learning. While memory and run-time requirements hinder their applicability to large datasets, many low-rank kernel approximations, such as…

Machine Learning · Statistics 2024-04-15 Mateus P. Otto , Rafael Izbicki

Predicting Confusion from Eye-Tracking Data with Recurrent Neural Networks

Encouraged by the success of deep learning in a variety of domains, we investigate the suitability and effectiveness of Recurrent Neural Networks (RNNs) in a domain where deep learning has not yet been used; namely detecting confusion from…

Computer Vision and Pattern Recognition · Computer Science 2019-06-27 Shane D. Sims , Vanessa Putnam , Cristina Conati

Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification

Twisted Convolutional Networks (TCNs) are proposed as a novel deep learning architecture for classifying one-dimensional data with arbitrary feature order and minimal spatial relationships. Unlike conventional Convolutional Neural Networks…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Junbo Jacob Lian , Haoran Chen , Kaichen Ouyang , Yujun Zhang , Rui Zhong , Huiling Chen

Convolutional Neural Networks with Recurrent Neural Filters

We introduce a class of convolutional neural networks (CNNs) that utilize recurrent neural networks (RNNs) as convolution filters. A convolution filter is typically implemented as a linear affine transformation followed by a non-linear…

Computation and Language · Computer Science 2018-08-29 Yi Yang

Particle Filter Recurrent Neural Networks

Recurrent neural networks (RNNs) have been extraordinarily successful for prediction with sequential data. To tackle highly variable and noisy real-world data, we introduce Particle Filter Recurrent Neural Networks (PF-RNNs), a new RNN…

Machine Learning · Computer Science 2019-12-03 Xiao Ma , Peter Karkus , David Hsu , Wee Sun Lee

Reversible Recurrent Neural Networks

Recurrent neural networks (RNNs) provide state-of-the-art performance in processing sequential data but are memory intensive to train, limiting the flexibility of RNN models which can be trained. Reversible RNNs---RNNs for which the…

Machine Learning · Computer Science 2018-10-26 Matthew MacKay , Paul Vicol , Jimmy Ba , Roger Grosse

On the challenges of learning with inference networks on sparse, high-dimensional data

We study parameter estimation in Nonlinear Factor Analysis (NFA) where the generative model is parameterized by a deep neural network. Recent work has focused on learning such models using inference (or recognition) networks; we identify a…

Machine Learning · Statistics 2017-10-18 Rahul G. Krishnan , Dawen Liang , Matthew Hoffman

Learning Robust Convolutional Neural Networks with Relevant Feature Focusing via Explanations

Existing image recognition techniques based on convolutional neural networks (CNNs) basically assume that the training and test datasets are sampled from i.i.d distributions. However, this assumption is easily broken in the real world…

Computer Vision and Pattern Recognition · Computer Science 2025-12-03 Kazuki Adachi , Shin'ya Yamaguchi