Related papers: Tensor Dropout for Robust Learning

Machine learning with tree tensor networks, CP rank constraints, and tensor dropout

Tensor networks developed in the context of condensed matter physics try to approximate order-$N$ tensors with a reduced number of degrees of freedom that is only polynomial in $N$ and arranged as a network of partially contracted smaller…

Machine Learning · Computer Science 2025-01-07 Hao Chen , Thomas Barthel

SelectScale: Mining More Patterns from Images via Selective and Soft Dropout

Convolutional neural networks (CNNs) have achieved remarkable success in image recognition. Although the internal patterns of the input images are effectively learned by the CNNs, these patterns only constitute a small proportion of useful…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Zhengsu Chen , Jianwei Niu , Xuefeng Liu , Shaojie Tang

Tensor-Train Recurrent Neural Networks for Video Classification

The Recurrent Neural Networks and their variants have shown promising performances in sequence modeling tasks such as Natural Language Processing. These models, however, turn out to be impractical and difficult to train when exposed to very…

Computer Vision and Pattern Recognition · Computer Science 2017-07-07 Yinchong Yang , Denis Krompass , Volker Tresp

Convolutional neural networks with low-rank regularization

Large CNNs have delivered impressive performance in various computer vision applications. But the storage and computation requirements make it problematic for deploying these models on mobile devices. Recently, tensor decompositions have…

Machine Learning · Computer Science 2016-02-16 Cheng Tai , Tong Xiao , Yi Zhang , Xiaogang Wang , Weinan E

Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression

We propose tensorial neural networks (TNNs), a generalization of existing neural networks by extending tensor operations on low order operands to those on high order ones. The problem of parameter learning is challenging, as it corresponds…

Machine Learning · Statistics 2018-12-11 Jiahao Su , Jingling Li , Bobby Bhattacharjee , Furong Huang

Informative Dropout for Robust Representation Learning: A Shape-bias Perspective

Convolutional Neural Networks (CNNs) are known to rely more on local texture rather than global shape when making decisions. Recent work also indicates a close relationship between CNN's texture-bias and its robustness against distribution…

Machine Learning · Computer Science 2020-08-11 Baifeng Shi , Dinghuai Zhang , Qi Dai , Zhanxing Zhu , Yadong Mu , Jingdong Wang

Tensor network compressibility of convolutional models

Convolutional neural networks (CNNs) are one of the most widely used neural network architectures, showcasing state-of-the-art performance in computer vision tasks. Although larger CNNs generally exhibit higher accuracy, their size can be…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Sukhbinder Singh , Saeed S. Jahromi , Roman Orus

Exploiting Elasticity in Tensor Ranks for Compressing Neural Networks

Elasticities in depth, width, kernel size and resolution have been explored in compressing deep neural networks (DNNs). Recognizing that the kernels in a convolutional neural network (CNN) are 4-way tensors, we further exploit a new…

Machine Learning · Computer Science 2021-05-11 Jie Ran , Rui Lin , Hayden K. H. So , Graziano Chesi , Ngai Wong

Tensor Regression Networks with various Low-Rank Tensor Approximations

Tensor regression networks achieve high compression rate of neural networks while having slight impact on performances. They do so by imposing low tensor rank structure on the weight matrices of fully connected layers. In recent years,…

Machine Learning · Computer Science 2018-11-30 Xingwei Cao , Guillaume Rabusseau

Defensive Tensorization

We propose defensive tensorization, an adversarial defence technique that leverages a latent high-order factorization of the network. The layers of a network are first expressed as factorized tensor layers. Tensor dropout is then applied in…

Machine Learning · Computer Science 2021-10-27 Adrian Bulat , Jean Kossaifi , Sourav Bhattacharya , Yannis Panagakis , Timothy Hospedales , Georgios Tzimiropoulos , Nicholas D Lane , Maja Pantic

Adaptive Learning of Tensor Network Structures

Tensor Networks (TN) offer a powerful framework to efficiently represent very high-dimensional objects. TN have recently shown their potential for machine learning applications and offer a unifying view of common tensor decomposition models…

Machine Learning · Computer Science 2021-06-24 Meraj Hashemizadeh , Michelle Liu , Jacob Miller , Guillaume Rabusseau

Bayesian Tensorized Neural Networks with Automatic Rank Selection

Tensor decomposition is an effective approach to compress over-parameterized neural networks and to enable their deployment on resource-constrained hardware platforms. However, directly applying tensor compression in the training process is…

Machine Learning · Computer Science 2019-05-28 Cole Hawkins , Zheng Zhang

A Fully Tensorized Recurrent Neural Network

Recurrent neural networks (RNNs) are powerful tools for sequential modeling, but typically require significant overparameterization and regularization to achieve optimal performance. This leads to difficulties in the deployment of large…

Machine Learning · Computer Science 2021-11-11 Charles C. Onu , Jacob E. Miller , Doina Precup

Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks

Modern neural networks are usually highly over-parameterized. Behind the wide usage of over-parameterized networks is the belief that, if the data are simple, then the trained network will be automatically equivalent to a simple predictor.…

Machine Learning · Statistics 2025-04-14 Chenyang Zhang , Peifeng Gao , Difan Zou , Yuan Cao

Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing

This work focuses on designing low complexity hybrid tensor networks by considering trade-offs between the model complexity and practical performance. Firstly, we exploit a low-rank tensor-train deep neural network (TT-DNN) to build an…

Machine Learning · Computer Science 2022-03-14 Jun Qi , Chao-Han Huck Yang , Pin-Yu Chen , Javier Tejedor

Variational Tensor Neural Networks for Deep Learning

Deep neural networks (NNs) encounter scalability limitations when confronted with a vast array of neurons, thereby constraining their achievable network depth. To address this challenge, we propose an integration of tensor networks (TN)…

Disordered Systems and Neural Networks · Physics 2024-08-20 Saeed S. Jahromi , Roman Orus

Quantum-Inspired Robust and Scalable SAR Object Classification

SAR image classification naturally has to deal with huge noise and a high dynamic range particularly requiring robust classification models. Additionally, the deployment of these models on edge devices, such as drones and military aircraft,…

Quantum Physics · Physics 2026-05-25 Maximilian Scharf , Marco Trenti , Felix Bock , Padraig Davidson , Tobias Brosch , Benjamin Rodrigues de Miranda , Sigurd Huber , Timo Felser

Fraternal Dropout

Recurrent neural networks (RNNs) are important class of architectures among neural networks useful for language modeling and sequential prediction. However, optimizing RNNs is known to be harder compared to feed-forward neural networks. A…

Machine Learning · Statistics 2018-03-29 Konrad Zolna , Devansh Arpit , Dendi Suhubdy , Yoshua Bengio

FocusedDropout for Convolutional Neural Network

In convolutional neural network (CNN), dropout cannot work well because dropped information is not entirely obscured in convolutional layers where features are correlated spatially. Except randomly discarding regions or channels, many…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Tianshu Xie , Minghui Liu , Jiali Deng , Xuan Cheng , Xiaomin Wang , Ming Liu

Low-Rank Tensor Decompositions for the Theory of Neural Networks

The groundbreaking performance of deep neural networks (NNs) promoted a surge of interest in providing a mathematical basis to deep learning theory. Low-rank tensor decompositions are specially befitting for this task due to their close…

Machine Learning · Computer Science 2025-12-18 Ricardo Borsoi , Konstantin Usevich , Marianne Clausel