English
Related papers

Related papers: Tensor Dropout for Robust Learning

200 papers

Tensor networks developed in the context of condensed matter physics try to approximate order-$N$ tensors with a reduced number of degrees of freedom that is only polynomial in $N$ and arranged as a network of partially contracted smaller…

Machine Learning · Computer Science 2025-01-07 Hao Chen , Thomas Barthel

Convolutional neural networks (CNNs) have achieved remarkable success in image recognition. Although the internal patterns of the input images are effectively learned by the CNNs, these patterns only constitute a small proportion of useful…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Zhengsu Chen , Jianwei Niu , Xuefeng Liu , Shaojie Tang

The Recurrent Neural Networks and their variants have shown promising performances in sequence modeling tasks such as Natural Language Processing. These models, however, turn out to be impractical and difficult to train when exposed to very…

Computer Vision and Pattern Recognition · Computer Science 2017-07-07 Yinchong Yang , Denis Krompass , Volker Tresp

Large CNNs have delivered impressive performance in various computer vision applications. But the storage and computation requirements make it problematic for deploying these models on mobile devices. Recently, tensor decompositions have…

Machine Learning · Computer Science 2016-02-16 Cheng Tai , Tong Xiao , Yi Zhang , Xiaogang Wang , Weinan E

We propose tensorial neural networks (TNNs), a generalization of existing neural networks by extending tensor operations on low order operands to those on high order ones. The problem of parameter learning is challenging, as it corresponds…

Machine Learning · Statistics 2018-12-11 Jiahao Su , Jingling Li , Bobby Bhattacharjee , Furong Huang

Convolutional Neural Networks (CNNs) are known to rely more on local texture rather than global shape when making decisions. Recent work also indicates a close relationship between CNN's texture-bias and its robustness against distribution…

Machine Learning · Computer Science 2020-08-11 Baifeng Shi , Dinghuai Zhang , Qi Dai , Zhanxing Zhu , Yadong Mu , Jingdong Wang

Convolutional neural networks (CNNs) are one of the most widely used neural network architectures, showcasing state-of-the-art performance in computer vision tasks. Although larger CNNs generally exhibit higher accuracy, their size can be…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Sukhbinder Singh , Saeed S. Jahromi , Roman Orus

Elasticities in depth, width, kernel size and resolution have been explored in compressing deep neural networks (DNNs). Recognizing that the kernels in a convolutional neural network (CNN) are 4-way tensors, we further exploit a new…

Machine Learning · Computer Science 2021-05-11 Jie Ran , Rui Lin , Hayden K. H. So , Graziano Chesi , Ngai Wong

Tensor regression networks achieve high compression rate of neural networks while having slight impact on performances. They do so by imposing low tensor rank structure on the weight matrices of fully connected layers. In recent years,…

Machine Learning · Computer Science 2018-11-30 Xingwei Cao , Guillaume Rabusseau

We propose defensive tensorization, an adversarial defence technique that leverages a latent high-order factorization of the network. The layers of a network are first expressed as factorized tensor layers. Tensor dropout is then applied in…

Tensor Networks (TN) offer a powerful framework to efficiently represent very high-dimensional objects. TN have recently shown their potential for machine learning applications and offer a unifying view of common tensor decomposition models…

Machine Learning · Computer Science 2021-06-24 Meraj Hashemizadeh , Michelle Liu , Jacob Miller , Guillaume Rabusseau

Tensor decomposition is an effective approach to compress over-parameterized neural networks and to enable their deployment on resource-constrained hardware platforms. However, directly applying tensor compression in the training process is…

Machine Learning · Computer Science 2019-05-28 Cole Hawkins , Zheng Zhang

Recurrent neural networks (RNNs) are powerful tools for sequential modeling, but typically require significant overparameterization and regularization to achieve optimal performance. This leads to difficulties in the deployment of large…

Machine Learning · Computer Science 2021-11-11 Charles C. Onu , Jacob E. Miller , Doina Precup

Modern neural networks are usually highly over-parameterized. Behind the wide usage of over-parameterized networks is the belief that, if the data are simple, then the trained network will be automatically equivalent to a simple predictor.…

Machine Learning · Statistics 2025-04-14 Chenyang Zhang , Peifeng Gao , Difan Zou , Yuan Cao

This work focuses on designing low complexity hybrid tensor networks by considering trade-offs between the model complexity and practical performance. Firstly, we exploit a low-rank tensor-train deep neural network (TT-DNN) to build an…

Machine Learning · Computer Science 2022-03-14 Jun Qi , Chao-Han Huck Yang , Pin-Yu Chen , Javier Tejedor

Deep neural networks (NNs) encounter scalability limitations when confronted with a vast array of neurons, thereby constraining their achievable network depth. To address this challenge, we propose an integration of tensor networks (TN)…

Disordered Systems and Neural Networks · Physics 2024-08-20 Saeed S. Jahromi , Roman Orus

SAR image classification naturally has to deal with huge noise and a high dynamic range particularly requiring robust classification models. Additionally, the deployment of these models on edge devices, such as drones and military aircraft,…

Recurrent neural networks (RNNs) are important class of architectures among neural networks useful for language modeling and sequential prediction. However, optimizing RNNs is known to be harder compared to feed-forward neural networks. A…

Machine Learning · Statistics 2018-03-29 Konrad Zolna , Devansh Arpit , Dendi Suhubdy , Yoshua Bengio

In convolutional neural network (CNN), dropout cannot work well because dropped information is not entirely obscured in convolutional layers where features are correlated spatially. Except randomly discarding regions or channels, many…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Tianshu Xie , Minghui Liu , Jiali Deng , Xuan Cheng , Xiaomin Wang , Ming Liu

The groundbreaking performance of deep neural networks (NNs) promoted a surge of interest in providing a mathematical basis to deep learning theory. Low-rank tensor decompositions are specially befitting for this task due to their close…

Machine Learning · Computer Science 2025-12-18 Ricardo Borsoi , Konstantin Usevich , Marianne Clausel
‹ Prev 1 2 3 10 Next ›