Related papers: Learning Compositional Latent Structure with Vecto…

Towards Deep Compositional Networks

Hierarchical feature learning based on convolutional neural networks (CNN) has recently shown significant potential in various computer vision tasks. While allowing high-quality discriminative feature learning, the downside of CNNs is the…

Computer Vision and Pattern Recognition · Computer Science 2016-09-14 Domen Tabernik , Matej Kristan , Jeremy L. Wyatt , Aleš Leonardis

How Deep Neural Networks Learn Compositional Data: The Random Hierarchy Model

Deep learning algorithms demonstrate a surprising ability to learn high-dimensional tasks from limited examples. This is commonly attributed to the depth of neural networks, enabling them to build a hierarchy of abstract, low-dimensional…

Machine Learning · Computer Science 2024-07-04 Francesco Cagnetta , Leonardo Petrini , Umberto M. Tomasini , Alessandro Favero , Matthieu Wyart

A Deep Network with Visual Text Composition Behavior

While natural languages are compositional, how state-of-the-art neural models achieve compositionality is still unclear. We propose a deep network, which not only achieves competitive accuracy for text classification, but also exhibits…

Computation and Language · Computer Science 2017-07-07 Hongyu Guo

Combine Convolution with Recurrent Networks for Text Classification

Convolutional neural network (CNN) and recurrent neural network (RNN) are two popular architectures used in text classification. Traditional methods to combine the strengths of the two networks rely on streamlining them or concatenating…

Computation and Language · Computer Science 2020-06-30 Shengfei Lyu , Jiaqi Liu

Investigating the Compositional Structure Of Deep Neural Networks

The current understanding of deep neural networks can only partially explain how input structure, network parameters and optimization algorithms jointly contribute to achieve the strong generalization power that is typically observed in…

Machine Learning · Computer Science 2021-01-28 Francesco Craighero , Fabrizio Angaroni , Alex Graudenzi , Fabio Stella , Marco Antoniotti

Dynamic Compositional Neural Networks over Tree Structure

Tree-structured neural networks have proven to be effective in learning semantic representations by exploiting syntactic information. In spite of their success, most existing models suffer from the underfitting problem: they recursively use…

Computation and Language · Computer Science 2017-05-12 Pengfei Liu , Xipeng Qiu , Xuanjing Huang

KVNN: Learnable Multi-Kernel Volterra Neural Networks

Higher-order learning is fundamentally rooted in exploiting compositional features. It clearly hinges on enriching the representation by more elaborate interactions of the data which, in turn, tends to increase the model complexity of…

Computer Vision and Pattern Recognition · Computer Science 2026-04-17 Haoyu Yun , Hamid Krim , Yufang Bao

Self-Attention Based Semantic Decomposition in Vector Symbolic Architectures

Vector Symbolic Architectures (VSAs) have emerged as a novel framework for enabling interpretable machine learning algorithms equipped with the ability to reason and explain their decision processes. The basic idea is to represent discrete…

Computer Vision and Pattern Recognition · Computer Science 2024-03-21 Calvin Yeung , Prathyush Poduval , Mohsen Imani

Attention as a Hypernetwork

Transformers can under some circumstances generalize to novel problem instances whose constituent parts might have been encountered during training, but whose compositions have not. What mechanisms underlie this ability for compositional…

Machine Learning · Computer Science 2025-02-18 Simon Schug , Seijin Kobayashi , Yassir Akram , João Sacramento , Razvan Pascanu

Deep Complex Networks

At present, the vast majority of building blocks, techniques, and architectures for deep learning are based on real-valued operations and representations. However, recent work on recurrent neural networks and older fundamental theoretical…

Neural and Evolutionary Computing · Computer Science 2018-02-27 Chiheb Trabelsi , Olexa Bilaniuk , Ying Zhang , Dmitriy Serdyuk , Sandeep Subramanian , João Felipe Santos , Soroush Mehri , Negar Rostamzadeh , Yoshua Bengio , Christopher J Pal

Covariant Compositional Networks For Learning Graphs

Most existing neural networks for learning graphs address permutation invariance by conceiving of the network as a message passing scheme, where each node sums the feature vectors coming from its neighbors. We argue that this imposes a…

Machine Learning · Computer Science 2018-01-09 Risi Kondor , Hy Truong Son , Horace Pan , Brandon Anderson , Shubhendu Trivedi

Deep Virtual Networks for Memory Efficient Inference of Multiple Tasks

Deep networks consume a large amount of memory by their nature. A natural question arises can we reduce that memory requirement whilst maintaining performance. In particular, in this work we address the problem of memory efficient learning…

Computer Vision and Pattern Recognition · Computer Science 2019-04-10 Eunwoo Kim , Chanho Ahn , Philip H. S. Torr , Songhwai Oh

Learning Neural Network Classifiers with Low Model Complexity

Modern neural network architectures for large-scale learning tasks have substantially higher model complexities, which makes understanding, visualizing and training these architectures difficult. Recent contributions to deep learning…

Machine Learning · Computer Science 2024-10-30 Jayadeva , Himanshu Pant , Mayank Sharma , Abhimanyu Dubey , Sumit Soman , Suraj Tripathi , Sai Guruju , Nihal Goalla

Latent Code-Based Fusion: A Volterra Neural Network Approach

We propose a deep structure encoder using the recently introduced Volterra Neural Networks (VNNs) to seek a latent representation of multi-modal data whose features are jointly captured by a union of subspaces. The so-called…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 Sally Ghanem , Siddharth Roheda , Hamid Krim

Deep learning architectures for nonlinear operator functions and nonlinear inverse problems

We develop a theoretical analysis for special neural network architectures, termed operator recurrent neural networks, for approximating nonlinear functions whose inputs are linear operators. Such functions commonly arise in solution…

Optimization and Control · Mathematics 2022-01-05 Maarten V. de Hoop , Matti Lassas , Christopher A. Wong

Deep Variable-Block Chain with Adaptive Variable Selection

The architectures of deep neural networks (DNN) rely heavily on the underlying grid structure of variables, for instance, the lattice of pixels in an image. For general high dimensional data with variables not associated with a grid, the…

Machine Learning · Statistics 2024-08-07 Lixiang Zhang , Lin Lin , Jia Li

A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music

The Variational Autoencoder (VAE) has proven to be an effective model for producing semantically meaningful latent representations for natural data. However, it has thus far seen limited application to sequential data, and, as we…

Machine Learning · Computer Science 2019-11-12 Adam Roberts , Jesse Engel , Colin Raffel , Curtis Hawthorne , Douglas Eck

Compositional Sparsity as an Inductive Bias for Neural Architecture Design

Identifying the structural priors that enable Deep Neural Networks (DNNs) to overcome the curse of dimensionality is a fundamental challenge in machine learning theory. Existing literature suggests that effective high-dimensional learning…

Machine Learning · Computer Science 2026-05-15 Hongyu Lin , Antonio Briola , Yuanrong Wang , Tomaso Aste

Rethinking Recurrent Latent Variable Model for Music Composition

We present a model for capturing musical features and creating novel sequences of music, called the Convolutional Variational Recurrent Neural Network. To generate sequential data, the model uses an encoder-decoder architecture with latent…

Sound · Computer Science 2018-10-09 Eunjeong Stella Koh , Shlomo Dubnov , Dustin Wright

Discovering the Compositional Structure of Vector Representations with Role Learning Networks

How can neural networks perform so well on compositional tasks even though they lack explicit compositional representations? We use a novel analysis technique called ROLE to show that recurrent neural networks perform well on such tasks by…

Machine Learning · Computer Science 2023-02-10 Paul Soulos , Tom McCoy , Tal Linzen , Paul Smolensky