Related papers: Deep RNNs Encode Soft Hierarchical Syntax

How Deep Neural Networks Learn Compositional Data: The Random Hierarchy Model

Deep learning algorithms demonstrate a surprising ability to learn high-dimensional tasks from limited examples. This is commonly attributed to the depth of neural networks, enabling them to build a hierarchy of abstract, low-dimensional…

Machine Learning · Computer Science 2024-07-04 Francesco Cagnetta , Leonardo Petrini , Umberto M. Tomasini , Alessandro Favero , Matthieu Wyart

Learning Robust, Transferable Sentence Representations for Text Classification

Despite deep recurrent neural networks (RNNs) demonstrate strong performance in text classification, training RNN models are often expensive and requires an extensive collection of annotated data which may not be available. To overcome the…

Computation and Language · Computer Science 2018-10-02 Wasi Uddin Ahmad , Xueying Bai , Nanyun Peng , Kai-Wei Chang

Recurrent Hierarchical Topic-Guided RNN for Language Generation

To simultaneously capture syntax and global semantics from a text corpus, we propose a new larger-context recurrent neural network (RNN) based language model, which extracts recurrent hierarchical semantic structure via a dynamic deep topic…

Computation and Language · Computer Science 2020-06-30 Dandan Guo , Bo Chen , Ruiying Lu , Mingyuan Zhou

Distilling a Neural Network Into a Soft Decision Tree

Deep neural networks have proved to be a very effective way to perform classification tasks. They excel when the input data is high dimensional, the relationship between the input and the output is complicated, and the number of labeled…

Machine Learning · Computer Science 2017-11-28 Nicholas Frosst , Geoffrey Hinton

Towards Visual Syntactical Understanding

Syntax is usually studied in the realm of linguistics and refers to the arrangement of words in a sentence. Similarly, an image can be considered as a visual 'sentence', with the semantic parts of the image acting as 'words'. While visual…

Computer Vision and Pattern Recognition · Computer Science 2024-02-01 Sayeed Shafayet Chowdhury , Soumyadeep Chandra , Kaushik Roy

How Deep Networks Learn Sparse and Hierarchical Data: the Sparse Random Hierarchy Model

Understanding what makes high-dimensional data learnable is a fundamental question in machine learning. On the one hand, it is believed that the success of deep learning lies in its ability to build a hierarchy of representations that…

Machine Learning · Statistics 2024-05-03 Umberto Tomasini , Matthieu Wyart

Finding Syntactic Representations in Neural Stacks

Neural network architectures have been augmented with differentiable stacks in order to introduce a bias toward learning hierarchy-sensitive regularities. It has, however, proven difficult to assess the degree to which such a bias is…

Computation and Language · Computer Science 2019-06-05 William Merrill , Lenny Khazan , Noah Amsel , Yiding Hao , Simon Mendelsohn , Robert Frank

Deep Networks Learn Deep Hierarchical Models

We consider supervised learning with $n$ labels and show that layerwise SGD on residual networks can efficiently learn a class of hierarchical models. This model class assumes the existence of an (unknown) label hierarchy $L_1 \subseteq L_2…

Machine Learning · Computer Science 2026-01-05 Amit Daniely

A Self-supervised Representation Learning of Sentence Structure for Authorship Attribution

Syntactic structure of sentences in a document substantially informs about its authorial writing style. Sentence representation learning has been widely explored in recent years and it has been shown that it improves the generalization of…

Computation and Language · Computer Science 2022-02-25 Fereshteh Jafariakinabad , Kien A. Hua

Provable Learning of Random Hierarchy Models and Hierarchical Shallow-to-Deep Chaining

The empirical success of deep learning is often attributed to deep networks' ability to exploit hierarchical structure in data, constructing increasingly complex features across layers. Yet despite substantial progress in deep learning…

Machine Learning · Computer Science 2026-01-28 Yunwei Ren , Yatin Dandi , Florent Krzakala , Jason D. Lee

Hierarchical Scene Parsing by Weakly Supervised Learning with Image Descriptions

This paper investigates a fundamental problem of scene understanding: how to parse a scene image into a structured configuration (i.e., a semantic object hierarchy with object interaction relations). We propose a deep architecture…

Computer Vision and Pattern Recognition · Computer Science 2018-01-30 Ruimao Zhang , Liang Lin , Guangrun Wang , Meng Wang , Wangmeng Zuo

Unsupervised Recurrent Neural Network Grammars

Recurrent neural network grammars (RNNG) are generative models of language which jointly model syntax and surface structure by incrementally generating a syntax tree and sentence in a top-down, left-to-right order. Supervised RNNGs achieve…

Computation and Language · Computer Science 2019-08-06 Yoon Kim , Alexander M. Rush , Lei Yu , Adhiguna Kuncoro , Chris Dyer , Gábor Melis

Revisiting the poverty of the stimulus: hierarchical generalization without a hierarchical bias in recurrent neural networks

Syntactic rules in natural language typically need to make reference to hierarchical sentence structure. However, the simple examples that language learners receive are often equally compatible with linear rules. Children consistently…

Computation and Language · Computer Science 2018-06-11 R. Thomas McCoy , Robert Frank , Tal Linzen

Syntactic Perturbations Reveal Representational Correlates of Hierarchical Phrase Structure in Pretrained Language Models

While vector-based language representations from pretrained language models have set a new standard for many NLP tasks, there is not yet a complete accounting of their inner workings. In particular, it is not entirely clear what aspects of…

Computation and Language · Computer Science 2021-04-16 Matteo Alleman , Jonathan Mamou , Miguel A Del Rio , Hanlin Tang , Yoon Kim , SueYeon Chung

Randomized Deep Structured Prediction for Discourse-Level Processing

Expressive text encoders such as RNNs and Transformer Networks have been at the center of NLP models in recent work. Most of the effort has focused on sentence-level tasks, capturing the dependencies between words in a single sentence, or…

Computation and Language · Computer Science 2021-09-15 Manuel Widmoser , Maria Leonor Pacheco , Jean Honorio , Dan Goldwasser

A Hierarchical Multi-task Approach for Learning Embeddings from Semantic Tasks

Much effort has been devoted to evaluate whether multi-task learning can be leveraged to learn rich representations that can be used in various Natural Language Processing (NLP) down-stream applications. However, there is still a lack of…

Computation and Language · Computer Science 2018-11-27 Victor Sanh , Thomas Wolf , Sebastian Ruder

Syntax-aware Neural Semantic Role Labeling

Semantic role labeling (SRL), also known as shallow semantic parsing, is an important yet challenging task in NLP. Motivated by the close correlation between syntactic and semantic structures, traditional discrete-feature-based SRL…

Computation and Language · Computer Science 2019-07-23 Qingrong Xia , Zhenghua Li , Min Zhang , Meishan Zhang , Guohong Fu , Rui Wang , Luo Si

Towards Analyzing Semantic Robustness of Deep Neural Networks

Despite the impressive performance of Deep Neural Networks (DNNs) on various vision tasks, they still exhibit erroneous high sensitivity toward semantic primitives (e.g. object pose). We propose a theoretically grounded analysis for DNN…

Computer Vision and Pattern Recognition · Computer Science 2021-01-19 Abdullah Hamdi , Bernard Ghanem

What Do Recurrent Neural Network Grammars Learn About Syntax?

Recurrent neural network grammars (RNNG) are a recently proposed probabilistic generative modeling family for natural language. They show state-of-the-art language modeling and parsing performance. We investigate what information they…

Computation and Language · Computer Science 2017-01-12 Adhiguna Kuncoro , Miguel Ballesteros , Lingpeng Kong , Chris Dyer , Graham Neubig , Noah A. Smith

Internal representation dynamics and geometry in recurrent neural networks

The efficiency of recurrent neural networks (RNNs) in dealing with sequential data has long been established. However, unlike deep, and convolution networks where we can attribute the recognition of a certain feature to every layer, it is…

Machine Learning · Computer Science 2020-01-15 Stefan Horoi , Guillaume Lajoie , Guy Wolf