Related papers: Learning Structured Text Representations

Evaluating Discourse in Structured Text Representations

Discourse structure is integral to understanding a text and is helpful in many NLP tasks. Learning latent representations of discourse is an attractive alternative to acquiring expensive labeled discourse data. Liu and Lapata (2018) propose…

Computation and Language · Computer Science · 2019-06-11 · Elisa Ferracane, Greg Durrett, Junyi Jessy Li, Katrin Erk

Structure by Architecture: Structured Representations without Regularization

We study the problem of self-supervised structured representation learning using autoencoders for downstream tasks such as generative modeling. Unlike most methods which rely on matching an arbitrary, relatively unstructured, prior…

Machine Learning · Computer Science · 2024-02-16 · Felix Leeb, Giulia Lanzillotta, Yashas Annadani, Michel Besserve, Stefan Bauer, Bernhard Schölkopf

Interpretable Structure-aware Document Encoders with Hierarchical Attention

We propose a method to create document representations that reflect their internal structure. We modify Tree-LSTMs to hierarchically merge basic elements such as words and sentences into blocks of increasing complexity. Our Structure…

Computation and Language · Computer Science · 2019-10-08 · Khalil Mrini, Claudiu Musat, Michael Baeriswyl, Martin Jaggi

Structured Attention for Unsupervised Dialogue Structure Induction

Inducing a meaningful structural representation from one or a set of dialogues is a crucial but challenging task in computational linguistics. Advancement made in this area is critical for dialogue system design and discourse analysis. It…

Computation and Language · Computer Science · 2021-03-15 · Liang Qiu, Yizhou Zhao, Weiyan Shi, Yuan Liang, Feng Shi, Tao Yuan, Zhou Yu, Song-Chun Zhu

Modelling the semantics of text in complex document layouts using graph transformer networks

Representing structured text from complex documents typically calls for different machine learning techniques, such as language models for paragraphs and convolutional neural networks (CNNs) for table extraction, which prohibits drawing…

Computation and Language · Computer Science · 2022-02-21 · Thomas Roland Barillot, Jacob Saks, Polena Lilyanova, Edward Torgas, Yachen Hu, Yuanqing Liu, Varun Balupuri, Paul Gaskell

Neural Discourse Structure for Text Categorization

We show that discourse structure, as defined by Rhetorical Structure Theory and provided by an existing discourse parser, benefits text categorization. Our approach uses a recursive neural network and a newly proposed attention mechanism to…

Computation and Language · Computer Science · 2017-05-09 · Yangfeng Ji, Noah Smith

Learning Structured Natural Language Representations for Semantic Parsing

We introduce a neural semantic parser that converts natural language utterances to intermediate representations in the form of predicate-argument structures, which are induced with a transition system and subsequently mapped to target…

Computation and Language · Computer Science · 2017-06-15 · Jianpeng Cheng, Siva Reddy, Vijay Saraswat, Mirella Lapata

Unsupervised Inference of Data-Driven Discourse Structures using a Tree Auto-Encoder

With a growing need for robust and general discourse structures in many downstream tasks and real-world applications, the current lack of high-quality, high-quantity discourse trees poses a severe shortcoming. In order to alleviate this…

Computation and Language · Computer Science · 2022-10-19 · Patrick Huber, Giuseppe Carenini

A document is worth a structured record: Principled inductive bias design for document recognition

Many document types use intrinsic, convention-driven structures that serve to encode precise and structured information, such as the conventions governing engineering drawings. However, many state-of-the-art approaches treat document…

Computer Vision and Pattern Recognition · Computer Science · 2026-04-15 · Benjamin Meyer, Lukas Tuggener, Sascha Hänzi, Daniel Schmid, Erdal Ayfer, Benjamin F. Grewe, Ahmed Abdulkadir, Thilo Stadelmann

Structure-Regularized Attention for Deformable Object Representation

Capturing contextual dependencies has proven useful to improve the representational power of deep neural networks. Recent approaches that focus on modeling global context, such as self-attention and non-local operation, achieve this goal by…

Computer Vision and Pattern Recognition · Computer Science · 2021-06-15 · Shenao Zhang, Li Shen, Zhifeng Li, Wei Liu

Learning Structured Latent Factors from Dependent Data: A Generative Model Framework from Information-Theoretic Perspective

Learning controllable and generalizable representation of multivariate data with desired structural properties remains a fundamental problem in machine learning. In this paper, we present a novel framework for learning generative models…

Machine Learning · Computer Science · 2020-10-05 · Ruixiang Zhang, Masanori Koyama, Katsuhiko Ishiguro

Cutting Recursive Autoencoder Trees

Deep Learning models enjoy considerable success in Natural Language Processing. While deep architectures produce useful representations that lead to improvements in various tasks, they are often difficult to interpret. This makes the…

Computation and Language · Computer Science · 2013-04-29 · Christian Scheible, Hinrich Schuetze

Structured Attention Networks

Attention networks have proven to be an effective approach for embedding categorical inference within a deep neural network. However, for many tasks we may want to model richer structural dependencies without abandoning end-to-end training.…

Computation and Language · Computer Science · 2017-02-17 · Yoon Kim, Carl Denton, Luong Hoang, Alexander M. Rush

Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning

In this work, we aim at equipping pre-trained language models with structured knowledge. We present two self-supervised tasks learning over raw text with the guidance from knowledge graphs. Building upon entity-level masked language models,…

Computation and Language · Computer Science · 2020-04-30 · Tao Shen, Yi Mao, Pengcheng He, Guodong Long, Adam Trischler, Weizhu Chen

Discrete Latent Structure in Neural Networks

Many types of data from fields including natural language processing, computer vision, and bioinformatics, are well represented by discrete, compositional structures such as trees, sequences, or matchings. Latent structure models are a…

Machine Learning · Computer Science · 2026-02-04 · Vlad Niculae, Caio F. Corro, Nikita Nangia, Tsvetomila Mihaylova, André F. T. Martins

Learning to Compose Words into Sentences with Reinforcement Learning

We use reinforcement learning to learn tree-structured neural networks for computing representations of natural language sentences. In contrast with prior work on tree-structured models in which the trees are either provided as input or…

Computation and Language · Computer Science · 2016-11-29 · Dani Yogatama, Phil Blunsom, Chris Dyer, Edward Grefenstette, Wang Ling

Language Model Pre-training for Hierarchical Document Representations

Hierarchical neural architectures are often used to capture long-distance dependencies and have been applied to many document-level tasks such as summarization, document segmentation, and sentiment analysis. However, effective usage of such…

Computation and Language · Computer Science · 2019-01-29 · Ming-Wei Chang, Kristina Toutanova, Kenton Lee, Jacob Devlin

StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure

This work presents StrAE: a Structured Autoencoder framework that through strict adherence to explicit structure, and use of a novel contrastive objective over tree-structured representations, enables effective learning of multi-level…

Computation and Language · Computer Science · 2025-02-25 · Mattia Opper, Victor Prokhorov, N. Siddharth

DRTS Parsing with Structure-Aware Encoding and Decoding

Discourse representation tree structure (DRTS) parsing is a novel semantic parsing task which has been concerned most recently. State-of-the-art performance can be achieved by a neural sequence-to-sequence model, treating the tree…

Computation and Language · Computer Science · 2020-05-15 · Qiankun Fu, Yue Zhang, Jiangming Liu, Meishan Zhang

A Self-supervised Representation Learning of Sentence Structure for Authorship Attribution

Syntactic structure of sentences in a document substantially informs about its authorial writing style. Sentence representation learning has been widely explored in recent years and it has been shown that it improves the generalization of…

Computation and Language · Computer Science · 2022-02-25 · Fereshteh Jafariakinabad, Kien A. Hua