Related papers: Cutting Recursive Autoencoder Trees

Bidirectional Recursive Neural Networks for Token-Level Labeling with Structure

Recently, deep architectures, such as recurrent and recursive neural networks, have been successfully applied to various natural language processing tasks. Inspired by bidirectional recurrent neural networks which use representations that…

Machine Learning · Computer Science 2013-12-03 Ozan İrsoy , Claire Cardie

Less is More: Pre-train a Strong Text Encoder for Dense Retrieval Using a Weak Decoder

Dense retrieval requires high-quality text sequence embeddings to support effective search in the representation space. Autoencoder-based language models are appealing in dense retrieval as they train the encoder to output high-quality…

Machine Learning · Computer Science 2021-09-17 Shuqi Lu , Di He , Chenyan Xiong , Guolin Ke , Waleed Malik , Zhicheng Dou , Paul Bennett , Tieyan Liu , Arnold Overwijk

Can Transformers Learn to Solve Problems Recursively?

Neural networks have in recent years shown promise for helping software engineers write programs and even formally verify them. While semantic information plays a crucial part in these processes, it remains unclear to what degree popular…

Machine Learning · Computer Science 2023-06-27 Shizhuo Dylan Zhang , Curt Tigges , Stella Biderman , Maxim Raginsky , Talia Ringer

Learning Disentangled Representations for Natural Language Definitions

Disentangling the encodings of neural models is a fundamental aspect for improving interpretability, semantic control and downstream task performance in Natural Language Processing. Currently, most disentanglement methods are unsupervised…

Computation and Language · Computer Science 2023-02-17 Danilo S. Carvalho , Giangiacomo Mercatali , Yingji Zhang , Andre Freitas

Recent Advances in Autoencoder-Based Representation Learning

Learning useful representations with little or no supervision is a key challenge in artificial intelligence. We provide an in-depth review of recent advances in representation learning with a focus on autoencoder-based models. To organize…

Machine Learning · Computer Science 2018-12-13 Michael Tschannen , Olivier Bachem , Mario Lucic

Supervised Dimensionality Reduction and Image Classification Utilizing Convolutional Autoencoders

The joint optimization of the reconstruction and classification error is a hard non convex problem, especially when a non linear mapping is utilized. In order to overcome this obstacle, a novel optimization strategy is proposed, in which a…

Machine Learning · Computer Science 2022-11-07 Ioannis A. Nellas , Sotiris K. Tasoulis , Vassilis P. Plagianakos , Spiros V. Georgakopoulos

Discriminative Autoencoder for Feature Extraction: Application to Character Recognition

Conventionally, autoencoders are unsupervised representation learning tools. In this work, we propose a novel discriminative autoencoder. Use of supervised discriminative learning ensures that the learned representation is robust to…

Computer Vision and Pattern Recognition · Computer Science 2019-12-30 Anupriya Gogna , Angshul Majumdar

Discrete Autoencoders for Sequence Models

Recurrent models for sequences have been recently successful at many tasks, especially for language modeling and machine translation. Nevertheless, it remains challenging to extract good representations from these models. For instance, even…

Machine Learning · Computer Science 2018-01-31 Łukasz Kaiser , Samy Bengio

Structure by Architecture: Structured Representations without Regularization

We study the problem of self-supervised structured representation learning using autoencoders for downstream tasks such as generative modeling. Unlike most methods which rely on matching an arbitrary, relatively unstructured, prior…

Machine Learning · Computer Science 2024-02-16 Felix Leeb , Giulia Lanzillotta , Yashas Annadani , Michel Besserve , Stefan Bauer , Bernhard Schölkopf

When Are Tree Structures Necessary for Deep Learning of Representations?

Recursive neural models, which use syntactic parse trees to recursively generate representations bottom-up, are a popular architecture. But there have not been rigorous evaluations showing for exactly which tasks this syntax-based method is…

Artificial Intelligence · Computer Science 2015-08-19 Jiwei Li , Minh-Thang Luong , Dan Jurafsky , Eduard Hovy

Sentence Bottleneck Autoencoders from Transformer Language Models

Representation learning for text via pretraining a language model on a large corpus has become a standard starting point for building NLP systems. This approach stands in contrast to autoencoders, also trained on raw text, but with the…

Computation and Language · Computer Science 2021-09-14 Ivan Montero , Nikolaos Pappas , Noah A. Smith

Sentence Ordering and Coherence Modeling using Recurrent Neural Networks

Modeling the structure of coherent texts is a key NLP problem. The task of coherently organizing a given set of sentences has been commonly used to build and evaluate models that understand such structure. We propose an end-to-end…

Computation and Language · Computer Science 2017-12-25 Lajanugen Logeswaran , Honglak Lee , Dragomir Radev

Semisupervised Autoencoder for Sentiment Analysis

In this paper, we investigate the usage of autoencoders in modeling textual data. Traditional autoencoders suffer from at least two aspects: scalability with the high dimensionality of vocabulary size and dealing with task-irrelevant words.…

Machine Learning · Computer Science 2015-12-15 Shuangfei Zhai , Zhongfei Zhang

Unsupervised Learning of Explainable Parse Trees for Improved Generalisation

Recursive neural networks (RvNN) have been shown useful for learning sentence representations and helped achieve competitive performance on several natural language inference tasks. However, recent RvNN-based models fail to learn simple…

Computation and Language · Computer Science 2021-04-13 Atul Sahay , Ayush Maheshwari , Ritesh Kumar , Ganesh Ramakrishnan , Manjesh Kumar Hanawal , Kavi Arya

Tree-structured composition in neural networks without tree-structured architectures

Tree-structured neural networks encode a particular tree geometry for a sentence in the network design. However, these models have at best only slightly outperformed simpler sequence-based models. We hypothesize that neural sequence models…

Computation and Language · Computer Science 2015-11-10 Samuel R. Bowman , Christopher D. Manning , Christopher Potts

Learning Structured Text Representations

In this paper, we focus on learning structure-aware document representations from data without recourse to a discourse parser or additional annotations. Drawing inspiration from recent efforts to empower neural networks with a structural…

Computation and Language · Computer Science 2018-02-06 Yang Liu , Mirella Lapata

Autoencoder Based Residual Deep Networks for Robust Regression Prediction and Spatiotemporal Estimation

To have a superior generalization, a deep learning neural network often involves a large size of training sample. With increase of hidden layers in order to increase learning ability, neural network has potential degradation in accuracy.…

Machine Learning · Computer Science 2019-01-01 Lianfa Li , Ying Fang , Jun Wu , Jinfeng Wang

Logical Interpretations of Autoencoders

The unification of low-level perception and high-level reasoning is a long-standing problem in artificial intelligence, which has the potential to not only bring the areas of logic and learning closer together but also demonstrate how…

Artificial Intelligence · Computer Science 2019-11-27 Anton Fuxjaeger , Vaishak Belle

Revisiting Image Reconstruction for Semi-supervised Semantic Segmentation

Autoencoding, which aims to reconstruct the input images through a bottleneck latent representation, is one of the classic feature representation learning strategies. It has been shown effective as an auxiliary task for semi-supervised…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Yuhao Lin , Haiming Xu , Lingqiao Liu , Jinan Zou , Javen Qinfeng Shi

Autoencoder Trees

We discuss an autoencoder model in which the encoding and decoding functions are implemented by decision trees. We use the soft decision tree where internal nodes realize soft multivariate splits given by a gating function and the overall…

Machine Learning · Computer Science 2014-09-29 Ozan İrsoy , Ethem Alpaydın