English
Related papers

Related papers: Discrete Autoencoders for Sequence Models

200 papers

Diffusion models have achieved great success in modeling continuous data modalities such as images, audio, and video, but have seen limited use in discrete domains such as language. Recent attempts to adapt diffusion to language have…

Computation and Language · Computer Science 2023-11-08 Justin Lovelace , Varsha Kishore , Chao Wan , Eliot Shekhtman , Kilian Q. Weinberger

In this work we explore deep generative models of text in which the latent representation of a document is itself drawn from a discrete language model distribution. We formulate a variational auto-encoder for inference in this model and…

Computation and Language · Computer Science 2016-10-17 Yishu Miao , Phil Blunsom

We train multi-task autoencoders on linguistic tasks and analyze the learned hidden sentence representations. The representations change significantly when translation and part-of-speech decoders are added. The more decoders a model…

Computation and Language · Computer Science 2018-01-19 Gino Brunner , Yuyi Wang , Roger Wattenhofer , Michael Weigelt

Dense retrieval requires high-quality text sequence embeddings to support effective search in the representation space. Autoencoder-based language models are appealing in dense retrieval as they train the encoder to output high-quality…

Machine Learning · Computer Science 2021-09-17 Shuqi Lu , Di He , Chenyan Xiong , Guolin Ke , Waleed Malik , Zhicheng Dou , Paul Bennett , Tieyan Liu , Arnold Overwijk

Neural sequence-to-sequence models are currently the dominant approach in several natural language processing tasks, but require large parallel corpora. We present a sequence-to-sequence-to-sequence autoencoder (SEQ^3), consisting of two…

Computation and Language · Computer Science 2019-06-11 Christos Baziotis , Ion Androutsopoulos , Ioannis Konstas , Alexandros Potamianos

Variational autoencoders (VAEs) are essential tools in end-to-end representation learning. However, the sequential text generation common pitfall with VAEs is that the model tends to ignore latent variables with a strong auto-regressive…

Machine Learning · Computer Science 2021-02-26 Yang Zhao , Ping Yu , Suchismit Mahapatra , Qinliang Su , Changyou Chen

Deep generative models have been wildly successful at learning coherent latent representations for continuous data such as video and audio. However, generative modeling of discrete data such as arithmetic expressions and molecular…

Machine Learning · Statistics 2017-03-07 Matt J. Kusner , Brooks Paige , José Miguel Hernández-Lobato

Autoregressive language models have demonstrated a remarkable ability to extract latent structure from text. The embeddings from large language models have been shown to capture aspects of the syntax and semantics of language. But what…

Machine Learning · Computer Science 2026-01-09 Liyi Zhang , Michael Y. Li , R. Thomas McCoy , Theodore R. Sumers , Jian-Qiao Zhu , Thomas L. Griffiths

Ever since the successful application of sequence to sequence learning for neural machine translation systems, interest has surged in its applicability towards language generation in other problem domains. Recent work has investigated the…

Computation and Language · Computer Science 2017-10-31 Sharath T. S. , Shubhangi Tandon , Ryan Bauer

Generative autoencoders offer a promising approach for controllable text generation by leveraging their latent sentence representations. However, current models struggle to maintain coherent latent spaces required to perform meaningful text…

Machine Learning · Computer Science 2020-07-08 Tianxiao Shen , Jonas Mueller , Regina Barzilay , Tommi Jaakkola

Deep generative models have been used in recent years to learn coherent latent representations in order to synthesize high-quality images. In this work, we propose a neural network to learn a generative model for sampling consistent indoor…

Computer Vision and Pattern Recognition · Computer Science 2020-08-24 Pulak Purkait , Christopher Zach , Ian Reid

Representation learning for text via pretraining a language model on a large corpus has become a standard starting point for building NLP systems. This approach stands in contrast to autoencoders, also trained on raw text, but with the…

Computation and Language · Computer Science 2021-09-14 Ivan Montero , Nikolaos Pappas , Noah A. Smith

Despite advances in deep probabilistic models, learning discrete latent representations remains challenging. This work introduces a novel method to improve inference in discrete Variational Autoencoders by reframing the inference problem…

Machine Learning · Computer Science 2025-06-11 María Martínez-García , Grace Villacrés , David Mitchell , Pablo M. Olmos

Representation learning is important for solving sequence-to-sequence problems in natural language processing. Representation learning transforms raw data into vector-form representations while preserving their features. However, data with…

Computation and Language · Computer Science 2023-01-12 Yunhao Yang , Zhaokun Xue , Andrew Whinston

Multi-modal data-sets are ubiquitous in modern applications, and multi-modal Variational Autoencoders are a popular family of models that aim to learn a joint representation of the different modalities. However, existing approaches suffer…

Machine Learning · Computer Science 2023-12-19 Mustapha Bounoua , Giulio Franzese , Pietro Michiardi

Despite the extensive success of pretrained language models as encoders for building NLP systems, they haven't seen prominence as decoders for sequence generation tasks. We explore the question of whether these models can be adapted to be…

Computation and Language · Computer Science 2020-08-21 Nishant Subramani , Nivedita Suresh

Variational autoencoders have been widely applied for natural language generation, however, there are two long-standing problems: information under-representation and posterior collapse. The former arises from the fact that only the last…

Machine Learning · Computer Science 2021-06-17 Xianghong Fang , Haoli Bai , Zenglin Xu , Michael Lyu , Irwin King

Cross-language learning allows us to use training data from one language to build models for a different language. Many approaches to bilingual learning require that we have word-level alignment of sentences from parallel corpora. In this…

Computation and Language · Computer Science 2014-02-07 Sarath Chandar A P , Stanislas Lauly , Hugo Larochelle , Mitesh M. Khapra , Balaraman Ravindran , Vikas Raykar , Amrita Saha

There has been a lot of recent interest in designing neural network models to estimate a distribution from a set of examples. We introduce a simple modification for autoencoder neural networks that yields powerful generative models. Our…

Machine Learning · Computer Science 2015-06-08 Mathieu Germain , Karol Gregor , Iain Murray , Hugo Larochelle

The joint optimization of the reconstruction and classification error is a hard non convex problem, especially when a non linear mapping is utilized. In order to overcome this obstacle, a novel optimization strategy is proposed, in which a…

Machine Learning · Computer Science 2022-11-07 Ioannis A. Nellas , Sotiris K. Tasoulis , Vassilis P. Plagianakos , Spiros V. Georgakopoulos
‹ Prev 1 2 3 10 Next ›