Related papers: Exact Decoding on Latent Variable Conditional Mode…

On the Computational Intractability of Exact and Approximate Dictionary Learning

The efficient sparse coding and reconstruction of signal vectors via linear observations has received a tremendous amount of attention over the last decade. In this context, the automated learning of a suitable basis or overcomplete…

Information Theory · Computer Science 2015-06-19 Andreas M. Tillmann

Exact Inference with Latent Variables in an Arbitrary Domain

We analyze the necessary and sufficient conditions for exact inference of a latent model. In latent models, each entity is associated with a latent variable following some probability distribution. The challenging question we try to solve…

Social and Information Networks · Computer Science 2020-06-30 Chuyang Ke , Jean Honorio

A Tutorial on Deep Latent Variable Models of Natural Language

There has been much recent, exciting work on combining the complementary strengths of latent variable models and deep learning. Latent variable modeling makes it easy to explicitly specify model constraints through conditional independence…

Computation and Language · Computer Science 2019-08-06 Yoon Kim , Sam Wiseman , Alexander M. Rush

Stress-Testing Neural Models of Natural Language Inference with Multiply-Quantified Sentences

Standard evaluations of deep learning models for semantics using naturalistic corpora are limited in what they can tell us about the fidelity of the learned representations, because the corpora rarely come with good measures of semantic…

Computation and Language · Computer Science 2018-11-01 Atticus Geiger , Ignacio Cases , Lauri Karttunen , Christopher Potts

Conditional Variational Autoencoder for Neural Machine Translation

We explore the performance of latent variable models for conditional text generation in the context of neural machine translation (NMT). Similar to Zhang et al., we augment the encoder-decoder NMT paradigm by introducing a continuous latent…

Computation and Language · Computer Science 2018-12-12 Artidoro Pagnoni , Kevin Liu , Shangyan Li

Efficient implementation of linear programming decoding

While linear programming (LP) decoding provides more flexibility for finite-length performance analysis than iterative message-passing (IMP) decoding, it is computationally more complex to implement in its original form, due to both the…

Information Theory · Computer Science 2009-02-05 Mohammad H. Taghavi , Amin Shokrollahi , Paul H. Siegel

Learning a Loopy Model For Semantic Segmentation Exactly

Learning structured models using maximum margin techniques has become an indispensable tool for com- puter vision researchers, as many computer vision applications can be cast naturally as an image labeling problem. Pixel-based or…

Machine Learning · Computer Science 2013-09-17 Andreas Christian Mueller , Sven Behnke

How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics

Natural Language Inference (NLI) evaluation is crucial for assessing language understanding models; however, popular datasets suffer from systematic spurious correlations that artificially inflate actual model performance. To address this,…

Computation and Language · Computer Science 2024-10-07 Adrian Cosma , Stefan Ruseti , Mihai Dascalu , Cornelia Caragea

Predictive Coding for Locally-Linear Control

High-dimensional observations and unknown dynamics are major challenges when applying optimal control to many real-world decision making tasks. The Learning Controllable Embedding (LCE) framework addresses these challenges by embedding the…

Machine Learning · Computer Science 2020-03-03 Rui Shu , Tung Nguyen , Yinlam Chow , Tuan Pham , Khoat Than , Mohammad Ghavamzadeh , Stefano Ermon , Hung H. Bui

Challenges for cognitive decoding using deep learning methods

In cognitive decoding, researchers aim to characterize a brain region's representations by identifying the cognitive states (e.g., accepting/rejecting a gamble) that can be identified from the region's activity. Deep learning (DL) methods…

Machine Learning · Computer Science 2021-08-17 Armin W. Thomas , Christopher Ré , Russell A. Poldrack

A Neural Network-aided Low Complexity Chase Decoder for URLLC

Ultra-reliable low-latency communications (URLLC) demand decoding algorithms that simultaneously offer high reliability and low complexity under stringent latency constraints. While iterative decoding schemes for LDPC and Polar codes offer…

Signal Processing · Electrical Eng. & Systems 2025-07-30 Enrico Testi , Enrico Paolini

Deconvolutional Latent-Variable Model for Text Sequence Matching

A latent-variable model is introduced for text matching, inferring sentence representations by jointly optimizing generative and discriminative objectives. To alleviate typical optimization challenges in latent-variable models for text, we…

Computation and Language · Computer Science 2017-11-23 Dinghan Shen , Yizhe Zhang , Ricardo Henao , Qinliang Su , Lawrence Carin

Interpreting Recurrent and Attention-Based Neural Models: a Case Study on Natural Language Inference

Deep learning models have achieved remarkable success in natural language inference (NLI) tasks. While these models are widely explored, they are hard to interpret and it is often unclear how and why they actually work. In this paper, we…

Computation and Language · Computer Science 2019-05-21 Reza Ghaeini , Xiaoli Z. Fern , Prasad Tadepalli

Learning Latent Dynamics via Invariant Decomposition and (Spatio-)Temporal Transformers

We propose a method for learning dynamical systems from high-dimensional empirical data that combines variational autoencoders and (spatio-)temporal attention within a framework designed to enforce certain scientifically-motivated…

Machine Learning · Computer Science 2023-06-22 Kai Lagemann , Christian Lagemann , Sach Mukherjee

Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding

To mitigate the high inference latency stemming from autoregressive decoding in Large Language Models (LLMs), Speculative Decoding has emerged as a novel decoding paradigm for LLM inference. In each decoding step, this method first drafts…

Computation and Language · Computer Science 2024-06-05 Heming Xia , Zhe Yang , Qingxiu Dong , Peiyi Wang , Yongqi Li , Tao Ge , Tianyu Liu , Wenjie Li , Zhifang Sui

Latent Regression Bayesian Network for Data Representation

Deep directed generative models have attracted much attention recently due to their expressive representation power and the ability of ancestral sampling. One major difficulty of learning directed models with many latent variables is the…

Machine Learning · Computer Science 2015-06-16 Siqi Nie , Qiang Ji

On the Hardness of the Lee Syndrome Decoding Problem

In this paper we study the hardness of the syndrome decoding problem over finite rings endowed with the Lee metric. We first prove that the decisional version of the problem is NP-complete, by a reduction from the $3$-dimensional matching…

Information Theory · Computer Science 2022-04-04 Violetta Weger , Karan Khathuria , Anna-Lena Horlemann , Massimo Battaglioni , Paolo Santini , Edoardo Persichetti

Latent Programmer: Discrete Latent Codes for Program Synthesis

In many sequence learning tasks, such as program synthesis and document summarization, a key problem is searching over a large space of possible output sequences. We propose to learn representations of the outputs that are specifically…

Machine Learning · Computer Science 2021-08-09 Joey Hong , David Dohan , Rishabh Singh , Charles Sutton , Manzil Zaheer

Hyperdecoders: Instance-specific decoders for multi-task NLP

We investigate input-conditioned hypernetworks for multi-tasking in NLP, generating parameter-efficient adaptations for a decoder using a hypernetwork conditioned on the output of an encoder. This approach produces a unique decoder…

Computation and Language · Computer Science 2022-10-19 Hamish Ivison , Matthew E. Peters

Lazy-k: Decoding for Constrained Token Classification

We explore the possibility of improving probabilistic models in structured prediction. Specifically, we combine the models with constrained decoding approaches in the context of token classification for information extraction. The decoding…

Computation and Language · Computer Science 2023-12-07 Arthur Hemmer , Mickaël Coustaty , Nicola Bartolo , Jérôme Brachat , Jean-Marc Ogier