Related papers: Reference-Aware Language Models

Reference Dependence and Random Attention

We explore the ways that a reference point may direct attention. Utilizing a stochastic choice framework, we provide behavioral foundations for the Reference-Dependent Random Attention Model (RD-RAM). Our characterization result shows that…

Theoretical Economics · Economics 2023-03-21 Matthew Kovach , Elchin Suleymanov

A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues

Sequential data often possesses a hierarchical structure with complex dependencies between subsequences, such as found between the utterances in a dialogue. In an effort to model this kind of generative process, we propose a neural…

Computation and Language · Computer Science 2016-06-15 Iulian Vlad Serban , Alessandro Sordoni , Ryan Lowe , Laurent Charlin , Joelle Pineau , Aaron Courville , Yoshua Bengio

A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

This paper presents a novel latent variable recurrent neural network architecture for jointly modeling sequences of words and (possibly latent) discourse relations between adjacent sentences. A recurrent neural network generates individual…

Computation and Language · Computer Science 2016-04-06 Yangfeng Ji , Gholamreza Haffari , Jacob Eisenstein

Variational Memory Addressing in Generative Models

Aiming to augment generative models with external memory, we interpret the output of a memory module with stochastic addressing as a conditional mixture distribution, where a read operation corresponds to sampling a discrete memory address…

Machine Learning · Computer Science 2017-09-22 Jörg Bornschein , Andriy Mnih , Daniel Zoran , Danilo J. Rezende

Living a discrete life in a continuous world: Reference with distributed representations

Reference is a crucial property of language that allows us to connect linguistic expressions to the world. Modeling it requires handling both continuous and discrete aspects of meaning. Data-driven models excel at the former, but struggle…

Computation and Language · Computer Science 2017-09-05 Gemma Boleda , Sebastian Padó , Nghia The Pham , Marco Baroni

Emergent Communication with World Models

We introduce Language World Models, a class of language-conditional generative model which interpret natural language messages by predicting latent codes of future observations. This provides a visual grounding of the message, similar to an…

Computation and Language · Computer Science 2020-02-25 Alexander I. Cowen-Rivers , Jason Naradowsky

Latent Relation Language Models

In this paper, we propose Latent Relation Language Models (LRLMs), a class of language models that parameterizes the joint distribution over the words in a document and the entities that occur therein via knowledge graph relations. This…

Computation and Language · Computer Science 2019-08-22 Hiroaki Hayashi , Zecong Hu , Chenyan Xiong , Graham Neubig

Toward Forgetting-Sensitive Referring Expression Generationfor Integrated Robot Architectures

To engage in human-like dialogue, robots require the ability to describe the objects, locations, and people in their environment, a capability known as "Referring Expression Generation." As speakers repeatedly refer to similar objects, they…

Artificial Intelligence · Computer Science 2020-07-20 Tom Williams , Torin Johnson , Will Culpepper , Kellyn Larson

Learning Interpretable Latent Dialogue Actions With Less Supervision

We present a novel architecture for explainable modeling of task-oriented dialogues with discrete latent variables to represent dialogue actions. Our model is based on variational recurrent neural networks (VRNN) and requires no explicit…

Computation and Language · Computer Science 2022-10-14 Vojtěch Hudeček , Ondřej Dušek

Collaborating on Referring Expressions

This paper presents a computational model of how conversational participants collaborate in order to make a referring action successful. The model is based on the view of language as goal-directed behavior. We propose that the content of a…

cmp-lg · Computer Science 2008-02-03 Peter A. Heeman , Graeme Hirst

Improving Entity Linking by Modeling Latent Relations between Mentions

Entity linking involves aligning textual mentions of named entities to their corresponding entries in a knowledge base. Entity linking systems often exploit relations between textual mentions in a document (e.g., coreference) to decide if…

Computation and Language · Computer Science 2018-05-01 Phong Le , Ivan Titov

Joint Modeling of Content and Discourse Relations in Dialogues

We present a joint modeling approach to identify salient discussion points in spoken meetings as well as to label the discourse relations between speaker turns. A variation of our model is also discussed when discourse relations are treated…

Computation and Language · Computer Science 2017-05-16 Kechen Qin , Lu Wang , Joseph Kim

Reference Resolution within the Framework of Cognitive Grammar

Following the principles of Cognitive Grammar, we concentrate on a model for reference resolution that attempts to overcome the difficulties previous approaches, based on the fundamental assumption that all reference (independent on the…

Computation and Language · Computer Science 2009-09-15 Susanne Salmon-Alt , Laurent Romary

Iterative Recursive Attention Model for Interpretable Sequence Classification

Natural language processing has greatly benefited from the introduction of the attention mechanism. However, standard attention models are of limited interpretability for tasks that involve a series of inference steps. We describe an…

Computation and Language · Computer Science 2018-09-03 Martin Tutek , Jan Šnajder

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Inspired by recent work in machine translation and object detection, we introduce an attention based model that automatically learns to describe the content of images. We describe how we can train this model in a deterministic manner using…

Machine Learning · Computer Science 2016-04-20 Kelvin Xu , Jimmy Ba , Ryan Kiros , Kyunghyun Cho , Aaron Courville , Ruslan Salakhutdinov , Richard Zemel , Yoshua Bengio

Temporal Latent Variable Structural Causal Model for Causal Discovery under External Interferences

Inferring causal relationships from observed data is an important task, yet it becomes challenging when the data is subject to various external interferences. Most of these interferences are the additional effects of external factors on…

Machine Learning · Computer Science 2025-11-14 Ruichu Cai , Xiaokai Huang , Wei Chen , Zijian Li , Zhifeng Hao

A Latent Variable Approach to Learning High-dimensional Multivariate longitudinal Data

High-dimensional multivariate longitudinal data, which arise when many outcome variables are measured repeatedly over time, are becoming increasingly common in social, behavioral and health sciences. We propose a latent variable model for…

Methodology · Statistics 2025-12-09 Sze Ming Lee , Yunxiao Chen , Tony Sit

Neural Language Modeling With Implicit Cache Pointers

A cache-inspired approach is proposed for neural language models (LMs) to improve long-range dependency and better predict rare words from long contexts. This approach is a simpler alternative to attention-based pointer mechanism that…

Audio and Speech Processing · Electrical Eng. & Systems 2020-09-30 Ke Li , Daniel Povey , Sanjeev Khudanpur

Concurrent Lexicalized Dependency Parsing: The ParseTalk Model

A grammar model for concurrent, object-oriented natural language parsing is introduced. Complete lexical distribution of grammatical knowledge is achieved building upon the head-oriented notions of valency and dependency, while inheritance…

cmp-lg · Computer Science 2008-02-03 Norbert Broeker , Udo Hahn , Susanne Schacht

PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable

Pre-training models have been proved effective for a wide range of natural language processing tasks. Inspired by this, we propose a novel dialogue generation pre-training framework to support various kinds of conversations, including…

Computation and Language · Computer Science 2020-05-01 Siqi Bao , Huang He , Fan Wang , Hua Wu , Haifeng Wang