Related papers: Context based Text-generation using LSTM networks

Detecting Context Dependent Messages in a Conversational Environment

While automatic response generation for building chatbot systems has drawn a lot of attention recently, there is limited understanding on when we need to consider the linguistic context of an input text in the generation process. The task…

Computation and Language · Computer Science 2016-11-04 Chaozhuo Li, Yu Wu, Wei Wu, Chen Xing, Zhoujun Li, Ming Zhou

How LSTM Encodes Syntax: Exploring Context Vectors and Semi-Quantization on Natural Text

Long Short-Term Memory recurrent neural network (LSTM) is widely used and known to capture informative long-term syntactic dependencies. However, how such information is reflected in its internal vectors for natural text has not yet been…

Computation and Language · Computer Science 2020-10-02 Chihiro Shibata, Kei Uchiumi, Daichi Mochihashi

Evaluating the Ability of LSTMs to Learn Context-Free Grammars

While long short-term memory (LSTM) neural net architectures are designed to capture sequence information, human language is generally composed of hierarchical structures. This raises the question as to whether LSTMs can learn hierarchical…

Computation and Language · Computer Science 2018-11-08 Luzi Sennhauser, Robert C. Berwick

Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems

Natural language generation (NLG) is a critical component of spoken dialogue and it has a significant impact both on usability and perceived quality. Most NLG systems in common use employ rules and heuristics and tend to generate rigid and…

Computation and Language · Computer Science 2015-08-27 Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Pei-Hao Su, David Vandyke, Steve Young

A Hierarchical Neural Autoencoder for Paragraphs and Documents

Natural language generation of coherent long texts like paragraphs or longer documents is a challenging problem for recurrent networks models. In this paper, we explore an important step toward this generation task: training an LSTM…

Computation and Language · Computer Science 2015-06-09 Jiwei Li, Minh-Thang Luong, Dan Jurafsky

Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks

Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of…

Computation and Language · Computer Science 2015-06-02 Kai Sheng Tai, Richard Socher, Christopher D. Manning

Extending Memory for Language Modelling

Breakthroughs in deep learning and memory networks have made major advances in natural language understanding. Language is sequential and information carried through the sequence can be captured through memory networks. Learning the…

Computation and Language · Computer Science 2023-05-22 Anupiya Nugaliyadde

Contextual LSTM (CLSTM) models for Large scale NLP tasks

Documents exhibit sequential structure at multiple levels of abstraction (e.g., sentences, paragraphs, sections). These abstractions constitute a natural hierarchy for representing the context in which to infer the meaning of words and…

Computation and Language · Computer Science 2016-06-01 Shalini Ghosh, Oriol Vinyals, Brian Strope, Scott Roy, Tom Dean, Larry Heck

Sequence to Sequence -- Video to Text

Real-world videos often have complex dynamics; and methods for generating open-domain video descriptions should be sensitive to temporal structure and allow both input (sequence of frames) and output (sequence of words) of variable length.…

Computer Vision and Pattern Recognition · Computer Science 2015-10-20 Subhashini Venugopalan, Marcus Rohrbach, Jeff Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko

Jointly Modeling Embedding and Translation to Bridge Video and Language

Automatically describing video content with natural language is a fundamental challenge of multimedia. Recurrent Neural Networks (RNNs), which model sequence dynamics, have attracted increasing attention for visual interpretation. However,…

Computer Vision and Pattern Recognition · Computer Science 2015-06-05 Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, Yong Rui

Exploration of Masked and Causal Language Modelling for Text Generation

Large Language Models (LLMs) have revolutionised the field of Natural Language Processing (NLP) and have achieved state-of-the-art performance in practically every task in this field. However, the prevalent approach used in text generation,…

Computation and Language · Computer Science 2024-08-12 Nicolo Micheletti, Samuel Belkadi, Lifeng Han, Goran Nenadic

Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval

This paper develops a model that addresses sentence embedding, a hot topic in current natural language processing research, using recurrent neural networks with Long Short-Term Memory (LSTM) cells. Due to its ability to capture long term…

Computation and Language · Computer Science 2016-11-18 Hamid Palangi, Li Deng, Yelong Shen, Jianfeng Gao, Xiaodong He, Jianshu Chen, Xinying Song, Rabab Ward

Effective LSTMs for Target-Dependent Sentiment Classification

Target-dependent sentiment classification remains a challenge: modeling the semantic relatedness of a target with its context words in a sentence. Different context words have different influences on determining the sentiment polarity of a…

Computation and Language · Computer Science 2016-09-30 Duyu Tang, Bing Qin, Xiaocheng Feng, Ting Liu

Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition

Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that has been designed to address the vanishing and exploding gradient problems of conventional RNNs. Unlike feedforward neural networks, RNNs have cyclic…

Neural and Evolutionary Computing · Computer Science 2014-02-06 Haşim Sak, Andrew Senior, Françoise Beaufays

Top-down Tree Long Short-Term Memory Networks

Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have been successfully applied to a variety of sequence modeling tasks. In this paper we develop Tree Long Short-Term Memory…

Computation and Language · Computer Science 2016-04-05 Xingxing Zhang, Liang Lu, Mirella Lapata

ACER: Automatic Language Model Context Extension via Retrieval

Long-context modeling is one of the critical capabilities of language AI for digesting and reasoning over complex information pieces. In practice, long-context capabilities are typically built into a pre-trained language model (LM) through…

Computation and Language · Computer Science 2024-10-15 Luyu Gao, Yunyi Zhang, Jamie Callan

Long-Short Range Context Neural Networks for Language Modeling

The goal of language modeling techniques is to capture the statistical and structural properties of natural languages from training corpora. This task typically involves the learning of short range dependencies, which generally model the…

Computation and Language · Computer Science 2017-08-23 Youssef Oualil, Mittul Singh, Clayton Greenberg, Dietrich Klakow

The Role of Conversation Context for Sarcasm Detection in Online Interactions

Computational models for sarcasm detection have often relied on the content of utterances in isolation. However, speaker's sarcastic intent is not always obvious without additional context. Focusing on social media discussions, we…

Computation and Language · Computer Science 2017-07-21 Debanjan Ghosh, Alexander Richard Fabbri, Smaranda Muresan

Future Vector Enhanced LSTM Language Model for LVCSR

Language models (LM) play an important role in large vocabulary continuous speech recognition (LVCSR). However, traditional language models only predict next single word with given history, while the consecutive predictions on a sequence of…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-06 Qi Liu, Yanmin Qian, Kai Yu

LSTM-Based Text Generation: A Study on Historical Datasets

This paper presents an exploration of Long Short-Term Memory (LSTM) networks in the realm of text generation, focusing on the utilization of historical datasets for Shakespeare and Nietzsche. LSTMs, known for their effectiveness in handling…

Computation and Language · Computer Science 2024-03-13 Mustafa Abbas Hussein Hussein, Serkan Savaş