English
Related papers

Related papers: Context based Text-generation using LSTM networks

200 papers

While automatic response generation for building chatbot systems has drawn a lot of attention recently, there is limited understanding on when we need to consider the linguistic context of an input text in the generation process. The task…

Computation and Language · Computer Science 2016-11-04 Chaozhuo Li , Yu Wu , Wei Wu , Chen Xing , Zhoujun Li , Ming Zhou

Long Short-Term Memory recurrent neural network (LSTM) is widely used and known to capture informative long-term syntactic dependencies. However, how such information are reflected in its internal vectors for natural text has not yet been…

Computation and Language · Computer Science 2020-10-02 Chihiro Shibata , Kei Uchiumi , Daichi Mochihashi

While long short-term memory (LSTM) neural net architectures are designed to capture sequence information, human language is generally composed of hierarchical structures. This raises the question as to whether LSTMs can learn hierarchical…

Computation and Language · Computer Science 2018-11-08 Luzi Sennhauser , Robert C. Berwick

Natural language generation (NLG) is a critical component of spoken dialogue and it has a significant impact both on usability and perceived quality. Most NLG systems in common use employ rules and heuristics and tend to generate rigid and…

Computation and Language · Computer Science 2015-08-27 Tsung-Hsien Wen , Milica Gasic , Nikola Mrksic , Pei-Hao Su , David Vandyke , Steve Young

Natural language generation of coherent long texts like paragraphs or longer documents is a challenging problem for recurrent networks models. In this paper, we explore an important step toward this generation task: training an LSTM…

Computation and Language · Computer Science 2015-06-09 Jiwei Li , Minh-Thang Luong , Dan Jurafsky

Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of…

Computation and Language · Computer Science 2015-06-02 Kai Sheng Tai , Richard Socher , Christopher D. Manning

Breakthroughs in deep learning and memory networks have made major advances in natural language understanding. Language is sequential and information carried through the sequence can be captured through memory networks. Learning the…

Computation and Language · Computer Science 2023-05-22 Anupiya Nugaliyadde

Documents exhibit sequential structure at multiple levels of abstraction (e.g., sentences, paragraphs, sections). These abstractions constitute a natural hierarchy for representing the context in which to infer the meaning of words and…

Computation and Language · Computer Science 2016-06-01 Shalini Ghosh , Oriol Vinyals , Brian Strope , Scott Roy , Tom Dean , Larry Heck

Real-world videos often have complex dynamics; and methods for generating open-domain video descriptions should be sensitive to temporal structure and allow both input (sequence of frames) and output (sequence of words) of variable length.…

Computer Vision and Pattern Recognition · Computer Science 2015-10-20 Subhashini Venugopalan , Marcus Rohrbach , Jeff Donahue , Raymond Mooney , Trevor Darrell , Kate Saenko

Automatically describing video content with natural language is a fundamental challenge of multimedia. Recurrent Neural Networks (RNN), which models sequence dynamics, has attracted increasing attention on visual interpretation. However,…

Computer Vision and Pattern Recognition · Computer Science 2015-06-05 Yingwei Pan , Tao Mei , Ting Yao , Houqiang Li , Yong Rui

Large Language Models (LLMs) have revolutionised the field of Natural Language Processing (NLP) and have achieved state-of-the-art performance in practically every task in this field. However, the prevalent approach used in text generation,…

Computation and Language · Computer Science 2024-08-12 Nicolo Micheletti , Samuel Belkadi , Lifeng Han , Goran Nenadic

This paper develops a model that addresses sentence embedding, a hot topic in current natural language processing research, using recurrent neural networks with Long Short-Term Memory (LSTM) cells. Due to its ability to capture long term…

Computation and Language · Computer Science 2016-11-18 Hamid Palangi , Li Deng , Yelong Shen , Jianfeng Gao , Xiaodong He , Jianshu Chen , Xinying Song , Rabab Ward

Target-dependent sentiment classification remains a challenge: modeling the semantic relatedness of a target with its context words in a sentence. Different context words have different influences on determining the sentiment polarity of a…

Computation and Language · Computer Science 2016-09-30 Duyu Tang , Bing Qin , Xiaocheng Feng , Ting Liu

Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that has been designed to address the vanishing and exploding gradient problems of conventional RNNs. Unlike feedforward neural networks, RNNs have cyclic…

Neural and Evolutionary Computing · Computer Science 2014-02-06 Haşim Sak , Andrew Senior , Françoise Beaufays

Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have been successfully applied to a variety of sequence modeling tasks. In this paper we develop Tree Long Short-Term Memory…

Computation and Language · Computer Science 2016-04-05 Xingxing Zhang , Liang Lu , Mirella Lapata

Long-context modeling is one of the critical capabilities of language AI for digesting and reasoning over complex information pieces. In practice, long-context capabilities are typically built into a pre-trained language model~(LM) through…

Computation and Language · Computer Science 2024-10-15 Luyu Gao , Yunyi Zhang , Jamie Callan

The goal of language modeling techniques is to capture the statistical and structural properties of natural languages from training corpora. This task typically involves the learning of short range dependencies, which generally model the…

Computation and Language · Computer Science 2017-08-23 Youssef Oualil , Mittul Singh , Clayton Greenberg , Dietrich Klakow

Computational models for sarcasm detection have often relied on the content of utterances in isolation. However, speaker's sarcastic intent is not always obvious without additional context. Focusing on social media discussions, we…

Computation and Language · Computer Science 2017-07-21 Debanjan Ghosh , Alexander Richard Fabbri , Smaranda Muresan

Language models (LM) play an important role in large vocabulary continuous speech recognition (LVCSR). However, traditional language models only predict next single word with given history, while the consecutive predictions on a sequence of…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-06 Qi Liu , Yanmin Qian , Kai Yu

This paper presents an exploration of Long Short-Term Memory (LSTM) networks in the realm of text generation, focusing on the utilization of historical datasets for Shakespeare and Nietzsche. LSTMs, known for their effectiveness in handling…

Computation and Language · Computer Science 2024-03-13 Mustafa Abbas Hussein Hussein , Serkan Savaş
‹ Prev 1 2 3 10 Next ›