Related papers: Interpretable Structure-Evolving LSTM

Semantic Object Parsing with Graph LSTM

By taking the semantic object parsing task as an exemplar application scenario, we propose the Graph Long Short-Term Memory (Graph LSTM) network, which is the generalization of LSTM from sequential data or multi-dimensional data to general…

Computer Vision and Pattern Recognition · Computer Science 2016-03-24 Xiaodan Liang, Xiaohui Shen, Jiashi Feng, Liang Lin, Shuicheng Yan

Exploring Interpretable LSTM Neural Networks over Multi-Variable Data

For recurrent neural networks trained on time series with target and exogenous variables, in addition to accurate prediction, it is also desired to provide interpretable insights into the data. In this paper, we explore the structure of…

Machine Learning · Computer Science 2019-05-30 Tian Guo, Tao Lin, Nino Antulov-Fantulin

From Nodes to Networks: Evolving Recurrent Neural Networks

Gated recurrent networks such as those composed of Long Short-Term Memory (LSTM) nodes have recently been used to improve state of the art in many sequential processing tasks such as speech recognition and machine translation. However, the…

Neural and Evolutionary Computing · Computer Science 2018-06-11 Aditya Rawal, Risto Miikkulainen

Stochastic Blockmodels meet Graph Neural Networks

Stochastic blockmodels (SBM) and their variants, e.g., mixed-membership and overlapping stochastic blockmodels, are latent variable based generative models for graphs. They have proven to be successful for various tasks, such as…

Machine Learning · Computer Science 2019-05-15 Nikhil Mehta, Lawrence Carin, Piyush Rai

Interpretable Structure-aware Document Encoders with Hierarchical Attention

We propose a method to create document representations that reflect their internal structure. We modify Tree-LSTMs to hierarchically merge basic elements such as words and sentences into blocks of increasing complexity. Our Structure…

Computation and Language · Computer Science 2019-10-08 Khalil Mrini, Claudiu Musat, Michael Baeriswyl, Martin Jaggi

Long Short-Term Memory with Gate and State Level Fusion for Light Field-Based Face Recognition

Long Short-Term Memory (LSTM) is a prominent recurrent neural network for extracting dependencies from sequential data such as time-series and multi-view data, having achieved impressive results for different visual recognition tasks. A…

Computer Vision and Pattern Recognition · Computer Science 2020-06-03 Alireza Sepas-Moghaddam, Ali Etemad, Fernando Pereira, Paulo Lobato Correia

Learning an Interpretable Graph Structure in Multi-Task Learning

We present a novel methodology to jointly perform multi-task learning and infer intrinsic relationship among tasks by an interpretable and sparse graph. Unlike existing multi-task learning methodologies, the graph structure is not assumed…

Machine Learning · Computer Science 2020-09-15 Shujian Yu, Francesco Alesiani, Ammar Shaker, Wenzhe Yin

Visualizing and Understanding Recurrent Networks

Recurrent Neural Networks (RNNs), and specifically a variant with Long Short-Term Memory (LSTM), are enjoying renewed interest as a result of successful applications in a wide range of machine learning problems that involve sequential data.…

Machine Learning · Computer Science 2015-11-18 Andrej Karpathy, Justin Johnson, Li Fei-Fei

Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks

Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of…

Computation and Language · Computer Science 2015-06-02 Kai Sheng Tai, Richard Socher, Christopher D. Manning

Learning From Graph Neighborhoods Using LSTMs

Many prediction problems can be phrased as inferences over local neighborhoods of graphs. The graph represents the interaction between entities, and the neighborhood of each entity contains information that allows the inferences or…

Machine Learning · Computer Science 2016-11-22 Rakshit Agrawal, Luca de Alfaro, Vassilis Polychronopoulos

Learning Multi-graph Structure for Temporal Knowledge Graph Reasoning

Temporal Knowledge Graph (TKG) reasoning that forecasts future events based on historical snapshots distributed over timestamps is denoted as extrapolation and has gained significant attention. Owing to its extreme versatility and variation…

Artificial Intelligence · Computer Science 2024-07-01 Jinchuan Zhang, Bei Hui, Chong Mu, Ling Tian

An interpretable LSTM neural network for autoregressive exogenous model

In this paper, we propose an interpretable LSTM recurrent neural network, i.e., multi-variable LSTM for time series with exogenous variables. Currently, widely used attention mechanism in recurrent neural networks mostly focuses on the…

Machine Learning · Computer Science 2018-04-17 Tian Guo, Tao Lin, Yao Lu

Long Short-Term Memory Over Tree Structures

The chain-structured long short-term memory (LSTM) has been shown to be effective in a wide range of problems such as speech recognition and machine translation. In this paper, we propose to extend it to tree structures, in which a memory cell…

Computation and Language · Computer Science 2015-03-18 Xiaodan Zhu, Parinaz Sobhani, Hongyu Guo

A memory enhanced LSTM for modeling complex temporal dependencies

In this paper, we present Gamma-LSTM, an enhanced long short term memory (LSTM) unit, to enable learning of hierarchical representations through multiple stages of temporal abstractions. Gamma memory, a hierarchical memory unit, forms the…

Machine Learning · Computer Science 2019-10-29 Sneha Aenugu

LiteLSTM Architecture for Deep Recurrent Neural Networks

Long short-term memory (LSTM) is a robust recurrent neural network architecture for learning spatiotemporal sequential data. However, it requires significant computational power for learning and implementing from both software and hardware…

Machine Learning · Computer Science 2022-10-26 Nelly Elsayed, Zag ElSayed, Anthony S. Maida

Multi-Perspective LSTM for Joint Visual Representation Learning

We present a novel LSTM cell architecture capable of learning both intra- and inter-perspective relationships available in visual sequences captured from multiple perspectives. Our architecture adopts a novel recurrent joint learning…

Computer Vision and Pattern Recognition · Computer Science 2021-05-07 Alireza Sepas-Moghaddam, Fernando Pereira, Paulo Lobato Correia, Ali Etemad

Representation Learning of Graphs Using Graph Convolutional Multilayer Networks Based on Motifs

The graph structure is a commonly used data storage mode, and it turns out that the low-dimensional embedded representation of nodes in the graph is extremely useful in various typical tasks, such as node classification, link prediction,…

Social and Information Networks · Computer Science 2020-08-03 Xing Li, Wei Wei, Xiangnan Feng, Xue Liu, Zhiming Zheng

Stochastic Graph Recurrent Neural Network

Representation learning over graph structure data has been widely studied due to its wide application prospects. However, previous methods mainly focus on static graphs while many real-world graphs evolve over time. Modeling such evolution…

Machine Learning · Statistics 2020-09-02 Tijin Yan, Hongwei Zhang, Zirui Li, Yuanqing Xia

Implicit Graph, Explicit Retrieval: Towards Efficient and Interpretable Long-horizon Memory for Large Language Models

Long-horizon applications increasingly require large language models (LLMs) to answer queries when relevant evidence is sparse and dispersed across very long contexts. Existing memory systems largely follow two paradigms: explicit…

Computation and Language · Computer Science 2026-01-08 Xin Zhang, Kailai Yang, Hao Li, Chenyue Li, Qiyu Wei, Sophia Ananiadou

Revisiting the Hierarchical Multiscale LSTM

Hierarchical Multiscale LSTM (Chung et al., 2016a) is a state-of-the-art language model that learns interpretable structure from character-level input. Such models can provide fertile ground for (cognitive) computational linguistics…

Computation and Language · Computer Science 2018-07-11 Ákos Kádár, Marc-Alexandre Côté, Grzegorz Chrupała, Afra Alishahi