Related papers: Memory Fusion Network for Multi-view Sequential Le…

Dual Memory Neural Computer for Asynchronous Two-view Sequential Learning

One of the core tasks in multi-view learning is to capture relations among views. For sequential data, the relations not only span across views, but also extend throughout the view length to form long-term intra-view and inter-view…

Machine Learning · Computer Science 2018-02-13 Hung Le , Truyen Tran , Svetha Venkatesh

Long Short-Term Memory with Gate and State Level Fusion for Light Field-Based Face Recognition

Long Short-Term Memory (LSTM) is a prominent recurrent neural network for extracting dependencies from sequential data such as time-series and multi-view data, having achieved impressive results for different visual recognition tasks. A…

Computer Vision and Pattern Recognition · Computer Science 2020-06-03 Alireza Sepas-Moghaddam , Ali Etemad , Fernando Pereira , Paulo Lobato Correia

Learnable Graph Convolutional Network and Feature Fusion for Multi-view Learning

In practical applications, multi-view data depicting objectives from assorted perspectives can facilitate the accuracy increase of learning algorithms. However, given multi-view data, there is limited work for learning discriminative node…

Computer Vision and Pattern Recognition · Computer Science 2022-11-18 Zhaoliang Chen , Lele Fu , Jie Yao , Wenzhong Guo , Claudia Plant , Shiping Wang

Latent Variable Algorithms for Multimodal Learning and Sensor Fusion

Multimodal learning has been lacking principled ways of combining information from different modalities and learning a low-dimensional manifold of meaningful representations. We study multimodal learning and sensor fusion from a latent…

Machine Learning · Computer Science 2019-04-24 Lijiang Guo

Multi-Level Feature Fusion for Continual Learning in Visual Quality Inspection

Deep neural networks show great potential for automating various visual quality inspection tasks in manufacturing. However, their applicability is limited in more volatile scenarios, such as remanufacturing, where the inspected products and…

Computer Vision and Pattern Recognition · Computer Science 2026-02-12 Johannes C. Bauer , Paul Geng , Stephan Trattnig , Petr Dokládal , Rüdiger Daub

Dynamic Fusion Networks for Machine Reading Comprehension

This paper presents a novel neural model - Dynamic Fusion Network (DFN), for machine reading comprehension (MRC). DFNs differ from most state-of-the-art models in their use of a dynamic multi-strategy attention process, in which passages,…

Computation and Language · Computer Science 2018-02-28 Yichong Xu , Jingjing Liu , Jianfeng Gao , Yelong Shen , Xiaodong Liu

Multi-Perspective LSTM for Joint Visual Representation Learning

We present a novel LSTM cell architecture capable of learning both intra- and inter-perspective relationships available in visual sequences captured from multiple perspectives. Our architecture adopts a novel recurrent joint learning…

Computer Vision and Pattern Recognition · Computer Science 2021-05-07 Alireza Sepas-Moghaddam , Fernando Pereira , Paulo Lobato Correia , Ali Etemad

Multi-Level Feature Fusion Mechanism for Single Image Super-Resolution

Convolution neural network (CNN) has been widely used in Single Image Super Resolution (SISR) so that SISR has been a great success recently. As the network deepens, the learning ability of network becomes more and more powerful. However,…

Computer Vision and Pattern Recognition · Computer Science 2020-02-17 Jiawen Lyn

Two Stream LSTM: A Deep Fusion Framework for Human Action Recognition

In this paper we address the problem of human action recognition from video sequences. Inspired by the exemplary results obtained via automatic feature learning and deep learning approaches in computer vision, we focus our attention towards…

Computer Vision and Pattern Recognition · Computer Science 2017-04-06 Harshala Gammulle , Simon Denman , Sridha Sridharan , Clinton Fookes

Multi-level Attention Fusion Network for Audio-visual Event Recognition

Event classification is inherently sequential and multimodal. Therefore, deep neural models need to dynamically focus on the most relevant time window and/or modality of a video. In this study, we propose the Multi-level Attention Fusion…

Computer Vision and Pattern Recognition · Computer Science 2021-06-15 Mathilde Brousmiche , Jean Rouat , Stéphane Dupont

Research on Data Fusion Algorithm Based on Deep Learning in Target Tracking

Aiming at the limitation that deep long and short-term memory network(DLSTM) algorithm cannot perform parallel computing and cannot obtain global information, in this paper, feature extraction and feature processing are firstly carried out…

Computer Vision and Pattern Recognition · Computer Science 2022-11-24 Huihui Wu

Multi-modal Conditional Attention Fusion for Dimensional Emotion Prediction

Continuous dimensional emotion prediction is a challenging task where the fusion of various modalities usually achieves state-of-the-art performance such as early fusion or late fusion. In this paper, we propose a novel multi-modal fusion…

Computer Vision and Pattern Recognition · Computer Science 2017-09-08 Shizhe Chen , Qin Jin

Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency

In this paper, we propose a novel neural network structure, namely \emph{feedforward sequential memory networks (FSMN)}, to model long-term dependency in time series without using recurrent feedback. The proposed FSMN is a standard…

Neural and Evolutionary Computing · Computer Science 2016-01-06 Shiliang Zhang , Cong Liu , Hui Jiang , Si Wei , Lirong Dai , Yu Hu

Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks

Deep learning and Convolutional Neural Networks (CNNs) have driven major transformations in diverse research areas. However, their limitations in handling low-frequency information present obstacles in certain tasks like interpreting global…

Computer Vision and Pattern Recognition · Computer Science 2024-03-14 Fuzhi Wu , Jiasong Wu , Youyong Kong , Chunfeng Yang , Guanyu Yang , Huazhong Shu , Guy Carrault , Lotfi Senhadji

Feedforward Sequential Memory Neural Networks without Recurrent Feedback

We introduce a new structure for memory neural networks, called feedforward sequential memory networks (FSMN), which can learn long-term dependency without using recurrent feedback. The proposed FSMN is a standard feedforward neural…

Neural and Evolutionary Computing · Computer Science 2016-01-07 ShiLiang Zhang , Hui Jiang , Si Wei , LiRong Dai

Recurrent Neural Networks for Driver Activity Anticipation via Sensory-Fusion Architecture

Anticipating the future actions of a human is a widely studied problem in robotics that requires spatio-temporal reasoning. In this work we propose a deep learning approach for anticipation in sensory-rich robotics applications. We…

Computer Vision and Pattern Recognition · Computer Science 2015-09-17 Ashesh Jain , Avi Singh , Hema S Koppula , Shane Soh , Ashutosh Saxena

Deep Multimodal Representation Learning from Temporal Data

In recent years, Deep Learning has been successfully applied to multimodal learning problems, with the aim of learning useful joint representations in data fusion applications. When the available modalities consist of time series data such…

Computer Vision and Pattern Recognition · Computer Science 2017-04-12 Xitong Yang , Palghat Ramesh , Radha Chitta , Sriganesh Madhvanath , Edgar A. Bernal , Jiebo Luo

Learning to Focus: Cascaded Feature Matching Network for Few-shot Image Recognition

Deep networks can learn to accurately recognize objects of a category by training on a large number of annotated images. However, a meta-learning challenge known as a low-shot image recognition task comes when only a few images with…

Computer Vision and Pattern Recognition · Computer Science 2021-01-14 Mengting Chen , Xinggang Wang , Heng Luo , Yifeng Geng , Wenyu Liu

Fusing Deep Convolutional Networks for Large Scale Visual Concept Classification

Deep learning architectures are showing great promise in various computer vision domains including image classification, object detection, event detection and action recognition. In this study, we investigate various aspects of…

Computer Vision and Pattern Recognition · Computer Science 2016-08-08 Hilal Ergun , Mustafa Sert

3D Convolutional with Attention for Action Recognition

Human action recognition is one of the challenging tasks in computer vision. The current action recognition methods use computationally expensive models for learning spatio-temporal dependencies of the action. Models utilizing RGB channels…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Labina Shrestha , Shikha Dubey , Farrukh Olimov , Muhammad Aasim Rafique , Moongu Jeon