Related papers: Temporal Convolution Networks with Positional Enco…

Convolutional neural networks pretrained on large face recognition datasets for emotion classification from video

In this paper we describe a solution to our entry for the emotion recognition challenge EmotiW 2017. We propose an ensemble of several models, which capture spatial and audio features from videos. Spatial features are captured by…

Computer Vision and Pattern Recognition · Computer Science 2017-11-15 Boris Knyazev, Roman Shvetsov, Natalia Efremova, Artem Kuharenko

Boosting Continuous Emotion Recognition with Self-Pretraining using Masked Autoencoders, Temporal Convolutional Networks, and Transformers

Human emotion recognition holds a pivotal role in facilitating seamless human-computer interaction. This paper delineates our methodology in tackling the Valence-Arousal (VA) Estimation Challenge, Expression (Expr) Classification Challenge,…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Weiwei Zhou, Jiada Lu, Chenkun Ling, Weifeng Wang, Shaowei Liu

Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks

Facial expressions are one of the most powerful ways for depicting specific patterns in human behavior and describing human emotional state. Despite the impressive advances of affective computing over the last decade, automatic video-based…

Computer Vision and Pattern Recognition · Computer Science 2021-01-18 Thomas Teixeira, Eric Granger, Alessandro Lameiras Koerich

Positional Encoding Helps Recurrent Neural Networks Handle a Large Vocabulary

This study reports an unintuitive finding that positional encoding enhances learning of recurrent neural networks (RNNs). Positional encoding is a high-dimensional representation of time indices on input data. Most famously, positional…

Machine Learning · Computer Science 2024-11-28 Takashi Morita

Spatially Encoding Temporal Correlations to Classify Temporal Data Using Convolutional Neural Networks

We propose an off-line approach to explicitly encode temporal patterns spatially as different types of images, namely, Gramian Angular Fields and Markov Transition Fields. This enables the use of techniques from computer vision for feature…

Machine Learning · Computer Science 2015-09-25 Zhiguang Wang, Tim Oates

EEV: A Large-Scale Dataset for Studying Evoked Expressions from Video

Videos can evoke a range of affective responses in viewers. The ability to predict evoked affect from a video, before viewers watch the video, can help in content creation and video recommendation. We introduce the Evoked Expressions from…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Jennifer J. Sun, Ting Liu, Alan S. Cowen, Florian Schroff, Hartwig Adam, Gautam Prasad

Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning

It is well believed that video captioning is a fundamental but challenging task in both computer vision and artificial intelligence fields. The prevalent approach is to map an input video to a variable-length output sentence in a sequence…

Computer Vision and Pattern Recognition · Computer Science 2019-05-06 Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei

EnK: Encoding time-information in convolution

Recent development in deep learning techniques has attracted attention in decoding and classification in EEG signals. Despite several efforts utilizing different features of EEG signals, a significant research challenge is to use…

Machine Learning · Computer Science 2020-06-09 Avinash Kumar Singh, Chin-Teng Lin

Temporal-spatial Representation Learning Transformer for EEG-based Emotion Recognition

Both the temporal dynamics and spatial correlations of Electroencephalogram (EEG), which contain discriminative emotion information, are essential for the emotion recognition. However, some redundant information within the EEG signals would…

Signal Processing · Electrical Eng. & Systems 2022-11-17 Zhe Wang, Yongxiong Wang, Chuanfei Hu, Zhong Yin, Yu Song

EchoVPR: Echo State Networks for Visual Place Recognition

Recognising previously visited locations is an important, but unsolved, task in autonomous navigation. Current visual place recognition (VPR) benchmarks typically challenge models to recover the position of a query image (or images) from…

Computer Vision and Pattern Recognition · Computer Science 2022-02-14 Anil Ozdemir, Mark Scerri, Andrew B. Barron, Andrew Philippides, Michael Mangan, Eleni Vasilaki, Luca Manneschi

Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning

Recently, deep learning approach, especially deep Convolutional Neural Networks (ConvNets), have achieved overwhelming accuracy with fast processing speed for image classification. Incorporating temporal structure with deep ConvNets for…

Computer Vision and Pattern Recognition · Computer Science 2015-11-12 Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu, Yueting Zhuang

How Deep Neural Networks Can Improve Emotion Recognition on Video Data

We consider the task of dimensional emotion recognition on video data using deep learning. While several previous methods have shown the benefits of training temporal neural network models such as recurrent neural networks (RNNs) on…

Computer Vision and Pattern Recognition · Computer Science 2017-01-11 Pooya Khorrami, Tom Le Paine, Kevin Brady, Charlie Dagli, Thomas S. Huang

Learning Representations from EEG with Deep Recurrent-Convolutional Neural Networks

One of the challenges in modeling cognitive events from electroencephalogram (EEG) data is finding representations that are invariant to inter- and intra-subject differences, as well as to inherent noise associated with such data. Herein,…

Machine Learning · Computer Science 2016-03-02 Pouya Bashivan, Irina Rish, Mohammed Yeasin, Noel Codella

Leveraging Semantic Scene Characteristics and Multi-Stream Convolutional Architectures in a Contextual Approach for Video-Based Visual Emotion Recognition in the Wild

In this work we tackle the task of video-based visual emotion recognition in the wild. Standard methodologies that rely solely on the extraction of bodily and facial features often fall short of accurate emotion prediction in cases where…

Computer Vision and Pattern Recognition · Computer Science 2022-02-03 Ioannis Pikoulis, Panagiotis P. Filntisis, Petros Maragos

A Recurrent Encoder-Decoder Network for Sequential Face Alignment

We propose a novel recurrent encoder-decoder network model for real-time video-based face alignment. Our proposed model predicts 2D facial point maps regularized by a regression loss, while uniquely exploiting recurrent learning at both…

Computer Vision and Pattern Recognition · Computer Science 2016-08-24 Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas

Gaze-Vector Estimation in the Dark with Temporally Encoded Event-driven Neural Networks

In this paper, we address the intricate challenge of gaze vector prediction, a pivotal task with applications ranging from human-computer interaction to driver monitoring systems. Our innovative approach is designed for the demanding…

Computer Vision and Pattern Recognition · Computer Science 2024-03-06 Abeer Banerjee, Naval K. Mehta, Shyam S. Prasad, Himanshu, Sumeet Saurav, Sanjay Singh

A Real-time Action Representation with Temporal Encoding and Deep Compression

Deep neural networks have achieved remarkable success for video-based action recognition. However, most of existing approaches cannot be deployed in practice due to the high computational cost. To address this challenge, we propose a new…

Computer Vision and Pattern Recognition · Computer Science 2020-06-18 Kun Liu, Wu Liu, Huadong Ma, Mingkui Tan, Chuang Gan

Temporal Network Embedding via Tensor Factorization

Representation learning on static graph-structured data has shown a significant impact on many real-world applications. However, less attention has been paid to the evolving nature of temporal networks, in which the edges are often changing…

Machine Learning · Computer Science 2021-08-24 Jing Ma, Qiuchen Zhang, Jian Lou, Li Xiong, Joyce C. Ho

Multi-Modal Temporal Convolutional Network for Anticipating Actions in Egocentric Videos

Anticipating human actions is an important task that needs to be addressed for the development of reliable intelligent agents, such as self-driving cars or robot assistants. While the ability to make future predictions with high accuracy is…

Computer Vision and Pattern Recognition · Computer Science 2021-07-21 Olga Zatsarynna, Yazan Abu Farha, Juergen Gall

Temporal Attention Evolutional Graph Convolutional Network for Multivariate Time Series Forecasting

Multivariate time series forecasting enables the prediction of future states by leveraging historical data, thereby facilitating decision-making processes. Each data node in a multivariate time series encompasses a sequence of multiple…

Machine Learning · Computer Science 2025-05-02 Xinlong Zhao, Liying Zhang, Tianbo Zou, Yan Zhang