Related papers: Unsupervised Multiple-Object Tracking with a Dynam…

Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation

In this paper, we propose a latent-variable generative model called mixture of dynamical variational autoencoders (MixDVAE) to model the dynamics of a system composed of multiple moving sources. A DVAE model is pre-trained on a…

Machine Learning · Computer Science 2023-12-08 Xiaoyu Lin , Laurent Girin , Xavier Alameda-Pineda

Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders

Dynamical variational autoencoders (DVAEs) are a class of deep generative models with latent variables, dedicated to model time series of high-dimensional data. DVAEs can be considered as extensions of the variational autoencoder (VAE) that…

Sound · Computer Science 2022-10-04 Xiaoyu Bie , Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin

Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder Approach

Human behavior prediction models enable robots to anticipate how humans may react to their actions, and hence are instrumental to devising safe and proactive robot planning algorithms. However, modeling complex interaction dynamics and…

Robotics · Computer Science 2020-11-24 Boris Ivanovic , Karen Leung , Edward Schmerling , Marco Pavone

Variational Autoencoder-Based Vehicle Trajectory Prediction with an Interpretable Latent Space

This paper introduces the Descriptive Variational Autoencoder (DVAE), an unsupervised and end-to-end trainable neural network for predicting vehicle trajectories that provides partial interpretability. The novel approach is based on the…

Machine Learning · Computer Science 2021-06-25 Marion Neumeier , Andreas Tollkühn , Thomas Berberich , Michael Botsch

Multimodal Generative Models for Scalable Weakly-Supervised Learning

Multiple modalities often co-occur when describing natural phenomena. Learning a joint representation of these modalities should yield deeper and more useful representations. Previous generative approaches to multi-modal input either do not…

Machine Learning · Computer Science 2018-11-13 Mike Wu , Noah Goodman

Multi-Modal Anomaly Detection for Unstructured and Uncertain Environments

To achieve high-levels of autonomy, modern robots require the ability to detect and recover from anomalies and failures with minimal human supervision. Multi-modal sensor signals could provide more information for such anomaly detection…

Robotics · Computer Science 2020-12-17 Tianchen Ji , Sri Theja Vuppala , Girish Chowdhary , Katherine Driggs-Campbell

Unsupervised Multiple Person Tracking using AutoEncoder-Based Lifted Multicuts

Multiple Object Tracking (MOT) is a long-standing task in computer vision. Current approaches based on the tracking by detection paradigm either require some sort of domain knowledge or supervision to associate data correctly into tracks.…

Computer Vision and Pattern Recognition · Computer Science 2020-02-05 Kalun Ho , Janis Keuper , Margret Keuper

Enhanced Variational Inference with Dyadic Transformation

Variational autoencoder is a powerful deep generative model with variational inference. The practice of modeling latent variables in the VAE's original formulation as normal distributions with a diagonal covariance matrix limits the…

Machine Learning · Computer Science 2019-03-11 Sarin Chandy , Amin Rasekh

Multi-Object Tracking with Deep Learning Ensemble for Unmanned Aerial System Applications

Multi-object tracking (MOT) is a crucial component of situational awareness in military defense applications. With the growing use of unmanned aerial systems (UASs), MOT methods for aerial surveillance is in high demand. Application of MOT…

Computer Vision and Pattern Recognition · Computer Science 2021-10-06 Wanlin Xie , Jaime Ide , Daniel Izadi , Sean Banger , Thayne Walker , Ryan Ceresani , Dylan Spagnuolo , Christopher Guagliano , Henry Diaz , Jason Twedt

A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling

The Variational Autoencoder (VAE) is a powerful deep generative model that is now extensively used to represent high-dimensional complex data via a low-dimensional latent space learned in an unsupervised manner. In the original VAE model,…

Sound · Computer Science 2021-06-15 Xiaoyu Bie , Laurent Girin , Simon Leglaive , Thomas Hueber , Xavier Alameda-Pineda

Unbiased Learning of Deep Generative Models with Structured Discrete Representations

By composing graphical models with deep learning architectures, we learn generative models with the strengths of both frameworks. The structured variational autoencoder (SVAE) inherits structure and interpretability from graphical models,…

Machine Learning · Computer Science 2023-11-15 Harry Bendekgey , Gabriel Hope , Erik B. Sudderth

Deep Matching Autoencoders

Increasingly many real world tasks involve data in multiple modalities or views. This has motivated the development of many effective algorithms for learning a common latent space to relate multiple domains. However, most existing…

Computer Vision and Pattern Recognition · Computer Science 2017-11-17 Tanmoy Mukherjee , Makoto Yamada , Timothy M. Hospedales

Variational Mutual Information Maximization Framework for VAE Latent Codes with Continuous and Discrete Priors

Learning interpretable and disentangled representations of data is a key topic in machine learning research. Variational Autoencoder (VAE) is a scalable method for learning directed latent variable models of complex data. It employs a clear…

Machine Learning · Computer Science 2020-06-04 Andriy Serdega , Dae-Shik Kim

HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE

Studies on the automatic processing of 3D human pose data have flourished in the recent past. In this paper, we are interested in the generation of plausible and diverse future human poses following an observed 3D pose sequence. Current…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Xiaoyu Bie , Wen Guo , Simon Leglaive , Lauren Girin , Francesc Moreno-Noguer , Xavier Alameda-Pineda

Motion-DVAE: Unsupervised learning for fast human motion denoising

Pose and motion priors are crucial for recovering realistic and accurate human motion from noisy observations. Substantial progress has been made on pose and shape estimation from images, and recent works showed impressive results using…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Guénolé Fiche , Simon Leglaive , Xavier Alameda-Pineda , Renaud Séguier

Social-DualCVAE: Multimodal Trajectory Forecasting Based on Social Interactions Pattern Aware and Dual Conditional Variational Auto-Encoder

Pedestrian trajectory forecasting is a fundamental task in multiple utility areas, such as self-driving, autonomous robots, and surveillance systems. The future trajectory forecasting is multi-modal, influenced by physical interaction with…

Computer Vision and Pattern Recognition · Computer Science 2022-02-09 Jiashi Gao , Xinming Shi , James J. Q. Yu

UTOPIA: Unconstrained Tracking Objects without Preliminary Examination via Cross-Domain Adaptation

Multiple Object Tracking (MOT) aims to find bounding boxes and identities of targeted objects in consecutive video frames. While fully-supervised MOT methods have achieved high accuracy on existing datasets, they cannot generalize well on a…

Computer Vision and Pattern Recognition · Computer Science 2023-06-19 Pha Nguyen , Kha Gia Quach , John Gauch , Samee U. Khan , Bhiksha Raj , Khoa Luu

EnsembleMOT: A Step towards Ensemble Learning of Multiple Object Tracking

Multiple Object Tracking (MOT) has rapidly progressed in recent years. Existing works tend to design a single tracking algorithm to perform both detection and association. Though ensemble learning has been exploited in many tasks, i.e,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-20 Yunhao Du , Zihang Liu , Fei Su

Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness

As one of the most popular generative models, Variational Autoencoder (VAE) approximates the posterior of latent variables based on amortized variational inference. However, when the decoder network is sufficiently expressive, VAE may lead…

Machine Learning · Computer Science 2021-10-26 Dazhong Shen , Chuan Qin , Chao Wang , Hengshu Zhu , Enhong Chen , Hui Xiong

Dynamical-VAE-based Hindsight to Learn the Causal Dynamics of Factored-POMDPs

Learning representations of underlying environmental dynamics from partial observations is a critical challenge in machine learning. In the context of Partially Observable Markov Decision Processes (POMDPs), state representations are often…

Machine Learning · Computer Science 2024-11-13 Chao Han , Debabrota Basu , Michael Mangan , Eleni Vasilaki , Aditya Gilra