Deep Generative Models for learning Coherent Latent Representations from Multi-Modal Data

2019-05-01ICLR 2019Unverified0· sign in to hype

Timo Korthals, Marc Hesse, Jürgen Leitner

Unverified — Be the first to reproduce this paper.

Abstract

The application of multi-modal generative models by means of a Variational Auto Encoder (VAE) is an upcoming research topic for sensor fusion and bi-directional modality exchange. This contribution gives insights into the learned joint latent representation and shows that expressiveness and coherence are decisive properties for multi-modal datasets. Furthermore, we propose a multi-modal VAE derived from the full joint marginal log-likelihood that is able to learn the most meaningful representation for ambiguous observations. Since the properties of multi-modal sensor setups are essential for our approach but hardly available, we also propose a technique to generate correlated datasets from uni-modal ones.

Tasks

Sensor Fusion

Deep Generative Models for learning Coherent Latent Representations from Multi-Modal Data

Abstract

Tasks

Reproductions