MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion Recognition

2021-10-27

Jinming Zhao, Ruichen Li, Qin Jin, Xinchao Wang, Haizhou Li

Abstract

Research on multimodal emotion recognition is hindered by the lack of labeled corpora of sufficient scale and diversity, a consequence of high annotation cost and label ambiguity. In this paper, we propose MEmoBERT, a pre-training model for multimodal emotion recognition that learns joint multimodal representations through self-supervised learning on large-scale unlabeled video data. Furthermore, unlike the conventional "pre-train, fine-tune" paradigm, we propose a prompt-based method that reformulates downstream emotion classification as a masked text prediction task, bringing the downstream task closer to the pre-training objective. Extensive experiments on two benchmark datasets, IEMOCAP and MSP-IMPROV, show that MEmoBERT significantly enhances emotion recognition performance.
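
The prompt-based reformulation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the prompt template, the label-to-word mapping (VERBALIZER), and the helper names build_prompt and classify_from_mask_logits are all assumptions made for illustration, and the mask-position logits are supplied by hand rather than produced by a pre-trained model.

```python
import math

# Hypothetical mapping from emotion labels to the words a masked language
# model would be asked to predict. The abstract does not list these words.
VERBALIZER = {"happy": "happy", "sad": "sad", "angry": "angry", "neutral": "neutral"}


def build_prompt(utterance, mask_token="[MASK]"):
    """Append an assumed template so classification becomes mask filling."""
    return f"{utterance} I am {mask_token}."


def softmax(scores):
    """Numerically stable softmax over a dict of label -> logit."""
    peak = max(scores.values())
    exps = {label: math.exp(value - peak) for label, value in scores.items()}
    total = sum(exps.values())
    return {label: value / total for label, value in exps.items()}


def classify_from_mask_logits(mask_logits, verbalizer=VERBALIZER):
    """Pick the emotion whose label word scores highest at the mask position.

    mask_logits maps vocabulary words to logits at the masked position.
    Only the label words are compared; all other vocabulary is ignored.
    """
    label_scores = {label: mask_logits[word] for label, word in verbalizer.items()}
    probs = softmax(label_scores)
    best = max(probs, key=probs.get)
    return best, probs


if __name__ == "__main__":
    prompt = build_prompt("I finally got the job!")
    # Toy logits standing in for a real model's output at the mask position.
    toy_logits = {"happy": 2.0, "sad": 0.1, "angry": -1.0, "neutral": 0.5, "table": 5.0}
    label, probs = classify_from_mask_logits(toy_logits)
    print(prompt)
    print(label, probs)
```

Restricting the comparison to the label words is what turns an open-vocabulary mask prediction into a closed-set classifier; in the toy input above, the high-scoring word "table" has no effect on the decision.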

Tasks

Diversity, Emotion Classification, Emotion Recognition, Multimodal Emotion Recognition, Self-Supervised Learning
