SOTAVerified

MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space

2021-01-01ICLR 2021Code Available1· sign in to hype

Tsz-Him Cheung, Dit-yan Yeung

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Data augmentation is an efficient way to expand a training dataset by creating additional artificial data. While data augmentation is found to be effective in improving the generalization capability of models for various machine learning tasks, the underlying augmentation methods are usually manually designed and carefully evaluated for each data modality separately, like image processing functions for image data and word-replacing rules for text data. In this work, we propose an automated data augmentation approach called MODALS (Modality-agnostic Automated Data Augmentation in the Latent Space) to augment data for any modality in a generic way. MODALS exploits automated data augmentation to fine-tune four universal data transformation operations in the latent space to adapt the transform to data of different modalities. Through comprehensive experiments, we demonstrate the effectiveness of MODALS on multiple datasets for text, tabular, time-series and image modalities.

Tasks

Reproductions