Mosaic Augmentation for Text: Cropping and Collaging as Cross-Domain Techniques
2022-01-16 · ACL ARR January 2022
Anonymous
Abstract
We present new visually inspired cropping and collaging data augmentations for text. We test how these augmentations affect performance in data-scarce scenarios across three NLP tasks: named entity recognition, extractive question answering, and abstractive summarization, on 9 prominent datasets. Ablation studies indicate that the main reason the augmentations help differs from task to task, but all three tasks benefit from our approach. We achieve significant improvements over baselines, particularly in limited-data settings.
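The abstract does not say how cropping and collaging are implemented, so the following is only a rough sketch of one possible reading for a token-labelling task such as named entity recognition. It assumes that "cropping" means keeping a random contiguous window of an example and that "collaging" means concatenating crops taken from several examples. The function names `crop` and `collage` and the `min_len` parameter are hypothetical and are not taken from the paper.

```python
import random
from typing import List, Sequence, Tuple

Example = Tuple[List[str], List[str]]  # (tokens, per-token labels)


def crop(tokens: Sequence[str], labels: Sequence[str],
         min_len: int, rng: random.Random) -> Example:
    """Assumed 'crop': keep a random contiguous window of >= min_len tokens."""
    n = len(tokens)
    if n <= min_len:
        # Too short to crop; return the example unchanged.
        return list(tokens), list(labels)
    length = rng.randint(min_len, n)      # inclusive on both ends
    start = rng.randint(0, n - length)
    end = start + length
    return list(tokens[start:end]), list(labels[start:end])


def collage(examples: Sequence[Example], min_len: int,
            rng: random.Random) -> Example:
    """Assumed 'collage': concatenate one crop from each source example."""
    out_tokens: List[str] = []
    out_labels: List[str] = []
    for tokens, labels in examples:
        t, l = crop(tokens, labels, min_len, rng)
        out_tokens.extend(t)
        out_labels.extend(l)
    return out_tokens, out_labels


if __name__ == "__main__":
    rng = random.Random(0)
    a = (["Alice", "visited", "Paris", "in", "May"],
         ["B-PER", "O", "B-LOC", "O", "O"])
    b = (["Bob", "works", "at", "Acme"],
         ["B-PER", "O", "O", "B-ORG"])
    print(collage([a, b], min_len=2, rng=rng))
```

This sketch keeps tokens and labels aligned but does not repair label schemes at crop boundaries (for example, a window that starts on an `I-` tag in BIO tagging), and it does not address how question answering or summarization targets would need to be adjusted.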