SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 55515600 of 8378 papers

TitleStatusHype
Triplet Contrastive Learning for Brain Tumor Classification0
Enhancing MR Image Segmentation with Realistic Adversarial Data AugmentationCode1
Impact of Aliasing on Generalization in Deep Convolutional Networks0
Ensemble Augmentation for Deep Neural Networks Using 1-D Time Series Vibration DataCode0
High-frequency shape recovery from shading by CNN and domain adaptation0
Improving Contrastive Learning by Visualizing Feature TransformationCode1
Distilling Transformers for Neural Cross-Domain Search0
Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning0
Exploring Structure Consistency for Deep Model Watermarking0
Automatic Rail Component Detection Based on AttnConv-Net0
Alleviating Mode Collapse in GAN via Diversity Penalty Module0
Locally Interpretable One-Class Anomaly Detection for Credit Card Fraud DetectionCode0
Pervasive Hand Gesture Recognition for Smartphones using Non-audible Sound and Deep Learning0
Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney0
MRI to PET Cross-Modality Translation using Globally and Locally Aware GAN (GLA-GAN) for Multi-Modal Diagnosis of Alzheimer's Disease0
Lung Sound Classification Using Co-tuning and Stochastic Normalization0
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
The Devil is in the GAN: Backdoor Attacks and Defenses in Deep Generative ModelsCode1
CPSC: Conformal prediction with shrunken centroids for efficient prediction reliability quantification and data augmentation, a case in alternative herbal medicine classification with electronic nose0
Adversarial Data Augmentation for Disordered Speech Recognition0
Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter DatasetCode0
Semi-Supervising Learning, Transfer Learning, and Knowledge Distillation with SimCLR0
Robust Semantic Segmentation with Superpixel-MixCode1
Building Goal-oriented Document-grounded Dialogue Systems0
Technical Report on Shared Task in DialDoc210
CAiRE in DialDoc21: Data Augmentation for Information Seeking Dialogue SystemCode1
Maastricht University’s Multilingual Speech Translation System for IWSLT 20210
Multilingual Speech Translation with Unified Transformer: Huawei Noah’s Ark Lab at IWSLT 20210
ALEM at CASE 2021 Task 1: Multilingual Text Classification on News Articles0
IMS’ Systems for the IWSLT 2021 Low-Resource Speech Translation Task0
Without Further Ado: Direct and Simultaneous Speech Translation by AppTek in 20210
Edinburgh’s End-to-End Multilingual Speech Translation System for IWSLT 20210
Team “NoConflict” at CASE 2021 Task 1: Pretraining for Sentence-Level Protest Event Detection0
mixSeq: A Simple Data Augmentation Methodfor Neural Machine Translation0
BME Submission for SIGMORPHON 2021 Shared Task 0. A Three Step Training Approach with Data Augmentation for Morphological Inflection0
Data augmentation for low-resource grapheme-to-phoneme mapping0
Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust FinetuningCode1
Product Review Translation using Phrase Replacement and Attention Guided Noise Augmentation0
Improving Low-Resource Named Entity Recognition via Label-Aware Data Augmentation and Curriculum Denoising0
基于字词粒度噪声数据增强的中文语法纠错(Chinese Grammatical Error Correction enhanced by Data Augmentation from Word and Character Levels)0
Improved English to Hindi Multimodal Neural Machine Translation0
ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task0
VL-BERT+: Detecting Protected Groups in Hateful Multimodal Memes0
MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER0
NLPIITR at SemEval-2021 Task 6: RoBERTa Model with Data Augmentation for Persuasion Techniques Detection0
Exploring Listwise Evidence Reasoning with T5 for Fact Verification0
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better GeneralizabilityCode0
Avoiding Overlap in Data Augmentation for AMR-to-Text Generation0
FPAI at SemEval-2021 Task 6: BERT-MRC for Propaganda Techniques Detection0
Continuous Language Generative FlowCode1
Show:102550
← PrevPage 112 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified