SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 61016150 of 8378 papers

TitleStatusHype
SimCVD: Simple Contrastive Voxel-Wise Representation Distillation for Semi-Supervised Medical Image Segmentation0
Cascade Bagging for Accuracy Prediction with Few Training SamplesCode0
Log-Spectral Matching GAN: PPG-based Atrial Fibrillation Detection can be Enhanced by GAN-based Data Augmentation with Integration of Spectral LossCode0
An empirical investigation into audio pipeline approaches for classifying bird species0
InfoGAN-MSF: a data augmentation approach for correlative bridge monitoring factors0
Towards artificially intelligent recycling Improving image processing for waste classification0
Triplet Contrastive Learning for Brain Tumor Classification0
Impact of Aliasing on Generalization in Deep Convolutional Networks0
Ensemble Augmentation for Deep Neural Networks Using 1-D Time Series Vibration DataCode0
High-frequency shape recovery from shading by CNN and domain adaptation0
Distilling Transformers for Neural Cross-Domain Search0
Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning0
Automatic Rail Component Detection Based on AttnConv-Net0
Exploring Structure Consistency for Deep Model Watermarking0
Locally Interpretable One-Class Anomaly Detection for Credit Card Fraud DetectionCode0
Alleviating Mode Collapse in GAN via Diversity Penalty Module0
Pervasive Hand Gesture Recognition for Smartphones using Non-audible Sound and Deep Learning0
MRI to PET Cross-Modality Translation using Globally and Locally Aware GAN (GLA-GAN) for Multi-Modal Diagnosis of Alzheimer's Disease0
Lung Sound Classification Using Co-tuning and Stochastic Normalization0
Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney0
CPSC: Conformal prediction with shrunken centroids for efficient prediction reliability quantification and data augmentation, a case in alternative herbal medicine classification with electronic nose0
Adversarial Data Augmentation for Disordered Speech Recognition0
Semi-Supervising Learning, Transfer Learning, and Knowledge Distillation with SimCLR0
Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter DatasetCode0
Cambridge at SemEval-2021 Task 2: Neural WiC-Model with Data Augmentation and Exploration of Representation0
MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER0
LIORI at SemEval-2021 Task 2: Span Prediction and Binary Classification approaches to Word-in-Context Disambiguation0
OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text Classification0
ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task0
Data augmentation for low-resource grapheme-to-phoneme mapping0
PAW at SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation : Exploring Cross Lingual Transfer, Augmentations and Adversarial Training0
Multilingual Speech Translation with Unified Transformer: Huawei Noah’s Ark Lab at IWSLT 20210
基于字词粒度噪声数据增强的中文语法纠错(Chinese Grammatical Error Correction enhanced by Data Augmentation from Word and Character Levels)0
Building Goal-oriented Document-grounded Dialogue Systems0
Team “NoConflict” at CASE 2021 Task 1: Pretraining for Sentence-Level Protest Event Detection0
Data Augmentation with Adversarial Training for Cross-Lingual NLI0
DeepBlueAI at SemEval-2021 Task 1: Lexical Complexity Prediction with A Deep Ensemble Approach0
VL-BERT+: Detecting Protected Groups in Hateful Multimodal Memes0
mixSeq: A Simple Data Augmentation Methodfor Neural Machine Translation0
The University of Arizona at SemEval-2021 Task 10: Applying Self-training, Active Learning and Data Augmentation to Source-free Domain Adaptation0
IITK at SemEval-2021 Task 10: Source-Free Unsupervised Domain Adaptation using Class Prototypes0
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better GeneralizabilityCode0
Discriminative Reranking for Neural Machine Translation0
IMS’ Systems for the IWSLT 2021 Low-Resource Speech Translation Task0
Improving Low-Resource Named Entity Recognition via Label-Aware Data Augmentation and Curriculum Denoising0
Technical Report on Shared Task in DialDoc210
BME Submission for SIGMORPHON 2021 Shared Task 0. A Three Step Training Approach with Data Augmentation for Morphological Inflection0
NLPIITR at SemEval-2021 Task 6: RoBERTa Model with Data Augmentation for Persuasion Techniques Detection0
LeCun at SemEval-2021 Task 6: Detecting Persuasion Techniques in Text Using Ensembled Pretrained Transformers and Data Augmentation0
Improved English to Hindi Multimodal Neural Machine Translation0
Show:102550
← PrevPage 123 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified