SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 30513075 of 8378 papers

TitleStatusHype
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data AugmentationCode1
BioAug: Conditional Generation based Data Augmentation for Low-Resource Biomedical NERCode0
Inspecting the Geographical Representativeness of Images from Text-to-Image Models0
Boosting Distress Support Dialogue Responses with Motivational Interviewing StrategyCode0
Sharpness & Shift-Aware Self-Supervised Learning0
An Ensemble Deep Learning Approach for COVID-19 Severity Prediction Using Chest CT ScansCode0
Rethinking Data Augmentation for Tabular Data in Deep LearningCode1
Advising OpenMP Parallelization via a Graph-Based Approach with TransformersCode0
Data Augmentation for Conflict and Duplicate Detection in Software Engineering Sentence Pairs0
Adversarial Word Dilution as Text Data Augmentation in Low-Resource RegimeCode0
Boosting Event Extraction with Denoised Structure-to-Text Augmentation0
Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment AnalysisCode1
Learning Better Contrastive View from Radiologist's GazeCode1
Improved baselines for vision-language pre-training0
Exploiting Frequency Spectrum of Adversarial Images for General Robustness0
t-RAIN: Robust generalization under weather-aliasing label shift attacks0
AdamR at SemEval-2023 Task 10: Solving the Class Imbalance Problem in Sexism Detection with Ensemble Learning0
Learning to Generalize for Cross-domain QACode0
DAC-MR: Data Augmentation Consistency Based Meta-Regularization for Meta-LearningCode1
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative ExamplesCode0
Consistency Regularization for Domain Generalization with Logit Attribution MatchingCode0
Cloud-RAIN: Point Cloud Analysis with Reflectional InvarianceCode0
Subject-based Non-contrastive Self-Supervised Learning for ECG Signal Processing0
Improving Small Language Models on PubMedQA via Generative Data Augmentation0
Uncertainty Estimation and Out-of-Distribution Detection for Deep Learning-Based Image Reconstruction using the Local Lipschitz0
Show:102550
← PrevPage 123 of 336Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified