SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 12011250 of 8378 papers

TitleStatusHype
TrivialAugment: Tuning-free Yet State-of-the-Art Data AugmentationCode1
Temporal Cluster Matching for Change Detection of Structures from Satellite ImageryCode1
Training GANs with Stronger Augmentations via Contrastive DiscriminatorCode1
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics ModelCode1
Pushing the Limits of Capsule NetworksCode1
Refine Myself by Teaching Myself: Feature Refinement via Self-Knowledge DistillationCode1
Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum LearningCode1
Fair Mixup: Fairness via InterpolationCode1
ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object DetectionCode1
Doubly Contrastive Deep ClusteringCode1
Multitask 3D CBCT-to-CT Translation and Organs-at-Risk Segmentation Using Physics-Based Data AugmentationCode1
Contemplating real-world object classificationCode1
Consistency Regularization for Adversarial RobustnessCode1
What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer LabelsCode1
Modeling tail risks of inflation using unobserved component quantile regressionsCode1
VIPriors 1: Visual Inductive Priors for Data-Efficient Deep Learning ChallengesCode1
Data Augmentation for Object Detection via Differentiable Neural RenderingCode1
On the effectiveness of adversarial training against common corruptionsCode1
Multi-attentional Deepfake DetectionCode1
Diffusion Probabilistic Models for 3D Point Cloud GenerationCode1
Data Augmentation for Abstractive Query-Focused Multi-Document SummarizationCode1
Fixing Data Augmentation to Improve Adversarial RobustnessCode1
SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound ClassificationCode1
Context Decoupling Augmentation for Weakly Supervised Semantic SegmentationCode1
Data-Efficient GAN Training Beyond (Just) Augmentations: A Lottery Ticket PerspectiveCode1
GaNDLF: A Generally Nuanced Deep Learning Framework for Scalable End-to-End Clinical Workflows in Medical ImagingCode1
PRIN/SPRIN: On Extracting Point-wise Rotation Invariant FeaturesCode1
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-LearningCode1
Analyzing Overfitting under Class Imbalance in Neural Networks for Image SegmentationCode1
Image Compositing for Segmentation of Surgical Tools without Manual AnnotationsCode1
Towards Solving the DeepFake Problem : An Analysis on Improving DeepFake Detection using Dynamic Face AugmentationCode1
IoTDevID: A Behavior-Based Device Identification Method for the IoTCode1
End-to-end lyrics Recognition with Voice to Singing Style TransferCode1
Comparison of semi-supervised deep learning algorithms for audio classificationCode1
TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale UpCode1
Estimation of kinematics from inertial measurement units using a combined deep learning and optimization frameworkCode1
SelfHAR: Improving Human Activity Recognition through Self-training with Unlabeled DataCode1
An Investigation of End-to-End Models for Robust Speech RecognitionCode1
COVID-19 detection from scarce chest x-ray image data using few-shot deep learning approachCode1
Bootstrapping Relation Extractors using Syntactic Search by ExamplesCode1
Negative Data AugmentationCode1
Quantifying and Mitigating Privacy Risks of Contrastive LearningCode1
Deep Semi-Supervised Learning for Time Series ClassificationCode1
Modeling the Probabilistic Distribution of Unlabeled Data forOne-shot Medical Image SegmentationCode1
Regularization Strategy for Point Cloud via Rigidly Mixed SampleCode1
Learning domain-agnostic visual representation for computational pathology using medically-irrelevant style transfer augmentationCode1
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and AggregationCode1
Single Model Deep Learning on Imbalanced Small Datasets for Skin Lesion ClassificationCode1
Efficient-CapsNet: Capsule Network with Self-Attention RoutingCode1
Generating Syntactically Controlled Paraphrases without Using Annotated Parallel PairsCode1
Show:102550
← PrevPage 25 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified