SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 2650 of 8378 papers

TitleStatusHype
Pythia v0.1: the Winning Entry to the VQA Challenge 2018Code3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical ReasoningCode3
OpenGraph: Towards Open Graph Foundation ModelsCode3
Depth Any Camera: Zero-Shot Metric Depth Estimation from Any CameraCode3
Improved motif-scaffolding with SE(3) flow matchingCode3
Augmentation-Free Graph Contrastive Learning of Invariant-Discriminative RepresentationsCode3
EfficientNetV2: Smaller Models and Faster TrainingCode3
EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone TrainingCode3
Anatomy-informed Data Augmentation for Enhanced Prostate Cancer DetectionCode3
Data Augmentation for Sequential Recommendation: A SurveyCode3
Generalizing Motion Planners with Mixture of Experts for Autonomous DrivingCode3
AutoAugment: Learning Augmentation Policies from DataCode3
Generative Data Augmentation using LLMs improves Distributional Robustness in Question AnsweringCode3
Depth Field Networks for Generalizable Multi-view Scene RepresentationCode2
Deep Visual Geo-localization BenchmarkCode2
Delving into the Trajectory Long-tail Distribution for Muti-object TrackingCode2
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future DirectionsCode2
Understanding the Tricks of Deep Learning in Medical Image Segmentation: Challenges and Future DirectionsCode2
Deep learning for time series classificationCode2
Decoupling Representation Learning from Reinforcement LearningCode2
Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation frameworkCode2
DD-Ranking: Rethinking the Evaluation of Dataset DistillationCode2
Deep PCB To COCO ConvertorCode2
Addressing Concept Shift in Online Time Series Forecasting: Detect-then-AdaptCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Show:102550
← PrevPage 2 of 336Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified