SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 81018150 of 8378 papers

TitleStatusHype
On the Importance of Capturing a Sufficient Diversity of Perspective for the Classification of micro-PCBsCode0
Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation StrategiesCode0
Tailoring Mixup to Data for CalibrationCode0
Classification robustness to common optical aberrationsCode0
Augmentation Matters: A Mix-Paste Method for X-Ray Prohibited Item Detection under Noisy AnnotationsCode0
Classification of Bark Beetle-Induced Forest Tree Mortality using Deep LearningCode0
DC4L: Distribution Shift Recovery via Data-Driven Control for Deep Learning ModelsCode0
Classification Beats Regression: Counting of Cells from Greyscale Microscopic Images based on Annotation-free Training SamplesCode0
CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information RetrievalCode0
Augmentation BackdoorsCode0
Semantically Consistent Data Augmentation for Neural Machine Translation via Conditional Masked Language ModelCode0
Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice ExtractionCode0
Semantically Distributed Robust Optimization for Vision-and-Language InferenceCode0
Enhance Multimodal Model Performance with Data Augmentation: Facebook Hateful Meme Challenge SolutionCode0
On the Privacy Effect of Data Enhancement via the Lens of MemorizationCode0
Semantically Equivalent Adversarial Rules for Debugging NLP modelsCode0
On the Robustness of Intent Classification and Slot Labeling in Goal-oriented Dialog Systems to Real-world NoiseCode0
Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction TuningCode0
Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset ChallengeCode0
Towards Robust Transcription: Exploring Noise Injection Strategies for Training Data AugmentationCode0
WavLM model ensemble for audio deepfake detectionCode0
On the Summarization of Consumer Health QuestionsCode0
Semantic-aware Data Augmentation for Text-to-image SynthesisCode0
CINeMA: Conditional Implicit Neural Multi-Modal Atlas for a Spatio-Temporal Representation of the Perinatal BrainCode0
Towards Robust Unsupervised Attention Prediction in Autonomous DrivingCode0
Enhanced Long-Tailed Recognition with Contrastive CutMix AugmentationCode0
An Analysis of Unsupervised Pre-training in Light of Recent AdvancesCode0
Data Augmentation for Object Detection via Progressive and Selective Instance-SwitchingCode0
UniGAP: A Universal and Adaptive Graph Upsampling Approach to Mitigate Over-Smoothing in Node Classification TasksCode0
Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger GuidanceCode0
Targeted Nonlinear Adversarial Perturbations in Images and VideosCode0
ENGAGE: Explanation Guided Data Augmentation for Graph Representation LearningCode0
CIC-BART-SSA: Controllable Image Captioning with Structured Semantic AugmentationCode0
Targeted synthetic data generation for tabular data via hardness characterizationCode0
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based ModelCode0
Adversarial Robustness Study of Convolutional Neural Network for Lumbar Disk Shape Reconstruction from MR imagesCode0
TSA on AutoPilot: Self-tuning Self-supervised Time Series Anomaly DetectionCode0
Enabling Data Diversity: Efficient Automatic Augmentation via Regularized Adversarial TrainingCode0
OpenMix+: Revisiting Data Augmentation for Open Set RecognitionCode0
An Analysis of the Influence of Transfer Learning When Measuring the Tortuosity of Blood VesselsCode0
Semantic keypoint extraction for scanned animals using multi-depth-camera systemsCode0
CIAug: Equipping Interpolative Augmentation with Curriculum LearningCode0
ChildAugment: Data Augmentation Methods for Zero-Resource Children's Speaker VerificationCode0
Augmentation-Aware Self-Supervision for Data-Efficient GAN TrainingCode0
AugDMC: Data Augmentation Guided Deep Multiple ClusteringCode0
AudRandAug: Random Image Augmentations for Audio ClassificationCode0
Task Augmentation by Rotating for Meta-LearningCode0
Audiogmenter: a MATLAB Toolbox for Audio Data AugmentationCode0
Towards Self-Supervised Learning of Global and Object-Centric RepresentationsCode0
Optimal Transport Posterior Alignment for Cross-lingual Semantic ParsingCode0
Show:102550
← PrevPage 163 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified