SOTAVerified

Data Augmentation

Data augmentation is a set of techniques that enlarge a dataset by applying modifications to existing examples to create new ones. Besides growing the dataset, it also increases its diversity. When training machine learning models, data augmentation acts as a regularizer and helps reduce overfitting.

Data augmentation techniques have proven useful in domains such as NLP and computer vision. In computer vision, common transformations include cropping, flipping, and rotation. In NLP, techniques include swapping, deletion, and random insertion, among others.
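The transformations named above can be sketched in a few lines of plain Python. This is a minimal illustration, not any library's API: it assumes an image is a nested list of pixel values and a sentence is a list of tokens, and every function name, along with the `vocabulary` argument, is hypothetical. Random insertion here draws a word from a supplied vocabulary, which is a simplification; published text augmentation methods often insert a synonym of a word already in the sentence.

```python
import random


def horizontal_flip(image):
    """Mirror each row left to right."""
    return [row[::-1] for row in image]


def rotate90(image):
    """Rotate the image 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]


def random_crop(image, height, width, rng):
    """Cut a height x width window at a random position."""
    top = rng.randint(0, len(image) - height)
    left = rng.randint(0, len(image[0]) - width)
    return [row[left:left + width] for row in image[top:top + height]]


def random_swap(tokens, rng):
    """Swap two randomly chosen token positions."""
    tokens = list(tokens)
    if len(tokens) >= 2:
        i, j = rng.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return tokens


def random_deletion(tokens, p, rng):
    """Drop each token with probability p, keeping at least one token."""
    if not tokens:
        return []
    kept = [t for t in tokens if rng.random() >= p]
    return kept if kept else [rng.choice(tokens)]


def random_insertion(tokens, vocabulary, rng):
    """Insert one word from a (hypothetical) vocabulary at a random position."""
    tokens = list(tokens)
    tokens.insert(rng.randint(0, len(tokens)), rng.choice(vocabulary))
    return tokens


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    tokens = "data augmentation reduces overfitting".split()
    print(horizontal_flip(image))
    print(rotate90(image))
    print(random_crop(image, 2, 2, rng))
    print(random_swap(tokens, rng))
    print(random_deletion(tokens, 0.3, rng))
    print(random_insertion(tokens, ["often", "usually"], rng))
```

Each function returns a new object and leaves its input untouched, so the original example and its augmented variants can sit side by side in the training set with the same label.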


(Image credit: Albumentations)

Papers

Showing 351-400 of 8378 papers

| Title | Status | Hype |
| --- | --- | --- |
| Unleashing the Power of Graph Data Augmentation on Covariate Distribution Shift | Code | 1 |
| Unsupervised Sketch-to-Photo Synthesis | Code | 1 |
| Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation | Code | 1 |
| Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization | Code | 1 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Code | 1 |
| A Robust Real-Time Automatic License Plate Recognition Based on the YOLO Detector | Code | 1 |
| IRNet: Iterative Refinement Network for Noisy Partial Label Learning | Code | 1 |
| Copula-based synthetic data augmentation for machine-learning emulators | Code | 1 |
| AcroFOD: An Adaptive Method for Cross-domain Few-shot Object Detection | Code | 1 |
| Adversarial Semantic Data Augmentation for Human Pose Estimation | Code | 1 |
| ACTION: Augmentation and Computation Toolbox for Brain Network Analysis with Functional MRI | Code | 1 |
| Artificial Pupil Dilation for Data Augmentation in Iris Semantic Segmentation | Code | 1 |
| AASAE: Augmentation-Augmented Stochastic Autoencoders | Code | 1 |
| Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation | Code | 1 |
| Arrhythmia Classification using CGAN-augmented ECG Signals | Code | 1 |
| Counterfactual Data Augmentation using Locally Factored Dynamics | Code | 1 |
| Continuous Language Generative Flow | Code | 1 |
| Adversarial Vertex Mixup: Toward Better Adversarially Robust Generalization | Code | 1 |
| Cross-Domain Adaptive Teacher for Object Detection | Code | 1 |
| Cross-head mutual Mean-Teaching for semi-supervised medical image segmentation | Code | 1 |
| A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation | Code | 1 |
| A Semi-supervised Learning Approach with Two Teachers to Improve Breakdown Identification in Dialogues | Code | 1 |
| AdvST: Revisiting Data Augmentations for Single Domain Generalization | Code | 1 |
| AEDA: An Easier Data Augmentation Technique for Text Classification | Code | 1 |
| A Simple Semi-Supervised Learning Framework for Object Detection | Code | 1 |
| A Simple Recipe for Language-guided Domain Generalized Segmentation | Code | 1 |
| AdaAug: Learning Class- and Instance-adaptive Data Augmentation Policies | Code | 1 |
| Aerial Imagery Pixel-level Segmentation | Code | 1 |
| Aspect-Controlled Neural Argument Generation | Code | 1 |
| Confident Sinkhorn Allocation for Pseudo-Labeling | Code | 1 |
| AESOP: Paraphrase Generation with Adaptive Syntactic Control | Code | 1 |
| Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes | Code | 1 |
| A Study on Transferability of Deep Learning Models for Network Intrusion Detection | Code | 1 |
| A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval | Code | 1 |
| Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems | Code | 1 |
| A Survey of Label-Efficient Deep Learning for 3D Point Clouds | Code | 1 |
| Conformal Prediction with Missing Values | Code | 1 |
| A Survey of World Models for Autonomous Driving | Code | 1 |
| Concatenated Masked Autoencoders as Spatial-Temporal Learner | Code | 1 |
| A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent Attention | Code | 1 |
| ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep Learning-Based Electrocardiogram Digitization | Code | 1 |
| DAGAD: Data Augmentation for Graph Anomaly Detection | Code | 1 |
| Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences | Code | 1 |
| A Fourier-based Framework for Domain Generalization | Code | 1 |
| A systematic approach to deep learning-based nodule detection in chest radiographs | Code | 1 |
| Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning | Code | 1 |
| Consistency Regularization for Adversarial Robustness | Code | 1 |
| AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports | Code | 1 |
| A Two-Stage Approach to Device-Robust Acoustic Scene Classification | Code | 1 |
| Astroformer: More Data Might not be all you need for Classification | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DeiT-B (+MixPro) | Accuracy (%) | 82.9 | | Unverified |
| 2 | ResNet-200 (DeepAA) | Accuracy (%) | 81.32 | | Unverified |
| 3 | DeiT-S (+MixPro) | Accuracy (%) | 81.3 | | Unverified |
| 4 | ResNet-200 (Fast AA) | Accuracy (%) | 80.6 | | Unverified |
| 5 | ResNet-200 (UA) | Accuracy (%) | 80.4 | | Unverified |
| 6 | ResNet-200 (AA) | Accuracy (%) | 80 | | Unverified |
| 7 | ResNet-50 (DeepAA) | Accuracy (%) | 78.3 | | Unverified |
| 8 | ResNet-50 (TA wide) | Accuracy (%) | 78.07 | | Unverified |
| 9 | ResNet-50 (LoRot-E) | Accuracy (%) | 77.72 | | Unverified |
| 10 | ResNet-50 (LoRot-I) | Accuracy (%) | 77.71 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | WideResNet-40-2 (Faster AA) | Percentage error | 3.7 | | Unverified |
| 2 | Shake-Shake (26 2×32d) (Faster AA) | Percentage error | 2.7 | | Unverified |
| 3 | WideResNet-28-10 (Faster AA) | Percentage error | 2.6 | | Unverified |
| 4 | Shake-Shake (26 2×112d) (Faster AA) | Percentage error | 2 | | Unverified |
| 5 | Shake-Shake (26 2×96d) (Faster AA) | Percentage error | 2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DiffAug | Classification Accuracy | 92.7 | | Unverified |
| 2 | PaCMAP | Classification Accuracy | 85.3 | | Unverified |
| 3 | hNNE | Classification Accuracy | 77.4 | | Unverified |
| 4 | TopoAE | Classification Accuracy | 74.6 | | Unverified |