SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 14011450 of 8378 papers

TitleStatusHype
MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error CorrectionCode1
MixGen: A New Multi-Modal Data AugmentationCode1
Compositional Generalization for Multi-label Text Classification: A Data-Augmentation ApproachCode1
MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision TransformerCode1
MixSKD: Self-Knowledge Distillation from Mixup for Image RecognitionCode1
MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text ClassificationCode1
Cross-modulated Few-shot Image Generation for Colorectal Tissue ClassificationCode1
CST5: Data Augmentation for Code-Switched Semantic ParsingCode1
AutoCLINT: The Winning Method in AutoCV Challenge 2019Code1
MODALS: Modality-agnostic Automated Data Augmentation in the Latent SpaceCode1
AutoBalance: Optimized Loss Functions for Imbalanced DataCode1
Model Patching: Closing the Subgroup Performance Gap with Data AugmentationCode1
Cross-modality Data Augmentation for End-to-End Sign Language TranslationCode1
Monkeypox Image Data collectionCode1
CUDA: Curriculum of Data Augmentation for Long-Tailed RecognitionCode1
CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate SpeechCode1
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance SegmentationCode1
CVAE-GAN: Fine-Grained Image Generation through Asymmetric TrainingCode1
MotionAug: Augmentation with Physical Correction for Human Motion PredictionCode1
Motion-Focused Contrastive Learning of Video RepresentationsCode1
A Probabilistic Framework for Knowledge Graph Data AugmentationCode1
Motion Robust High-Speed Light-Weighted Object Detection With Event CameraCode1
Capturing complex hand movements and object interactions using machine learning-powered stretchable smart textile glovesCode1
AADG: Automatic Augmentation for Domain Generalization on Retinal Image SegmentationCode1
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation PerspectiveCode1
Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography: The AortaSeg24 ChallengeCode1
Multi-modal Conditional Bounding Box Regression for Music Score FollowingCode1
Multimodal Data Augmentation for Visual-Infrared Person ReID with Corrupted DataCode1
Cross-Domain Feature Augmentation for Domain GeneralizationCode1
Cross-Domain Adaptive Teacher for Object DetectionCode1
CAR-MFL: Cross-Modal Augmentation by Retrieval for Multimodal Federated Learning with Missing ModalitiesCode1
CarveMix: A Simple Data Augmentation Method for Brain Lesion SegmentationCode1
Enhancing Text-based Knowledge Graph Completion with Zero-Shot Large Language Models: A Focus on Semantic EnhancementCode1
Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation)Code1
MUM: Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object DetectionCode1
Cascaded deep monocular 3D human pose estimation with evolutionary training dataCode1
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup StrategiesCode1
Cross-domain Compositing with Pretrained Diffusion ModelsCode1
AcroFOD: An Adaptive Method for Cross-domain Few-shot Object DetectionCode1
Conditioned Text Generation with Transfer for Closed-Domain Dialogue SystemsCode1
Cross-head mutual Mean-Teaching for semi-supervised medical image segmentationCode1
Natural Adversarial ExamplesCode1
Adversarial Semantic Data Augmentation for Human Pose EstimationCode1
Negative Data AugmentationCode1
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
Counterfactual Data Augmentation for Neural Machine TranslationCode1
ACTION: Augmentation and Computation Toolbox for Brain Network Analysis with Functional MRICode1
Causal Action Influence Aware Counterfactual Data AugmentationCode1
Neural Topic Modeling with Continual Lifelong LearningCode1
Learning from Counterfactual Links for Link PredictionCode1
Show:102550
← PrevPage 29 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified