SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 801850 of 8378 papers

TitleStatusHype
Coreference Resolution as Query-based Span PredictionCode1
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversionCode1
An augmentation strategy to mimic multi-scanner variability in MRICode1
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual ExamplesCode1
Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from PixelsCode1
Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced DataCode1
Adapting pretrained speech model for Mandarin lyrics transcription and alignmentCode1
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language NavigationCode1
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
Aspect-Controlled Neural Argument GenerationCode1
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augmentation TechniquesCode1
I'm Me, We're Us, and I'm Us: Tri-directional Contrastive Learning on HypergraphsCode1
BSUV-Net: A Fully-Convolutional Neural Network for Background Subtraction of Unseen VideosCode1
Learning from Counterfactual Links for Link PredictionCode1
3D MRI Synthesis with Slice-Based Latent Diffusion Models: Improving Tumor Segmentation Tasks in Data-Scarce RegimesCode1
Improved Contrastive Divergence Training of Energy Based ModelsCode1
AutoBalance: Optimized Loss Functions for Imbalanced DataCode1
Improved Probabilistic Image-Text RepresentationsCode1
Improving BERT Model Using Contrastive Learning for Biomedical Relation ExtractionCode1
Enhancing Text-based Knowledge Graph Completion with Zero-Shot Large Language Models: A Focus on Semantic EnhancementCode1
Improving Compositional Generalization with Latent Structure and Data AugmentationCode1
Improving Contrastive Learning by Visualizing Feature TransformationCode1
Cross-domain Compositing with Pretrained Diffusion ModelsCode1
Cross-Domain Adaptive Teacher for Object DetectionCode1
Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect DetectionCode1
Cross-Domain Feature Augmentation for Domain GeneralizationCode1
Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce ClassificationCode1
Cross-head mutual Mean-Teaching for semi-supervised medical image segmentationCode1
Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion RecognitionCode1
Improving fairness for spoken language understanding in atypical speech with Text-to-SpeechCode1
scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell DataCode1
Astroformer: More Data Might not be all you need for ClassificationCode1
Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text GuidanceCode1
DAGAD: Data Augmentation for Graph Anomaly DetectionCode1
CultureLLM: Incorporating Cultural Differences into Large Language ModelsCode1
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
CUDA: Curriculum of Data Augmentation for Long-Tailed RecognitionCode1
CST5: Data Augmentation for Code-Switched Semantic ParsingCode1
CutDepth:Edge-aware Data Augmentation in Depth EstimationCode1
CVAE-GAN: Fine-Grained Image Generation through Asymmetric TrainingCode1
CutPaste: Self-Supervised Learning for Anomaly Detection and LocalizationCode1
CutMIB: Boosting Light Field Super-Resolution via Multi-View Image BlendingCode1
CyCNN: A Rotation Invariant CNN using Polar Mapping and Cylindrical Convolution LayersCode1
A Study on Transferability of Deep Learning Models for Network Intrusion DetectionCode1
D3A-TS: Denoising-Driven Data Augmentation in Time SeriesCode1
DAC-MR: Data Augmentation Consistency Based Meta-Regularization for Meta-LearningCode1
DIG-MILP: a Deep Instance Generator for Mixed-Integer Linear Programming with Feasibility GuaranteeCode1
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance ScalingCode1
Source Code Data Augmentation for Deep Learning: A SurveyCode1
Disentangled Representations for Domain-generalized Cardiac SegmentationCode1
Show:102550
← PrevPage 17 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified