SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 22012250 of 8378 papers

TitleStatusHype
Beyond the Known: Enhancing Open Set Domain Adaptation with Unknown ExplorationCode0
Data-Driven Self-Supervised Graph Representation LearningCode0
3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement0
Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework0
WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models0
SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults0
Revisiting In-Context Learning with Long Context Language Models0
Enhancing Contrastive Learning Inspired by the Philosophy of "The Blind Men and the Elephant"Code0
Enhancing Nighttime Vehicle Detection with Day-to-Night Style Transfer and Labeling-Free Augmentation0
FairDD: Enhancing Fairness with domain-incremental learning in dermatological disease diagnosis0
Autonomous Crack Detection using Deep Learning on Synthetic Thermogram Datasets0
Automated Classification of Cybercrime Complaints using Transformer-based Language Models for Hinglish Texts0
Function Space Diversity for Uncertainty Prediction via Repulsive Last-Layer Ensembles0
EnhancePPG: Improving PPG-based Heart Rate Estimation with Self-Supervision and Augmentation0
Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy0
A Deep Probabilistic Framework for Continuous Time Dynamic Graph GenerationCode0
Promptable Representation Distribution Learning and Data Augmentation for Gigapixel Histopathology WSI AnalysisCode0
Enhancing Masked Time-Series Modeling via Dropping PatchesCode0
Head and Neck Tumor Segmentation of MRI from Pre- and Mid-radiotherapy with Pre-training, Data Augmentation and Dual Flow UNetCode0
Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data0
Enhancing Diffusion Models for High-Quality Image Generation0
VaeDiff-DocRE: End-to-end Data Augmentation Framework for Document-level Relation ExtractionCode0
GenX: Mastering Code and Test Generation with Execution Feedback0
Retrieval Augmented Image Harmonization0
Solid-SQL: Enhanced Schema-linking based In-context Learning for Robust Text-to-SQL0
Synthetic Time Series Data Generation for Healthcare Applications: A PCG Case Study0
3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation0
MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning0
SE-GCL: An Event-Based Simple and Effective Graph Contrastive Learning for Text Representation0
CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector0
Sound Classification of Four Insect Classes0
Understanding and Mitigating Memorization in Diffusion Models for Tabular Data0
Facial Surgery Preview Based on the Orthognathic Treatment Prediction0
SegHeD+: Segmentation of Heterogeneous Data for Multiple Sclerosis Lesions with Anatomical Constraints and Lesion-aware Augmentation0
Fully Test-time Adaptation for Tabular Data0
APAR: Modeling Irregular Target Functions in Tabular Regression via Arithmetic-Aware Pre-Training and Adaptive-Regularized Fine-TuningCode0
Deep Learning Models for Colloidal Nanocrystal SynthesisCode0
Enhancement of text recognition for hanja handwritten documents of Ancient Korea0
Who's the (Multi-)Fairest of Them All: Rethinking Interpolation-Based Data Augmentation Through the Lens of MulticalibrationCode0
One Node One Model: Featuring the Missing-Half for Graph ClusteringCode0
AMuSeD: An Attentive Deep Neural Network for Multimodal Sarcasm Detection Incorporating Bi-modal Data Augmentation0
Exemplar Masking for Multimodal Incremental LearningCode0
Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction0
First Train to Generate, then Generate to Train: UnitedSynT5 for Few-Shot NLI0
PolyIPA -- Multilingual Phoneme-to-Grapheme Conversion Model0
Comparative Analysis of Mel-Frequency Cepstral Coefficients and Wavelet Based Audio Signal Processing for Emotion Detection and Mental Health Assessment in Spoken Speech0
Residual Channel Boosts Contrastive Learning for Radio Frequency Fingerprint Identification0
DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation0
NLPineers@ NLU of Devanagari Script Languages 2025: Hate Speech Detection using Ensembling of BERT-based modelsCode0
AGMixup: Adaptive Graph Mixup for Semi-supervised Node ClassificationCode0
Show:102550
← PrevPage 45 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified