SOTAVerified

Data Augmentation

Data augmentation covers techniques that increase the amount of training data by applying modifications to existing examples, which expands the number of examples in the original dataset. It not only grows the dataset but also increases its diversity. When training machine learning models, data augmentation acts as a regularizer and helps reduce overfitting.

Data augmentation techniques have proven useful in domains such as NLP and computer vision. In computer vision, common transformations include cropping, flipping, and rotation. In NLP, techniques include swapping, deletion, and random insertion, among others.
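The transformations named above can be sketched in a few lines of standard-library Python. This is only an illustrative sketch, not the implementation of any particular library: images are assumed to be plain lists of pixel rows, text is assumed to be a list of tokens, and all function names (`hflip`, `rotate90`, `crop`, `random_swap`, `random_deletion`, `random_insertion`) as well as the `vocab` argument are hypothetical names introduced here for illustration.

```python
import random

# --- Image-style augmentations (image = list of rows of pixel values) ---

def hflip(img):
    """Mirror the image left to right."""
    return [row[::-1] for row in img]

def rotate90(img):
    """Rotate the image 90 degrees clockwise."""
    return [list(col) for col in zip(*img[::-1])]

def crop(img, top, left, height, width):
    """Cut out a height x width window whose top-left corner is (top, left)."""
    return [row[left:left + width] for row in img[top:top + height]]

# --- Text-style augmentations (text = list of tokens) ---

def random_swap(tokens, rng):
    """Swap two randomly chosen positions; the multiset of tokens is unchanged."""
    out = list(tokens)
    if len(out) >= 2:
        i, j = rng.sample(range(len(out)), 2)
        out[i], out[j] = out[j], out[i]
    return out

def random_deletion(tokens, p, rng):
    """Drop each token independently with probability p, keeping at least one."""
    if not tokens:
        return []
    kept = [t for t in tokens if rng.random() >= p]
    return kept or [rng.choice(tokens)]

def random_insertion(tokens, vocab, rng):
    """Insert one word drawn from vocab (an assumed word list) at a random position."""
    out = list(tokens)
    out.insert(rng.randrange(len(out) + 1), rng.choice(vocab))
    return out

# Small demonstration with a fixed seed so runs are repeatable.
rng = random.Random(0)
image = [[1, 2], [3, 4]]
sentence = "data augmentation increases dataset diversity".split()

print(hflip(image))     # [[2, 1], [4, 3]]
print(rotate90(image))  # [[3, 1], [4, 2]]
print(random_swap(sentence, rng))
print(random_deletion(sentence, 0.2, rng))
print(random_insertion(sentence, ["more", "extra"], rng))
```

Each function returns a new object and leaves its input untouched, so the same original example can be augmented several times to produce different variants. In practice, insertion-style text augmentation often draws the inserted word from synonyms of words already in the sentence; the fixed `vocab` list here is a simplification.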

Papers

Showing 150 of 8378 papers

| Title | Status | Hype |
| --- | --- | --- |
| Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images | | 0 |
| Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management | | 0 |
| Similarity-Guided Diffusion for Contrastive Sequential Recommendation | | 0 |
| Data Augmentation in Time Series Forecasting through Inverted Framework | | 0 |
| Iceberg: Enhancing HLS Modeling with Synthetic Data | Code | 0 |
| AI-Enhanced Pediatric Pneumonia Detection: A CNN-Based Approach Using Data Augmentation and Generative Adversarial Networks (GANs) | Code | 0 |
| FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation | | 0 |
| Advancing Offline Handwritten Text Recognition: A Systematic Review of Data Augmentation and Generation Techniques | | 0 |
| Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis | | 0 |
| PSAT: Pediatric Segmentation Approaches via Adult Augmentations and Transfer Learning | Code | 0 |
| DS@GT at CheckThat! 2025: Detecting Subjectivity via Transfer-Learning and Corrective Data Augmentation | Code | 0 |
| Semantic Certainty Assessment in Vector Retrieval Systems: A Novel Framework for Embedding Quality Evaluation | | 0 |
| SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor Variations | Code | 0 |
| TigAug: Data Augmentation for Testing Traffic Light Detection in Autonomous Driving Systems | | 0 |
| Evolution without Large Models: Training Language Model with Task Principles | | 0 |
| Piggyback Camera: Easy-to-Deploy Visual Surveillance by Mobile Sensing on Commercial Robot Vacuums | | 0 |
| DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy | Code | 1 |
| Diffusion Model-based Data Augmentation Method for Fetal Head Ultrasound Segmentation | | 0 |
| Unlasting: Unpaired Single-Cell Multi-Perturbation Estimation by Dual Conditional Diffusion Implicit Bridges | | 0 |
| Robust Deep Learning for Myocardial Scar Segmentation in Cardiac MRI with Noisy Labels | Code | 0 |
| HybridQ: Hybrid Classical-Quantum Generative Adversarial Network for Skin Disease Image Generation | | 0 |
| Enhancing Ambiguous Dynamic Facial Expression Recognition with Soft Label-based Data Augmentation | | 0 |
| RAG-VisualRec: An Open Resource for Vision- and Text-Enhanced Retrieval-Augmented Generation in Recommendation | Code | 0 |
| Leveraging AI Graders for Missing Score Imputation to Achieve Accurate Ability Estimation in Constructed-Response Tests | | 0 |
| Industrial Energy Disaggregation with Digital Twin-generated Dataset and Efficient Data Augmentation | Code | 0 |
| Cross-regularization: Adaptive Model Complexity through Validation Gradients | | 0 |
| Machine-Learning-Assisted Photonic Device Development: A Multiscale Approach from Theory to Characterization | | 0 |
| HARPT: A Corpus for Analyzing Consumers' Trust and Privacy Concerns in Mobile Health Apps | | 0 |
| On the Robustness of Human-Object Interaction Detection against Distribution Shift | | 0 |
| Dynamic Temporal Positional Encodings for Early Intrusion Detection in IoT | | 0 |
| DRO-Augment Framework: Robustness by Synergizing Wasserstein Distributionally Robust Optimization and Data Augmentation | | 0 |
| Robust Training with Data Augmentation for Medical Imaging Classification | | 0 |
| CopulaSMOTE: A Copula-Based Oversampling Approach for Imbalanced Classification in Diabetes Prediction | | 0 |
| Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition | | 0 |
| Model compression using knowledge distillation with integrated gradients | | 0 |
| SCISSOR: Mitigating Semantic Bias through Cluster-Aware Siamese Networks for Robust Classification | Code | 0 |
| orGAN: A Synthetic Data Augmentation Pipeline for Simultaneous Generation of Surgical Images and Ground Truth Labels | | 0 |
| Explainable Detection of Implicit Influential Patterns in Conversations via Data Augmentation | | 0 |
| Synthetic Data Augmentation for Table Detection: Re-evaluating TableNet's Performance with Automatically Generated Document Images | | 0 |
| Adaptive Data Augmentation for Thompson Sampling | | 0 |
| The Perception of Phase Intercept Distortion and its Application in Data Augmentation | | 0 |
| Exploring Non-contrastive Self-supervised Representation Learning for Image-based Profiling | | 0 |
| Compositional Attribute Imbalance in Vision Datasets | | 0 |
| BlastDiffusion: A Latent Diffusion Model for Generating Synthetic Embryo Images to Address Data Scarcity in In Vitro Fertilization | | 0 |
| PRO: Projection Domain Synthesis for CT Imaging | Code | 0 |
| Understanding Learning Invariance in Deep Linear Networks | | 0 |
| MultiViT2: A Data-augmented Multimodal Neuroimaging Prediction Framework via Latent Diffusion Model | | 0 |
| Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis | | 0 |
| SAGDA: Open-Source Synthetic Agriculture Data for Africa | Code | 0 |
| Graph-Convolutional-Beta-VAE for Synthetic Abdominal Aorta Aneurysm Generation | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DeiT-B (+MixPro) | Accuracy (%) | 82.9 | | Unverified |
| 2 | ResNet-200 (DeepAA) | Accuracy (%) | 81.32 | | Unverified |
| 3 | DeiT-S (+MixPro) | Accuracy (%) | 81.3 | | Unverified |
| 4 | ResNet-200 (Fast AA) | Accuracy (%) | 80.6 | | Unverified |
| 5 | ResNet-200 (UA) | Accuracy (%) | 80.4 | | Unverified |
| 6 | ResNet-200 (AA) | Accuracy (%) | 80 | | Unverified |
| 7 | ResNet-50 (DeepAA) | Accuracy (%) | 78.3 | | Unverified |
| 8 | ResNet-50 (TA wide) | Accuracy (%) | 78.07 | | Unverified |
| 9 | ResNet-50 (LoRot-E) | Accuracy (%) | 77.72 | | Unverified |
| 10 | ResNet-50 (LoRot-I) | Accuracy (%) | 77.71 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | WideResNet-40-2 (Faster AA) | Percentage error | 3.7 | | Unverified |
| 2 | Shake-Shake (26 2×32d) (Faster AA) | Percentage error | 2.7 | | Unverified |
| 3 | WideResNet-28-10 (Faster AA) | Percentage error | 2.6 | | Unverified |
| 4 | Shake-Shake (26 2×112d) (Faster AA) | Percentage error | 2 | | Unverified |
| 5 | Shake-Shake (26 2×96d) (Faster AA) | Percentage error | 2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DiffAug | Classification Accuracy | 92.7 | | Unverified |
| 2 | PaCMAP | Classification Accuracy | 85.3 | | Unverified |
| 3 | hNNE | Classification Accuracy | 77.4 | | Unverified |
| 4 | TopoAE | Classification Accuracy | 74.6 | | Unverified |