SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 27012750 of 8378 papers

TitleStatusHype
NegatER: Unsupervised Discovery of Negatives in Commonsense Knowledge BasesCode0
Generating Images of the M87* Black Hole Using GANsCode0
An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment ClassificationCode0
DPN-SENet:A self-attention mechanism neural network for detection and diagnosis of COVID-19 from chest x-ray imagesCode0
Generating Synthetic Data for Text RecognitionCode0
MaSkel: A Model for Human Whole-body X-rays Generation from Human Masking ImagesCode0
Balanced Split: A new train-test data splitting strategy for imbalanced datasetsCode0
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RLCode0
Generated Graph DetectionCode0
Generate then Refine: Data Augmentation for Zero-shot Intent DetectionCode0
Generating Synthetic Speech from SpokenVocab for Speech TranslationCode0
Generalizing Few-Shot Named Entity Recognizers to Unseen Domains with Type-Related FeaturesCode0
Generalizing Conversational Dense Retrieval via LLM-Cognition Data AugmentationCode0
Generalizing to Unseen Domains via Adversarial Data AugmentationCode0
Generalize Polyp Segmentation via Inpainting across Diverse Backgrounds and Pseudo-Mask RefinementCode0
Mitigating Data Redundancy to Revitalize Transformer-based Long-Term Time Series Forecasting SystemCode0
MediAug: Exploring Visual Augmentation in Medical ImagingCode0
Medical Image Segmentation Using Deep Learning: A SurveyCode0
Generalizing Across Domains via Cross-Gradient TrainingCode0
General-to-Detailed GAN for Infrequent Class Medical ImagesCode0
Generation of Artificial CT Images using Patch-based Conditional Generative Adversarial NetworksCode0
DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation BenchmarkCode0
DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic SurgeryCode0
ANDA: A Novel Data Augmentation Technique Applied to Salient Object DetectionCode0
Bag of Tricks for In-Distribution Calibration of Pretrained TransformersCode0
Meta-learning and Data Augmentation for Stress Testing Forecasting ModelsCode0
GAT: Guided Adversarial Training with Pareto-optimal Auxiliary TasksCode0
Gaussian Blur and Relative Edge ResponseCode0
GANsemble for Small and Imbalanced Data Sets: A Baseline for Synthetic Microplastics DataCode0
Bad Global Minima Exist and SGD Can Reach ThemCode0
GANkyoku: a Generative Adversarial Network for Shakuhachi MusicCode0
DS@GT at CheckThat! 2025: Detecting Subjectivity via Transfer-Learning and Corrective Data AugmentationCode0
A CNN-based tool for automatic tongue contour tracking in ultrasound imagesCode0
Anchor Data AugmentationCode0
GAN-GA: A Generative Model based on Genetic Algorithm for Medical Image GenerationCode0
DTW-Merge: A Novel Data Augmentation Technique for Time Series ClassificationCode0
Practical X-ray Gastric Cancer Diagnostic Support Using Refined Stochastic Data Augmentation and Hard Boundary Box TrainingCode0
A Parameterized Generative Adversarial Network Using Cyclic Projection for Explainable Medical Image ClassificationCode0
Gender-Inclusive Grammatical Error Correction through AugmentationCode0
Dataset Condensation with Differentiable Siamese AugmentationCode0
Back to the Roots: Reconstructing Large and Complex Cranial Defects using an Image-based Statistical Shape ModelCode0
G2GT: Retrosynthesis Prediction with Graph to Graph Attention Neural Network and Self-TrainingCode0
Back-to-Bones: Rediscovering the Role of Backbones in Domain GeneralizationCode0
GaitASMS: Gait Recognition by Adaptive Structured Spatial Representation and Multi-Scale Temporal AggregationCode0
Fusion for Visual-Infrared Person ReID in Real-World Surveillance Using Corrupted Multimodal DataCode0
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentationCode0
Fuzzy Cluster-Aware Contrastive Clustering for Time SeriesCode0
DualMatch: Robust Semi-Supervised Learning with Dual-Level InteractionCode0
Galaxy Spin Classification I: Z-wise vs S-wise Spirals With Chirality Equivariant Residual NetworkCode0
Fully Automatic and Real-Time Catheter Segmentation in X-Ray FluoroscopyCode0
Show:102550
← PrevPage 55 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified