SOTAVerified

Data Augmentation

Data augmentation refers to techniques that enlarge a dataset by applying modifications to existing examples to create new ones. Beyond growing the dataset, it also increases the dataset's diversity. When training machine learning models, data augmentation acts as a regularizer and helps reduce overfitting.

Data augmentation techniques have proven useful in domains such as NLP and computer vision. In computer vision, common transformations include cropping, flipping, and rotation. In NLP, techniques include swapping, deletion, and random insertion, among others.
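
The transformations named above can be illustrated with a minimal sketch in plain Python. This is not the API of any particular library: the function names, the nested-list image representation, and the `vocabulary` argument used for random insertion are all assumptions made for illustration. Production pipelines would normally rely on a dedicated augmentation library instead.

```python
import random

# --- NLP-style augmentations on a list of tokens (hypothetical helpers) ---

def random_swap(tokens, rng):
    """Swap two randomly chosen positions (no-op for fewer than 2 tokens)."""
    out = list(tokens)
    if len(out) >= 2:
        i, j = rng.sample(range(len(out)), 2)
        out[i], out[j] = out[j], out[i]
    return out

def random_deletion(tokens, rng, p=0.2):
    """Drop each token with probability p, keeping at least one token."""
    if not tokens:
        return []
    kept = [t for t in tokens if rng.random() >= p]
    return kept if kept else [rng.choice(tokens)]

def random_insertion(tokens, rng, vocabulary):
    """Insert one token drawn from `vocabulary` (an assumed word list)
    at a random position."""
    out = list(tokens)
    out.insert(rng.randint(0, len(out)), rng.choice(vocabulary))
    return out

# --- Vision-style augmentations on an image stored as a list of rows ---

def horizontal_flip(image):
    """Mirror the image left to right."""
    return [row[::-1] for row in image]

def rotate90_clockwise(image):
    """Rotate the image by 90 degrees clockwise."""
    return [list(col) for col in zip(*image[::-1])]

def random_crop(image, height, width, rng):
    """Cut out a height x width window at a random valid offset."""
    top = rng.randint(0, len(image) - height)
    left = rng.randint(0, len(image[0]) - width)
    return [row[left:left + width] for row in image[top:top + height]]

if __name__ == "__main__":
    rng = random.Random(0)  # seeded so runs are repeatable
    tokens = "data augmentation acts as a regularizer".split()
    print(random_swap(tokens, rng))
    print(random_deletion(tokens, rng))
    print(random_insertion(tokens, rng, vocabulary=["often", "simple"]))

    image = [[1, 2, 3], [4, 5, 6]]
    print(horizontal_flip(image))     # [[3, 2, 1], [6, 5, 4]]
    print(rotate90_clockwise(image))  # [[4, 1], [5, 2], [6, 3]]
    print(random_crop(image, 1, 2, rng))
```

Each function returns a new object and leaves its input unchanged, so several augmented variants can be generated from the same original example.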

(Image credit: Albumentations)

Papers

Showing 1651–1700 of 8378 papers

| Title | Status | Hype |
| --- | --- | --- |
| ScoreMix: Improving Face Recognition via Score Composition in Diffusion Generators | | 0 |
| CINeMA: Conditional Implicit Neural Multi-Modal Atlas for a Spatio-Temporal Representation of the Perinatal Brain | Code | 0 |
| GFRIEND: Generative Few-shot Reward Inference through EfficieNt DPO | Code | 0 |
| Data Augmentation For Small Object using Fast AutoAugment | | 0 |
| MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning | Code | 0 |
| SSS: Semi-Supervised SAM-2 with Efficient Prompting for Medical Imaging Segmentation | Code | 0 |
| Spatiotemporal deep learning models for detection of rapid intensification in cyclones | | 0 |
| Data-Efficient Challenges in Visual Inductive Priors: A Retrospective | | 0 |
| Learning to Hear Broken Motors: Signature-Guided Data Augmentation for Induction-Motor Diagnostics | | 0 |
| An Explainable Deep Learning Framework for Brain Stroke and Tumor Progression via MRI Interpretation | | 0 |
| SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research | | 0 |
| Heavy Lasso: sparse penalized regression under heavy-tailed noise via data-augmented soft-thresholding | Code | 0 |
| Scaling Human Activity Recognition: A Comparative Evaluation of Synthetic Data Generation and Augmentation Techniques | | 0 |
| DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO | | 0 |
| Dealing with the Evil Twins: Improving Random Augmentation by Addressing Catastrophic Forgetting of Diverse Augmentations | | 0 |
| Deep Inertial Pose: A deep learning approach for human pose estimation | | 0 |
| Securing Traffic Sign Recognition Systems in Autonomous Vehicles | | 0 |
| Robust sensor fusion against on-vehicle sensor staleness | | 0 |
| Model-based Neural Data Augmentation for sub-wavelength Radio Localization | | 0 |
| hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation | | 0 |
| Geometric and Physical Constraints Synergistically Enhance Neural PDE Surrogates | | 0 |
| LLM-based phoneme-to-grapheme for phoneme-based speech recognition | | 0 |
| PixCell: A generative foundation model for digital histopathology images | | 0 |
| IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation | | 0 |
| Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference Models | Code | 0 |
| Person Re-Identification System at Semantic Level based on Pedestrian Attributes Ontology | | 0 |
| Fine-Tuning Video Transformers for Word-Level Bangla Sign Language: A Comparative Analysis for Classification Tasks | | 0 |
| A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions | | 0 |
| MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching | | 0 |
| How Explanations Leak the Decision Logic: Stealing Graph Neural Networks via Explanation Alignment | Code | 0 |
| MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models | Code | 0 |
| Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness | | 0 |
| Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation | Code | 0 |
| 3D Skeleton-Based Action Recognition: A Review | | 0 |
| Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation | Code | 0 |
| SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds | | 0 |
| A Flat Minima Perspective on Understanding Augmentations and Model Robustness | | 0 |
| QGAN-based data augmentation for hybrid quantum-classical neural networks | | 0 |
| Leveraging Intermediate Features of Vision Transformer for Face Anti-Spoofing | | 0 |
| Boosting Automatic Exercise Evaluation Through Musculoskeletal Simulation-Based IMU Data Augmentation | | 0 |
| Lightweight Convolutional Neural Networks for Retinal Disease Classification | | 0 |
| Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC | | 0 |
| Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain Adaptation | Code | 0 |
| AMBER: Adaptive Mesh Generation by Iterative Mesh Resolution Prediction | Code | 0 |
| Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation | | 0 |
| Pseudo Multi-Source Domain Generalization: Bridging the Gap Between Single and Multi-Source Domain Generalization | Code | 0 |
| Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs | | 0 |
| PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization | | 0 |
| Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics | | 0 |
| Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DeiT-B (+MixPro) | Accuracy (%) | 82.9 | | Unverified |
| 2 | ResNet-200 (DeepAA) | Accuracy (%) | 81.32 | | Unverified |
| 3 | DeiT-S (+MixPro) | Accuracy (%) | 81.3 | | Unverified |
| 4 | ResNet-200 (Fast AA) | Accuracy (%) | 80.6 | | Unverified |
| 5 | ResNet-200 (UA) | Accuracy (%) | 80.4 | | Unverified |
| 6 | ResNet-200 (AA) | Accuracy (%) | 80 | | Unverified |
| 7 | ResNet-50 (DeepAA) | Accuracy (%) | 78.3 | | Unverified |
| 8 | ResNet-50 (TA wide) | Accuracy (%) | 78.07 | | Unverified |
| 9 | ResNet-50 (LoRot-E) | Accuracy (%) | 77.72 | | Unverified |
| 10 | ResNet-50 (LoRot-I) | Accuracy (%) | 77.71 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | WideResNet-40-2 (Faster AA) | Percentage error | 3.7 | | Unverified |
| 2 | Shake-Shake (26 2×32d) (Faster AA) | Percentage error | 2.7 | | Unverified |
| 3 | WideResNet-28-10 (Faster AA) | Percentage error | 2.6 | | Unverified |
| 4 | Shake-Shake (26 2×112d) (Faster AA) | Percentage error | 2 | | Unverified |
| 5 | Shake-Shake (26 2×96d) (Faster AA) | Percentage error | 2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DiffAug | Classification Accuracy | 92.7 | | Unverified |
| 2 | PaCMAP | Classification Accuracy | 85.3 | | Unverified |
| 3 | hNNE | Classification Accuracy | 77.4 | | Unverified |
| 4 | TopoAE | Classification Accuracy | 74.6 | | Unverified |