SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 501550 of 8378 papers

TitleStatusHype
APBench: A Unified Benchmark for Availability Poisoning Attacks and DefensesCode1
Generation of Realistic Synthetic Raw Radar Data for Automated Driving Applications using Generative Adversarial NetworksCode1
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup StrategiesCode1
LiDAR View Synthesis for Robust Vehicle Navigation Without Expert LabelsCode1
Pre-training Vision Transformers with Very Limited Synthesized ImagesCode1
MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive LearningCode1
HybridAugment++: Unified Frequency Spectra Perturbations for Model RobustnessCode1
LatentAugment: Data Augmentation via Guided Manipulation of GAN's Latent SpaceCode1
What do neural networks learn in image classification? A frequency shortcut perspectiveCode1
Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person RetrievalCode1
Domain Adaptation based Object Detection for Autonomous Driving in Foggy and Rainy WeatherCode1
AltFreezing for More General Video Face Forgery DetectionCode1
MixupExplainer: Generalizing Explanations for Graph Neural Networks with Data AugmentationCode1
Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial TransferabilityCode1
Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute ManipulationCode1
Generative Contrastive Graph Learning for RecommendationCode1
Distill-SODA: Distilling Self-Supervised Vision Transformer for Source-Free Open-Set Domain Adaptation in Computational PathologyCode1
Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio DataCode1
ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep Learning-Based Electrocardiogram DigitizationCode1
PIGNet2: A Versatile Deep Learning-based Protein-Ligand Interaction Prediction Model for Binding Affinity Scoring and Virtual ScreeningCode1
Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain GeneralizationCode1
Generative Data Augmentation for Aspect Sentiment Quad PredictionCode1
MedAugment: Universal Automatic Data Augmentation Plug-in for Medical Image AnalysisCode1
Uncovering the Limits of Machine Learning for Automatic Vulnerability DetectionCode1
Neural Bayes estimators for censored inference with peaks-over-threshold modelsCode1
Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion RecognitionCode1
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement LearningCode1
A systematic approach to deep learning-based nodule detection in chest radiographsCode1
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic ModelCode1
Improving Generalizability of Graph Anomaly Detection Models via Data AugmentationCode1
Time-aware Graph Structure Learning via Sequence Prediction on Temporal GraphsCode1
Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper CalibrationCode1
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RLCode1
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline DataCode1
Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!Code1
Conformal Prediction with Missing ValuesCode1
Improving Conversational Recommendation Systems via Counterfactual Data SimulationCode1
Graph Transformer for RecommendationCode1
Self Contrastive Learning for Session-based RecommendationCode1
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint ModelsCode1
Improving the Robustness of Summarization Systems with Dual AugmentationCode1
ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NERCode1
Geo-Tiles for Semantic Segmentation of Earth Observation ImageryCode1
Source Code Data Augmentation for Deep Learning: A SurveyCode1
A Survey of Label-Efficient Deep Learning for 3D Point CloudsCode1
A Shapelet-based Framework for Unsupervised Multivariate Time Series Representation LearningCode1
Improved Probabilistic Image-Text RepresentationsCode1
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-TuningCode1
PDE+: Enhancing Generalization via PDE with Adaptive Distributional DiffusionCode1
VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large ScaleCode1
Show:102550
← PrevPage 11 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified