SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 451500 of 8378 papers

TitleStatusHype
Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided DiffusionCode1
Copula-based synthetic data augmentation for machine-learning emulatorsCode1
ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and Multispectral Data FusionCode1
DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic SegmentationCode1
Directional Graph NetworksCode1
DISCO: Distilling Counterfactuals with Large Language ModelsCode1
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification ContextCode1
Discriminative Feature Alignment: Improving Transferability of Unsupervised Domain Adaptation by Gaussian-guided Latent AlignmentCode1
Better plain ViT baselines for ImageNet-1kCode1
A Light Recipe to Train Robust Vision TransformersCode1
Counterfactual Data Augmentation for Neural Machine TranslationCode1
Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust FinetuningCode1
Exploring Discontinuity for Video Frame InterpolationCode1
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement LearningCode1
Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper CalibrationCode1
Distilling Model Failures as Directions in Latent SpaceCode1
Cross-Domain Feature Augmentation for Domain GeneralizationCode1
BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature AnalysisCode1
AlignMixup: Improving Representations By Interpolating Aligned FeaturesCode1
Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment AnalysisCode1
CutPaste: Self-Supervised Learning for Anomaly Detection and LocalizationCode1
Data Augmentation of Wearable Sensor Data for Parkinson's Disease Monitoring using Convolutional Neural NetworksCode1
DLME: Deep Local-flatness Manifold EmbeddingCode1
DocXClassifier: High Performance Explainable Deep Network for Document Image ClassificationCode1
A Locality-based Neural Solver for Optical Motion CaptureCode1
Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh RecoveryCode1
Boosted Neural Decoders: Achieving Extreme Reliability of LDPC Codes for 6G NetworksCode1
Domain Generalization using Causal MatchingCode1
3D Common Corruptions and Data AugmentationCode1
Alternate Diverse Teaching for Semi-supervised Medical Image SegmentationCode1
BOOTPLACE: Bootstrapped Object Placement with Detection TransformersCode1
AltFreezing for More General Video Face Forgery DetectionCode1
CAiRE in DialDoc21: Data Augmentation for Information-Seeking Dialogue SystemCode1
DrugCLIP: Contrastive Protein-Molecule Representation Learning for Virtual ScreeningCode1
Controllable Data Augmentation Through Deep RelightingCode1
A Regularization-Guided Equivariant Approach for Image RestorationCode1
Controllable Dialogue Simulation with In-Context LearningCode1
Dual Contrastive Learning: Text Classification via Label-Aware Data AugmentationCode1
Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text GenerationCode1
3D U-Net: Learning Dense Volumetric Segmentation from Sparse AnnotationCode1
Controllable 3D Face Generation with Conditional Style Code DiffusionCode1
ECG arrhythmia classification using a 2-D convolutional neural networkCode1
Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence ModelingCode1
Break-It-Fix-It: Unsupervised Learning for Program RepairCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource LanguagesCode1
CADTransformer: Panoptic Symbol Spotting Transformer for CAD DrawingsCode1
EEG-Inception: An Accurate and Robust End-to-End Neural Network for EEG-based Motor Imagery ClassificationCode1
CONVERT:Contrastive Graph Clustering with Reliable AugmentationCode1
A real-time and high-precision method for small traffic-signs recognitionCode1
Show:102550
← PrevPage 10 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified