SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 9511000 of 8378 papers

TitleStatusHype
A Locality-based Neural Solver for Optical Motion CaptureCode1
DIAGen: Diverse Image Augmentation with Generative ModelsCode1
Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired DataCode1
DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific InformationCode1
DiffBatt: A Diffusion Model for Battery Degradation Prediction and SynthesisCode1
DID-M3D: Decoupling Instance Depth for Monocular 3D Object DetectionCode1
Differentiable Data Augmentation for Contrastive Sentence Representation LearningCode1
DiffKG: Knowledge Graph Diffusion Model for RecommendationCode1
3D Common Corruptions and Data AugmentationCode1
Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect DetectionCode1
Alternate Diverse Teaching for Semi-supervised Medical Image SegmentationCode1
BAGAN: Data Augmentation with Balancing GANCode1
Directed Graph Contrastive LearningCode1
DIG-MILP: a Deep Instance Generator for Mixed-Integer Linear Programming with Feasibility GuaranteeCode1
Anatomical Data Augmentation via Fluid-based Image RegistrationCode1
Direct Differentiable Augmentation SearchCode1
Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense EncodersCode1
Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapesCode1
Multi-Perspective Data Augmentation for Few-shot Object DetectionCode1
DISCO: Distilling Counterfactuals with Large Language ModelsCode1
AugmentedNet: A Roman Numeral Analysis Network with Synthetic Training Examples and Additional Tonal TasksCode1
Augmented Neural Fine-Tuning for Efficient Backdoor PurificationCode1
Discriminative Feature Alignment: Improving Transferability of Unsupervised Domain Adaptation by Gaussian-guided Latent AlignmentCode1
Multitask 3D CBCT-to-CT Translation and Organs-at-Risk Segmentation Using Physics-Based Data AugmentationCode1
MIXCODE: Enhancing Code Classification by Mixup-Based Data AugmentationCode1
State-of-the-Art Augmented NLP Transformer models for direct and single-step retrosynthesisCode1
Augmented Ultrasonic Data for Machine LearningCode1
Disentangled Representations for Domain-generalized Cardiac SegmentationCode1
DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question AnsweringCode1
Dissecting Image CropsCode1
Entailment as Few-Shot LearnerCode1
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup StrategiesCode1
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter OptimizationCode1
Augmenting Document Representations for Dense Retrieval with Interpolation and PerturbationCode1
NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture SearchCode1
NASViT: Neural Architecture Search for Efficient Vision Transformers with Gradient Conflict aware Supernet TrainingCode1
A Winning Hand: Compressing Deep Networks Can Improve Out-Of-Distribution RobustnessCode1
Distilling Model Failures as Directions in Latent SpaceCode1
Isotropy Maximization Loss and Entropic Score: Accurate, Fast, Efficient, Scalable, and Turnkey Neural Networks Out-of-Distribution Detection Based on The Principle of Maximum EntropyCode1
Nearest Neighbor-Based Contrastive Learning for Hyperspectral and LiDAR Data ClassificationCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
Diverse Data Augmentation with Diffusions for Effective Test-time Prompt TuningCode1
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource LanguagesCode1
Augmenting Sequential Recommendation with Balanced Relevance and DiversityCode1
Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint OptimizationCode1
DLME: Deep Local-flatness Manifold EmbeddingCode1
Augmenting the User-Item Graph with Textual Similarity ModelsCode1
DocXClassifier: High Performance Explainable Deep Network for Document Image ClassificationCode1
Do Generated Data Always Help Contrastive Learning?Code1
Empirical Evaluation of Data Augmentations for Biobehavioral Time Series Data with Deep LearningCode1
Show:102550
← PrevPage 20 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified