SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 75517600 of 8378 papers

TitleStatusHype
Missingness Augmentation: A General Approach for Improving Generative Imputation ModelsCode0
Data Augmentation for Imbalanced RegressionCode0
Strategies for Improving NL-to-FOL Translation with LLMs: Data Generation, Incremental Fine-Tuning, and VerificationCode0
Data Augmentation for Hypernymy DetectionCode0
Mitigating annotation shift in cancer classification using single image generative modelsCode0
Data Augmentation for Emotion Detection in Small Imbalanced Text DataCode0
Generalizing Across Domains via Cross-Gradient TrainingCode0
A Generative Model of Symmetry TransformationsCode0
Data Augmentation for Dementia Detection in Spoken LanguageCode0
Mitigating Data Scarcity for Large Language ModelsCode0
Data Augmentation for Conversational AICode0
Generalize Polyp Segmentation via Inpainting across Diverse Backgrounds and Pseudo-Mask RefinementCode0
Robust Asymmetric Heterogeneous Federated Learning with Corrupted ClientsCode0
Back-to-Bones: Rediscovering the Role of Backbones in Domain GeneralizationCode0
TopoMortar: A dataset to evaluate image segmentation methods focused on topology accuracyCode0
Robust Channel Learning for Large-Scale Radio Speaker VerificationCode0
Robust Classification by Coupling Data Mollification with Label SmoothingCode0
BackFlip: The Impact of Local and Global Data Augmentations on Artistic Image Aesthetic AssessmentCode0
Training Data Augmentation for Context-Sensitive Neural Lemmatization Using Inflection Tables and Raw TextCode0
Mitigating Data Redundancy to Revitalize Transformer-based Long-Term Time Series Forecasting SystemCode0
A Web-based Mpox Skin Lesion Detection System Using State-of-the-art Deep Learning Models Considering Racial DiversityCode0
Data Augmentation for Compositional Data: Advancing Predictive Models of the MicrobiomeCode0
Robust Deep Learning for Myocardial Scar Segmentation in Cardiac MRI with Noisy LabelsCode0
Structural Adversarial Objectives for Self-Supervised Representation LearningCode0
Structurally Diverse Sampling for Sample-Efficient Training and Comprehensive EvaluationCode0
Data Augmentation for Code Translation with Comparable Corpora and Multiple ReferencesCode0
RobustFair: Adversarial Evaluation through Fairness Confusion Directed Gradient SearchCode0
Gender-Inclusive Grammatical Error Correction through AugmentationCode0
Gaussian Blur and Relative Edge ResponseCode0
GAT: Guided Adversarial Training with Pareto-optimal Auxiliary TasksCode0
A Novel Driver Distraction Behavior Detection Method Based on Self-supervised Learning with Masked Image ModelingCode0
A Novel Data Augmentation Technique for Out-of-Distribution Sample Detection using Compounded CorruptionsCode0
Practical X-ray Gastric Cancer Diagnostic Support Using Refined Stochastic Data Augmentation and Hard Boundary Box TrainingCode0
Robust Heterogeneous Federated Learning under Data CorruptionCode0
Data Augmentation for Biomedical Factoid Question AnsweringCode0
GANsemble for Small and Imbalanced Data Sets: A Baseline for Synthetic Microplastics DataCode0
Mixing times of data-augmentation Gibbs samplers for high-dimensional probit regressionCode0
GANkyoku: a Generative Adversarial Network for Shakuhachi MusicCode0
Robustification of Multilingual Language Models to Real-world Noise in Crosslingual Zero-shot Settings with Robust Contrastive PretrainingCode0
GAN-GA: A Generative Model based on Genetic Algorithm for Medical Image GenerationCode0
A visualization method for data domain changes in CNN networks and the optimization method for selecting thresholds in classification tasksCode0
AdLER: Adversarial Training with Label Error Rectification for One-Shot Medical Image SegmentationCode0
GAN-Based Data Augmentation and Anonymization for Skin-Lesion Analysis: A Critical ReviewCode0
MixRep: Hidden Representation Mixup for Low-Resource Speech RecognitionCode0
Data-Augmentation-Based Dialectal Adaptation for LLMsCode0
An ordinal CNN approach for the assessment of neurological damage in Parkinson's disease patientsCode0
Galaxy Spin Classification I: Z-wise vs S-wise Spirals With Chirality Equivariant Residual NetworkCode0
RAIN: RegulArization on Input and Network for Black-Box Domain AdaptationCode0
Annotating FrameNet via Structure-Conditioned Language GenerationCode0
AVATAR: Adversarial Autoencoders with Autoregressive Refinement for Time Series GenerationCode0
Show:102550
← PrevPage 152 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified