SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 82518300 of 8378 papers

TitleStatusHype
EdaCSC: Two Easy Data Augmentation Methods for Chinese Spelling CorrectionCode0
Traffic Sign Recognition Dataset and Data AugmentationCode0
Asynchronous Graph GeneratorCode0
Temporal Convolutional Memory Networks for Remaining Useful Life Estimation of Industrial MachineryCode0
PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification TasksCode0
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and CalibrationCode0
One-shot Generative Distribution Matching for Augmented RF-based UAV IdentificationCode0
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language DetectionCode0
PEPL: Precision-Enhanced Pseudo-Labeling for Fine-Grained Image Classification in Semi-Supervised LearningCode0
Case-Base Neural Networks: survival analysis with time-varying, higher-order interactionsCode0
Echo-E^3Net: Efficient Endo-Epi Spatio-Temporal Network for Ejection Fraction EstimationCode0
ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic SegmentationCode0
Asynchronous and Distributed Data Augmentation for Massive Data SettingsCode0
Performance of GAN-based augmentation for deep learning COVID-19 image classificationCode0
Sequence-to-Sequence Data Augmentation for Dialogue Language UnderstandingCode0
Action Sequence Augmentation for Early Graph-based Anomaly DetectionCode0
Persian Emotion Detection using ParsBERT and Imbalanced Data Handling ApproachesCode0
Temporal Supervised Contrastive Learning for Modeling Patient Risk ProgressionCode0
Learning unfolded networks with a cyclic group structureCode0
A Survey on Deep Learning of Small Sample in Biomedical Image AnalysisCode0
Improving Sequential Recommendations via Bidirectional Temporal Data Augmentation with Pre-trainingCode0
Cascading Hierarchical Networks with Multi-task Balanced Loss for Fine-grained hashingCode0
Weighted Automata Extraction and Explanation of Recurrent Neural Networks for Natural Language TasksCode0
A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric EstimationCode0
Cascade Bagging for Accuracy Prediction with Few Training SamplesCode0
Perturb, Predict & Paraphrase: Semi-Supervised Learning using Noisy Student for Image CaptioningCode0
A Survey of Data Synthesis ApproachesCode0
Adaptive Data Augmentation for Aspect Sentiment Quad PredictionCode0
E2TP: Element to Tuple Prompting Improves Aspect Sentiment Tuple PredictionCode0
CardiacGen: A Hierarchical Deep Generative Model for Cardiac SignalsCode0
PGCS: Physical Law embedded Generative Cloud Synthesis in Remote Sensing ImagesCode0
Tensor feature hallucination for few-shot learningCode0
Astraea: Self-balancing Federated Learning for Improving Classification Accuracy of Mobile Deep Learning ApplicationsCode0
Dynamic Data Augmentation via MCTS for Prostate MRI SegmentationCode0
PHICON: Improving Generalization of Clinical Text De-identification Models via Data AugmentationCode0
Adapting Video Diffusion Models for Time-Lapse MicroscopyCode0
Term Expansion and FinBERT fine-tuning for Hypernym and Synonym Ranking of Financial TermsCode0
CapsuleNet: A Deep Learning Model To Classify GI Diseases Using EfficientNet-b7Code0
Training Data Augmentation for Code-Mixed TranslationCode0
Can We Break Free from Strong Data Augmentations in Self-Supervised Learning?Code0
Can We Achieve More with Less? Exploring Data Augmentation for Toxic Comment ClassificationCode0
Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object DetectionCode0
DualMatch: Robust Semi-Supervised Learning with Dual-Level InteractionCode0
Training Data Augmentation for Context-Sensitive Neural Lemmatizer Using Inflection Tables and Raw TextCode0
Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion ModelsCode0
Testing and Improving the Robustness of Amortized Bayesian Inference for Cognitive ModelsCode0
Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentationCode0
Weighted Contrastive HashingCode0
Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech RecognitionCode0
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentationCode0
Show:102550
← PrevPage 166 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified