SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 74517500 of 8378 papers

TitleStatusHype
GeoMix: Towards Geometry-Aware Data AugmentationCode0
Generative Style Transfer for MRI Image Segmentation: A Case of Glioma Segmentation in Sub-Saharan AfricaCode0
A Geometry-Sensitive Approach for Photographic Style ClassificationCode0
Type-Driven Multi-Turn Corrections for Grammatical Error CorrectionCode0
BDA: Bangla Text Data Augmentation FrameworkCode0
Together We Can: Multilingual Automatic Post-Editing for Low-Resource LanguagesCode0
Maximum Bayes Smatch Ensemble Distillation for AMR ParsingCode0
Statistical Depth for Ranking and Characterizing Transformer-Based Text EmbeddingsCode0
Generative Modeling Helps Weak Supervision (and Vice Versa)Code0
PhiNet v2: A Mask-Free Brain-Inspired Vision Foundation Model from VideoCode0
Maximum Total Correlation Reinforcement LearningCode0
Generative Modeling and Data Augmentation for Power System Production SimulationCode0
Return of the Devil in the Details: Delving Deep into Convolutional NetsCode0
Generative Model-Driven Synthetic Training Image Generation: An Approach to Cognition in Rail Defect DetectionCode0
A Parameterized Generative Adversarial Network Using Cyclic Projection for Explainable Medical Image ClassificationCode0
MDMLP: Image Classification from Scratch on Small Datasets with MLPCode0
A Novel Feature Space Augmentation Method to Improve Classification Performance and Evaluation ReliabilityCode0
Data Augmentation Techniques for Cross-Domain WiFi CSI-based Human Activity RecognitionCode0
Reverse Operation based Data Augmentation for Solving Math Word ProblemsCode0
A Dual-Contrastive Framework for Low-Resource Cross-Lingual Named Entity RecognitionCode0
Data Augmentation Techniques for Chinese Disease Name NormalizationCode0
Measuring the Robustness of Audio Deepfake DetectorsCode0
Generative Image Translation for Data Augmentation in Colorectal Histopathology ImagesCode0
ScribbleGen: Generative Data Augmentation Improves Scribble-supervised Semantic SegmentationCode0
Data augmentation on-the-fly and active learning in data stream classificationCode0
MediAug: Exploring Visual Augmentation in Medical ImagingCode0
BCH-NLP at BioCreative VII Track 3: medications detection in tweets using transformer networks and multi-task learningCode0
Medical Image Segmentation Using Deep Learning: A SurveyCode0
Medical Image Segmentation via Single-Source Domain Generalization with Random Amplitude Spectrum SynthesisCode0
Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic SegmentationCode0
Steganographic Embeddings as an Effective Data AugmentationCode0
BSDA: Bayesian Random Semantic Data Augmentation for Medical Image ClassificationCode0
Revisiting Data Augmentation for Ultrasound ImagesCode0
Revisiting Data Augmentation in Deep Reinforcement LearningCode0
Bayesian Neural Network Language Modeling for Speech RecognitionCode0
MedMine: Examining Pre-trained Language Models on Medication MiningCode0
Generative-Contrastive Heterogeneous Graph Neural NetworkCode0
MedRep: Medical Concept Representation for General Electronic Health Record Foundation ModelsCode0
Bayesian Data Augmentation and Training for Perception DNN in Autonomous Aerial VehiclesCode0
Generative AI-Powered Plugin for Robust Federated Learning in Heterogeneous IoT NetworksCode0
Balanced Split: A new train-test data splitting strategy for imbalanced datasetsCode0
TopGuNN: Fast NLP Training Data Augmentation using Large CorporaCode0
Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case StudyCode0
How Does Data Augmentation Affect Privacy in Machine Learning?Code0
Balanced and Explainable Social Media Analysis for Public Health with Large Language ModelsCode0
Revisiting Knowledge Distillation under Distribution ShiftCode0
Generative Adversarial Network with Spatial Attention for Face Attribute EditingCode0
Data augmentation on graphs for table type classificationCode0
A Comparative Study of Pre-training and Self-trainingCode0
A Comparative Study of Graph Neural Networks for Shape Classification in NeuroimagingCode0
Show:102550
← PrevPage 150 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified