SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 9511000 of 8378 papers

TitleStatusHype
A Locality-based Neural Solver for Optical Motion CaptureCode1
Minority-Focused Text-to-Image Generation via Prompt OptimizationCode1
Mitigating Data Heterogeneity in Federated Learning with Data AugmentationCode1
Mitigating Data Sparsity for Short Text Topic Modeling by Topic-Semantic Contrastive LearningCode1
Mixed Autoencoder for Self-supervised Visual Representation LearningCode1
MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error CorrectionCode1
Mixing-Specific Data Augmentation Techniques for Improved Blind Violin/Piano Source SeparationCode1
Mixing Up Contrastive Learning: Self-Supervised Representation Learning for Time SeriesCode1
Contextual Similarity Aggregation with Self-attention for Visual Re-rankingCode1
Mixup for Node and Graph ClassificationCode1
Alternate Diverse Teaching for Semi-supervised Medical Image SegmentationCode1
Progressive Multi-Modality Learning for Inverse Protein FoldingCode1
MoCoDA: Model-based Counterfactual Data AugmentationCode1
MODALS: Modality-agnostic Automated Data Augmentation in the Latent SpaceCode1
AltFreezing for More General Video Face Forgery DetectionCode1
Modeling the Probabilistic Distribution of Unlabeled Data forOne-shot Medical Image SegmentationCode1
Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment AnalysisCode1
Monkeypox Image Data collectionCode1
Anatomical Data Augmentation via Fluid-based Image RegistrationCode1
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function PerspectiveCode1
AugmentedNet: A Roman Numeral Analysis Network with Synthetic Training Examples and Additional Tonal TasksCode1
Augmented Neural Fine-Tuning for Efficient Backdoor PurificationCode1
MotionAug: Augmentation with Physical Correction for Human Motion PredictionCode1
Motion-Focused Contrastive Learning of Video RepresentationsCode1
Contemplating real-world object classificationCode1
State-of-the-Art Augmented NLP Transformer models for direct and single-step retrosynthesisCode1
Augmented Ultrasonic Data for Machine LearningCode1
Multi-attentional Deepfake DetectionCode1
Continual Few-shot Relation Learning via Embedding Space Regularization and Data AugmentationCode1
Multi-level Cross-view Contrastive Learning for Knowledge-aware Recommender SystemCode1
Multi-modal Conditional Bounding Box Regression for Music Score FollowingCode1
Multimodal Data Augmentation for Visual-Infrared Person ReID with Corrupted DataCode1
Augmenting DL with Adversarial Training for Robust Prediction of Epilepsy SeizuresCode1
Augmenting Document Representations for Dense Retrieval with Interpolation and PerturbationCode1
Multi-Sample based Contrastive Loss for Top-k RecommendationCode1
Multi-Spectral Image Synthesis for Crop/Weed Segmentation in Precision FarmingCode1
Contrastive Learning for Knowledge TracingCode1
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter OptimizationCode1
MUM: Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object DetectionCode1
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and GroundingCode1
Conditioned Text Generation with Transfer for Closed-Domain Dialogue SystemsCode1
Confident Sinkhorn Allocation for Pseudo-LabelingCode1
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource LanguagesCode1
Augmenting Sequential Recommendation with Balanced Relevance and DiversityCode1
Concatenated Masked Autoencoders as Spatial-Temporal LearnerCode1
A U-Net Based Discriminator for Generative Adversarial NetworksCode1
Augmenting the User-Item Graph with Textual Similarity ModelsCode1
Negative Data AugmentationCode1
Conformal Prediction with Missing ValuesCode1
Composing Good Shots by Exploiting Mutual RelationsCode1
Show:102550
← PrevPage 20 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified