SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 23512400 of 8378 papers

TitleStatusHype
Lie Point Symmetry and Physics Informed Networks0
Augmenting Radio Signals with Wavelet Transform for Deep Learning-Based Modulation Recognition0
Improving the Effectiveness of Deep Generative Data0
Spoken Dialogue System for Medical Prescription Acquisition on Smartphone: Development, Corpus and Evaluation0
MixUp-MIL: A Study on Linear & Multilinear Interpolation-Based Data Augmentation for Whole Slide Image Classification0
Model-based Counterfactual Generator for Gender Bias Mitigation0
Few-shot Learning using Data Augmentation and Time-Frequency Transformation for Time Series Classification0
DAIL: Data Augmentation for In-Context Learning via Self-Paraphrase0
Co-training and Co-distillation for Quality Improvement and Compression of Language Models0
Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language ModelsCode0
SSL-DG: Rethinking and Fusing Semi-supervised Learning and Domain Generalization in Medical Image SegmentationCode0
TreeSwap: Data Augmentation for Machine Translation via Dependency Subtree SwappingCode0
Using DUCK-Net for Polyp Image SegmentationCode1
Comparative Knowledge DistillationCode0
Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection0
Noise-Agnostic Quantum Error Mitigation with Data Augmented Neural ModelsCode0
Distilling Out-of-Distribution Robustness from Vision-Language Foundation ModelsCode1
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language DetectionCode0
Improving Robustness via Tilted Exponential Layer: A Communication-Theoretic PerspectiveCode0
Tailoring Mixup to Data for CalibrationCode0
Concatenated Masked Autoencoders as Spatial-Temporal LearnerCode1
Deep Double Descent for Time Series Forecasting: Avoiding Undertrained Models0
C2C: Cough to COVID-19 Detection in BHI 2023 Data ChallengeCode0
Data Augmentation for Code Translation with Comparable Corpora and Multiple ReferencesCode0
Rethinking Samples Selection for Contrastive Learning: Mining of Potential Samples0
DEFN: Dual-Encoder Fourier Group Harmonics Network for Three-Dimensional Indistinct-Boundary Object SegmentationCode1
Bayes-enhanced Multi-view Attention Networks for Robust POI Recommendation0
Dynamic Batch Norm Statistics Update for Natural Robustness0
Thermal-Infrared Remote Target Detection System for Maritime Rescue based on Data Augmentation with 3D Synthetic Data0
Is Robustness Transferable across Languages in Multilingual Neural Machine Translation?0
Histopathological Image Analysis with Style-Augmented Feature Domain Mixing for Improved GeneralizationCode0
Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving0
A Lightweight Method to Generate Unanswerable Questions in EnglishCode0
A Note on Generalization in Variational Autoencoders: How Effective Is Synthetic Data & Overparameterization?0
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise0
On Linear Separation Capacity of Self-Supervised Representation Learning0
Empowering Collaborative Filtering with Principled Adversarial Contrastive LossCode1
ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object DetectionCode0
Exploring Data Augmentations on Self-/Semi-/Fully- Supervised Pre-trained Models0
OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning0
Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-OptCode1
MixRep: Hidden Representation Mixup for Low-Resource Speech RecognitionCode0
Instance Segmentation under Occlusions via Location-aware Copy-Paste Data AugmentationCode1
Large-scale Foundation Models and Generative AI for BigData Neuroscience0
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning0
Semi-Supervised Panoptic Narrative GroundingCode1
Better integrating vision and semantics for improving few-shot classificationCode0
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning UpdatesCode0
Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge0
PAC-tuning:Fine-tuning Pretrained Language Models with PAC-driven Perturbed Gradient Descent0
Show:102550
← PrevPage 48 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified