SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 36013650 of 8378 papers

TitleStatusHype
Few-shot Learning using Data Augmentation and Time-Frequency Transformation for Time Series Classification0
Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language ModelsCode0
SSL-DG: Rethinking and Fusing Semi-supervised Learning and Domain Generalization in Medical Image SegmentationCode0
TreeSwap: Data Augmentation for Machine Translation via Dependency Subtree SwappingCode0
Noise-Agnostic Quantum Error Mitigation with Data Augmented Neural ModelsCode0
Comparative Knowledge DistillationCode0
Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection0
Tailoring Mixup to Data for CalibrationCode0
Improving Robustness via Tilted Exponential Layer: A Communication-Theoretic PerspectiveCode0
Deep Double Descent for Time Series Forecasting: Avoiding Undertrained Models0
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language DetectionCode0
Data Augmentation for Code Translation with Comparable Corpora and Multiple ReferencesCode0
Rethinking Samples Selection for Contrastive Learning: Mining of Potential Samples0
C2C: Cough to COVID-19 Detection in BHI 2023 Data ChallengeCode0
Bayes-enhanced Multi-view Attention Networks for Robust POI Recommendation0
Is Robustness Transferable across Languages in Multilingual Neural Machine Translation?0
Dynamic Batch Norm Statistics Update for Natural Robustness0
Histopathological Image Analysis with Style-Augmented Feature Domain Mixing for Improved GeneralizationCode0
Thermal-Infrared Remote Target Detection System for Maritime Rescue based on Data Augmentation with 3D Synthetic Data0
Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving0
A Lightweight Method to Generate Unanswerable Questions in EnglishCode0
A Note on Generalization in Variational Autoencoders: How Effective Is Synthetic Data & Overparameterization?0
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise0
On Linear Separation Capacity of Self-Supervised Representation Learning0
Exploring Data Augmentations on Self-/Semi-/Fully- Supervised Pre-trained Models0
ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object DetectionCode0
OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning0
Large-scale Foundation Models and Generative AI for BigData Neuroscience0
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning0
MixRep: Hidden Representation Mixup for Low-Resource Speech RecognitionCode0
Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained ModelCode0
Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models0
PAC-tuning:Fine-tuning Pretrained Language Models with PAC-driven Perturbed Gradient Descent0
Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge0
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning UpdatesCode0
Better integrating vision and semantics for improving few-shot classificationCode0
Data Augmentation for Emotion Detection in Small Imbalanced Text DataCode0
Improving Few-shot Generalization of Safety Classifiers via Data Augmented Parameter-Efficient Fine-Tuning0
UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception0
Transferring a molecular foundation model for polymer property predictions0
Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data AugmentationCode0
Early Detection of Tuberculosis with Machine Learning Cough Audio Analysis: Towards More Accessible Global Triaging Usage0
DualMatch: Robust Semi-Supervised Learning with Dual-Level InteractionCode0
An Explainable Deep Learning-Based Method For Schizophrenia Diagnosis Using Generative Data-Augmentation0
Using GPT-4 to Augment Unbalanced Data for Automatic Scoring0
Improving Language Models Meaning Understanding and Consistency by Learning Conceptual Roles from Dictionary0
Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles0
Towards contrast-agnostic soft segmentation of the spinal cordCode0
Statistical Depth for Ranking and Characterizing Transformer-Based Text EmbeddingsCode0
S3Aug: Segmentation, Sampling, and Shift for Action Recognition0
Show:102550
← PrevPage 73 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified