SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 15011550 of 8378 papers

TitleStatusHype
Clarifying Myths About the Relationship Between Shape Bias, Accuracy, and Robustness0
Enhancing Indoor Temperature Forecasting through Synthetic Data in Low-Data Environments0
A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification0
Cut-and-Paste with Precision: a Content and Perspective-aware Data Augmentation for Road Damage Detection0
LNQ Challenge 2023: Learning Mediastinal Lymph Node Segmentation with a Probabilistic Lymph Node AtlasCode0
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories0
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract DescriptionsCode0
Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset ChallengeCode0
Enhancing Traffic Sign Recognition with Tailored Data Augmentation: Addressing Class Imbalance and Instance Scarcity0
Synthetic Oversampling: Theory and A Practical Approach Using LLMs to Address Data ImbalanceCode0
GET: A Generative EEG Transformer for Continuous Context-Based Neural Signals0
Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese0
Language-guided Detection and Mitigation of Unknown Dataset Bias0
Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering0
Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion0
Deterministic Reversible Data Augmentation for Neural Machine TranslationCode0
LexMatcher: Dictionary-centric Data Collection for LLM-based Machine TranslationCode0
ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models0
TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context SubsettingCode0
Sensitivity-Informed Augmentation for Robust Segmentation0
Robust Classification by Coupling Data Mollification with Label SmoothingCode0
DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic SurgeryCode0
Mixup Augmentation with Multiple Interpolations0
EMOE: Expansive Matching of Experts for Robust Uncertainty Based Rejection0
Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion ModelsCode0
Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion PriorCode1
Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect DetectionCode1
Understanding and Reducing the Class-Dependent Effects of Data Augmentation with A Two-Player Game Approach0
Domain generalization for retinal vessel segmentation via Hessian-based vector field0
Augmentation-based Unsupervised Cross-Domain Functional MRI Adaptation for Major Depressive Disorder Identification0
MVAD: A Multiple Visual Artifact Detector for Video Streaming0
Class-Based Time Series Data Augmentation to Mitigate Extreme Class Imbalance for Solar Flare Prediction0
GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification0
Symmetries in Overparametrized Neural Networks: A Mean-Field View0
Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation0
PGA-SciRE: Harnessing LLM on Data Augmentation for Enhancing Scientific Relation Extraction0
Mitigating annotation shift in cancer classification using single image generative modelsCode0
Improving SMOTE via Fusing Conditional VAE for Data-adaptive Noise Filtering0
FaceMixup: Enhancing Facial Expression Recognition through Mixed Face Regularization0
Can the accuracy bias by facial hairstyle be reduced through balancing the training data?0
Weights Augmentation: it has never ever ever ever let her model downCode0
A Point-Neighborhood Learning Framework for Nasal Endoscope Image Segmentation0
Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology0
Leveraging Generative AI for Urban Digital Twins: A Scoping Review on the Autonomous Generation of Urban Data, Scenarios, Designs, and 3D City Models for Smart City Advancement0
EntProp: High Entropy Propagation for Improving Accuracy and Robustness0
EventZoom: A Progressive Approach to Event-Based Data Augmentation for Enhanced Neuromorphic Vision0
Causal Action Influence Aware Counterfactual Data AugmentationCode1
MM-Mixing: Multi-Modal Mixing Alignment for 3D Understanding0
Data-augmented phrase-level alignment for mitigating object hallucination0
Arithmetic Reasoning with LLM: Prolog Generation & Permutation0
Show:102550
← PrevPage 31 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified