SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 19011950 of 8378 papers

TitleStatusHype
AnatoMix: Anatomy-aware Data Augmentation for Multi-organ Segmentation0
False Positive Sampling-based Data Augmentation for Enhanced 3D Object Detection Accuracy0
Data Augmentation using Large Language Models: Data Perspectives, Learning Paradigms and Challenges0
Emergent Equivariance in Deep Ensembles0
Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation0
FedHCDR: Federated Cross-Domain Recommendation with Hypergraph Signal DecouplingCode1
A Generative Model of Symmetry TransformationsCode0
AS-ES Learning: Towards Efficient CoT Learning in Small Models0
Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image ClassificationCode1
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language ModelsCode1
Bayesian Uncertainty Estimation by Hamiltonian Monte Carlo: Applications to Cardiac MRI Segmentation0
Fine Tuning vs. Retrieval Augmented Generation for Less Popular KnowledgeCode0
Multi-level Product Category Prediction through Text ClassificationCode0
Self-Supervised Representation Learning with Meta Comprehensive Regularization0
ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation0
OpenGraph: Towards Open Graph Foundation ModelsCode3
The Impact of Frequency Bands on Acoustic Anomaly Detection of Machines using Deep Learning Based Model0
Improving Socratic Question Generation using Data Augmentation and Preference OptimizationCode0
Fractal interpolation in the context of prediction accuracy optimization0
VisRec: A Semi-Supervised Approach to Radio Interferometric Data Reconstruction0
Improving Android Malware Detection Through Data Augmentation Using Wasserstein Generative Adversarial Networks0
MALTO at SemEval-2024 Task 6: Leveraging Synthetic Data for LLM Hallucination Detection0
Binary Gaussian Copula Synthesis: A Novel Data Augmentation Technique to Advance ML-based Clinical Decision Support Systems for Early Prediction of Dialysis Among CKD Patients0
Enhancing Protein Predictive Models via Proteins Data Augmentation: A Benchmark and New Directions0
Predicting UAV Type: An Exploration of Sampling and Data Augmentation for Time Series Classification0
Learning to Find Missing Video Frames with Synthetic Data Augmentation: A General Framework and Application in Generating Thermal Images Using RGB Cameras0
Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance0
StiefelGen: A Simple, Model Agnostic Approach for Time Series Data Augmentation over Riemannian Manifolds0
WHU-Synthetic: A Synthetic Perception Dataset for 3-D Multitask Model ResearchCode1
A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric EstimationCode0
OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction0
Data augmentation method for modeling health records with applications to clopidogrel treatment failure detection0
Classes Are Not Equal: An Empirical Study on Image Recognition Fairness0
Robust Synthetic Data-Driven Detection of Living-Off-the-Land Reverse Shells0
Why does music source separation benefit from cacophony?0
3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labellingCode1
Balancing Act: Distribution-Guided Debiasing in Diffusion Models0
FSL-Rectifier: Rectify Outliers in Few-Shot Learning via Test-Time AugmentationCode0
How we won BraTS 2023 Adult Glioma challenge? Just faking it! Enhanced Synthetic Data Augmentation and Model Ensemble for brain tumour segmentation0
Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data0
Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning0
Towards Explainability and Fairness in Swiss Judgement Prediction: Benchmarking on a Multilingual Dataset0
CodeS: Towards Building Open-source Language Models for Text-to-SQLCode2
LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification0
Generative AI in Vision: A Survey on Models, Metrics and Applications0
A Poisson-Gamma Dynamic Factor Model with Time-Varying Transition Dynamics0
Exploring the Power of Pure Attention Mechanisms in Blind Room Parameter Estimation0
Attention-GAN for Anomaly Detection: A Cutting-Edge Approach to Cybersecurity Threat Management0
NeSy is alive and well: A LLM-driven symbolic approach for better code comment data generation and classificationCode0
Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical StudyCode0
Show:102550
← PrevPage 39 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified