SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 22512300 of 8378 papers

TitleStatusHype
Fast Mixing of Data Augmentation Algorithms: Bayesian Probit, Logit, and Lasso RegressionCode0
AGMixup: Adaptive Graph Mixup for Semi-supervised Node ClassificationCode0
Comparative Opinion Mining in Product Reviews: Multi-perspective Prompt-based Learning0
LLaVA-Zip: Adaptive Visual Token Compression with Intrinsic Image Information0
Can We Generate Visual Programs Without Prompting LLMs?0
A Physics-based Generative Model to Synthesize Training Datasets for MRI-based Fat Quantification0
BDA: Bangla Text Data Augmentation FrameworkCode0
Improving the Natural Language Inference robustness to hard dataset by data augmentation and preprocessing0
CoPrUS: Consistency Preserving Utterance Synthesis towards more realistic benchmark dialoguesCode0
Optimizing Alignment with Less: Leveraging Data Augmentation for Personalized Evaluation0
A Parametric Approach to Adversarial Augmentation for Cross-Domain Iris Presentation Attack DetectionCode0
Leveraging Content and Context Cues for Low-Light Image EnhancementCode0
Bayesian Data Augmentation and Training for Perception DNN in Autonomous Aerial VehiclesCode0
Multi-Scale Contrastive Learning for Video Temporal Grounding0
Generative Modeling and Data Augmentation for Power System Production SimulationCode0
World-Consistent Data Generation for Vision-and-Language Navigation0
SGIA: Enhancing Fine-Grained Visual Classification with Sequence Generative Image Augmentation0
Data Augmentation with Variational Autoencoder for Imbalanced DatasetCode0
HSDA: High-frequency Shuffle Data Augmentation for Bird's-Eye-View Map SegmentationCode0
MIMO Detection under Hardware Impairments: Data Augmentation With Boosting0
Comprehensive Evaluation of Multimodal AI Models in Medical Imaging Diagnosis: From Data Augmentation to Preference-Based Comparison0
UNet++ and LSTM combined approach for Breast Ultrasound Image Segmentation0
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASRCode0
Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images0
Improving Post-Earthquake Crack Detection using Semi-Synthetic Generated Images0
EvTTC: An Event Camera Dataset for Time-to-Collision Estimation0
Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud0
BhashaVerse : Translation Ecosystem for Indian Subcontinent Languages0
Enhancing Mathematical Reasoning in LLMs with Background Operators0
Curriculum-style Data Augmentation for LLM-based Metaphor Detection0
Channel Reflection: Knowledge-Driven Data Augmentation for EEG-Based Brain-Computer Interfaces0
Few-Shot Learning with Adaptive Weight Masking in Conditional GANs0
Variable-Speed Teaching-Playback as Real-World Data Augmentation for Imitation Learning0
Tight PAC-Bayesian Risk Certificates for Contrastive LearningCode0
Distillation of Diffusion Features for Semantic Correspondence0
QA-TOOLBOX: Conversational Question-Answering for process task guidance in manufacturing0
Planning-Guided Diffusion Policy Learning for Generalizable Contact-Rich Bimanual Manipulation0
GUESS: Generative Uncertainty Ensemble for Self Supervision0
Robust soybean seed yield estimation using high-throughput ground robot videos0
GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing0
Direct Coloring for Self-Supervised Enhanced Feature Decoupling0
Evaluating the Impact of Data Augmentation on Predictive Model Performance0
Su-RoBERTa: A Semi-supervised Approach to Predicting Suicide Risk through Social Media using Base Language Models0
ECG-SleepNet: Deep Learning-Based Comprehensive Sleep Stage Classification Using ECG Signals0
Multi-View Incongruity Learning for Multimodal Sarcasm Detection0
A Semi-Supervised Approach with Error Reflection for Echocardiography Segmentation0
Improving speaker verification robustness with synthetic emotional utterances0
Table Integration in Data Lakes Unleashed: Pairwise Integrability Judgment, Integrable Set Discovery, and Multi-Tuple Conflict Resolution0
BGM: Background Mixup for X-ray Prohibited Items Detection0
Topology-Preserving Scaling in Data Augmentation0
Show:102550
← PrevPage 46 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified