SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 751800 of 8378 papers

TitleStatusHype
EvTTC: An Event Camera Dataset for Time-to-Collision Estimation0
Improving Post-Earthquake Crack Detection using Semi-Synthetic Generated Images0
DEIM: DETR with Improved Matching for Fast ConvergenceCode5
BhashaVerse : Translation Ecosystem for Indian Subcontinent Languages0
Enhancing Mathematical Reasoning in LLMs with Background Operators0
Few-Shot Learning with Adaptive Weight Masking in Conditional GANs0
Channel Reflection: Knowledge-Driven Data Augmentation for EEG-Based Brain-Computer Interfaces0
Distillation of Diffusion Features for Semantic Correspondence0
Tight PAC-Bayesian Risk Certificates for Contrastive LearningCode0
Variable-Speed Teaching-Playback as Real-World Data Augmentation for Imitation Learning0
Curriculum-style Data Augmentation for LLM-based Metaphor Detection0
GUESS: Generative Uncertainty Ensemble for Self Supervision0
Many-MobileNet: Multi-Model Augmentation for Robust Retinal Disease ClassificationCode2
Evaluating the Impact of Data Augmentation on Predictive Model Performance0
GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing0
Planning-Guided Diffusion Policy Learning for Generalizable Contact-Rich Bimanual Manipulation0
Robust soybean seed yield estimation using high-throughput ground robot videos0
ProbPose: A Probabilistic Approach to 2D Human Pose EstimationCode2
Direct Coloring for Self-Supervised Enhanced Feature Decoupling0
QA-TOOLBOX: Conversational Question-Answering for process task guidance in manufacturing0
Su-RoBERTa: A Semi-supervised Approach to Predicting Suicide Risk through Social Media using Base Language Models0
ECG-SleepNet: Deep Learning-Based Comprehensive Sleep Stage Classification Using ECG Signals0
Multi-View Incongruity Learning for Multimodal Sarcasm Detection0
A Semi-Supervised Approach with Error Reflection for Echocardiography Segmentation0
Table Integration in Data Lakes Unleashed: Pairwise Integrability Judgment, Integrable Set Discovery, and Multi-Tuple Conflict Resolution0
Improving speaker verification robustness with synthetic emotional utterances0
BGM: Background Mixup for X-ray Prohibited Items Detection0
Improving the performance of weak supervision searches using data augmentation0
T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMsCode1
Topology-Preserving Scaling in Data Augmentation0
Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation0
Reverse Thinking Makes LLMs Stronger Reasoners0
CantorNet: A Sandbox for Testing Geometrical and Topological Complexity Measures0
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image SegmentationCode1
Data Augmentation with Diffusion Models for Colon Polyp Localization on the Low Data Regime: How much real data is enough?0
UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation0
Dual-Level Boost Network for Long-Tail Prohibited Items Detection in X-ray Security Inspection0
Enhancing weed detection performance by means of GenAI-based image augmentation0
Training and Evaluating Language Models with Template-based Data GenerationCode1
Thai Financial Domain Adaptation of THaLLE -- Technical Report0
Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification0
Breast Tumor Classification Using EfficientNet Deep Learning ModelCode0
Task Progressive Curriculum Learning for Robust Visual Question Answering0
Scaling nnU-Net for CBCT Segmentation0
Semantic Data Augmentation for Long-tailed Facial Expression Recognition0
RoCoDA: Counterfactual Data Augmentation for Data-Efficient Robot Learning from Demonstrations0
SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models0
J-CaPA : Joint Channel and Pyramid Attention Improves Medical Image Segmentation0
Enhancing Few-Shot Learning with Integrated Data and GAN Model Approaches0
Unsupervised Event Outlier Detection in Continuous Time0
Show:102550
← PrevPage 16 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified