SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 15511600 of 8378 papers

TitleStatusHype
Arithmetic Reasoning with LLM: Prolog Generation & Permutation0
Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection0
RC-Mixup: A Data Augmentation Strategy against Noisy Data for Regression Tasks0
A Recipe for Unbounded Data Augmentation in Visual Reinforcement LearningCode1
DualContrast: Unsupervised Disentangling of Content and Transformations with Implicit Parameterization0
Saturn: Sample-efficient Generative Molecular Design using Memory ManipulationCode2
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement LearningCode1
Acceleration of Grokking in Learning Arithmetic Operations via Kolmogorov-Arnold Representation0
Dominant Shuffle: A Simple Yet Powerful Data Augmentation for Time-series Prediction0
TreeFormers -- An Exploration of Vision Transformers for Deforestation Driver Classification0
Certifying Adapters: Enabling and Enhancing the Certification of Classifier Adversarial Robustness0
USD: Unsupervised Soft Contrastive Learning for Fault Detection in Multivariate Time SeriesCode1
Planted: a dataset for planted forest identification from multi-satellite time series0
Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification0
Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments0
Out of Many, One: Designing and Scaffolding Proteins at the Scale of the Structural Universe with Genie 2Code2
Unbiased Faster R-CNN for Single-source Domain Generalized Object Detection0
Combining Euclidean Alignment and Data Augmentation for BCI decoding0
Data Augmentation Method Utilizing Template Sentences for Variable Definition Extraction0
Improving Language Models Trained on Translated Data with Continual Pre-Training and Dictionary Learning Analysis0
Can LLMs Solve longer Math Word Problems Better?Code0
Combining Denoising Autoencoders with Contrastive Learning to fine-tune Transformer ModelsCode0
Data Augmentation Techniques for Process Extraction from Scientific Publications0
Synthetic Data Generation for Intersectional Fairness by Leveraging Hierarchical Group Structure0
Configuring Data Augmentations to Reduce Variance Shift in Positional Embedding of Vision Transformers0
FloodDamageCast: Building Flood Damage Nowcasting with Machine Learning and Data Augmentation0
YOLOv10: Real-Time End-to-End Object DetectionCode11
Domain-specific augmentations with resolution agnostic self-attention mechanism improves choroid segmentation in optical coherence tomography images0
An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models0
Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood DiscrepancyCode1
Mosaic-IT: Free Compositional Data Augmentation Improves Instruction TuningCode1
A Label Propagation Strategy for CutMix in Multi-Label Remote Sensing Image Classification0
Boosted Neural Decoders: Achieving Extreme Reliability of LDPC Codes for 6G NetworksCode1
Rumor Detection on Social Media with Reinforcement Learning-based Key Propagation Graph Generator0
Mining the Explainability and Generalization: Fact Verification Based on Self-Instruction0
Generalize Polyp Segmentation via Inpainting across Diverse Backgrounds and Pseudo-Mask RefinementCode0
Towards Graph Contrastive Learning: A Survey and Beyond0
Data Augmentation for Text-based Person Retrieval Using Large Language Models0
Speech-dependent Data Augmentation for Own Voice Reconstruction with Hearable Microphones in Noisy Environments0
Conditionally-Conjugate Gaussian Process Factor Analysis for Spike Count Data via Data Augmentation0
Computer Vision in the Food Industry: Accurate, Real-time, and Automatic Food Recognition with Pretrained MobileNetV20
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation0
Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge SystemCode4
Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track0
MixCut:A Data Augmentation Method for Facial Expression Recognition0
Harnessing Collective Structure Knowledge in Data Augmentation for Graph Neural NetworksCode0
Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous TranslationCode1
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Training Deep Learning Models with Hybrid Datasets for Robust Automatic Target Detection on real SAR images0
A Comprehensive Survey on Data Augmentation0
Show:102550
← PrevPage 32 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified