SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 101150 of 8378 papers

TitleStatusHype
Boosting Automatic Exercise Evaluation Through Musculoskeletal Simulation-Based IMU Data Augmentation0
A Flat Minima Perspective on Understanding Augmentations and Model Robustness0
QGAN-based data augmentation for hybrid quantum-classical neural networks0
Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation0
Pseudo Multi-Source Domain Generalization: Bridging the Gap Between Single and Multi-Source Domain GeneralizationCode0
AMBER: Adaptive Mesh Generation by Iterative Mesh Resolution PredictionCode0
Individualised Counterfactual Examples Using Conformal Prediction Intervals0
Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs0
Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification0
PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization0
Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics0
Supervised Contrastive Learning for Ordinal Engagement Measurement0
Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation0
RoBiS: Robust Binary Segmentation for High-Resolution Industrial ImagesCode1
Enhancing Contrastive Learning-based Electrocardiogram Pretrained Model with Patient Memory QueueCode0
Detection of Suicidal Risk on Social Media: A Hybrid Model0
ControlTac: Force- and Position-Controlled Tactile Data Augmentation with a Single Reference Image0
Lorentz Local Canonicalization: How to Make Any Network Lorentz-Equivariant0
Improving Heart Rejection Detection in XPCI Images Using Synthetic Data Augmentation0
A Regularization-Guided Equivariant Approach for Image RestorationCode1
LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image SegmentationCode1
REARANK: Reasoning Re-ranking Agent via Reinforcement LearningCode1
Revolutionizing Wildfire Detection with Convolutional Neural Networks: A VGG16 Model Approach0
Learn Beneficial Noise as Graph Augmentation0
Behavior Injection: Preparing Language Models for Reinforcement LearningCode1
A Joint Learning Framework with Feature Reconstruction and Prediction for Incomplete Satellite Image Time Series in Agricultural Semantic SegmentationCode0
How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic SegmentationCode1
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation0
Building a Functional Machine Translation Corpus for Kpelle0
Beyond Domain Randomization: Event-Inspired Perception for Visually Robust Adversarial Imitation from VideosCode0
Large language model as user daily behavior data generator: balancing population diversity and individual personality0
Supervised Graph Contrastive Learning for Gene Regulatory Network0
What Do You Need for Diverse Trajectory Stitching in Diffusion Planning?0
Audio-to-Audio Emotion Conversion With Pitch And Duration Style Transfer0
Maximum Total Correlation Reinforcement LearningCode0
Efficient Prototype Consistency Learning in Medical Image Segmentation via Joint Uncertainty and Data Augmentation0
Swin Transformer for Robust CGI Images Detection: Intra- and Inter-Dataset Analysis across Multiple Color Spaces0
Does Synthetic Data Help Named Entity Recognition for Low-Resource Languages?0
Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification0
15,500 Seconds: Lean UAV Classification Leveraging PEFT and Pre-Trained NetworksCode0
Aug2Search: Enhancing Facebook Marketplace Search with LLM-Generated Synthetic Data Augmentation0
Data Augmentation and Resolution Enhancement using GANs and Diffusion Models for Tree Segmentation0
Geometrically Regularized Transfer Learning with On-Manifold and Off-Manifold Perturbation0
GAMA: Geometry-Aware Manifold Alignment via Structured Adversarial Perturbations for Robust Domain Adaptation0
GSDFuse: Capturing Cognitive Inconsistencies from Multi-Dimensional Weak Signals in Social Media SteganalysisCode0
Challenges and Limitations in the Synthetic Generation of mHealth Sensor Data0
Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals0
Mixing times of data-augmentation Gibbs samplers for high-dimensional probit regressionCode0
Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data0
SSPS: Self-Supervised Positive Sampling for Robust Self-Supervised Speaker VerificationCode0
Show:102550
← PrevPage 3 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified