SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 251300 of 8378 papers

TitleStatusHype
Enhancing DR Classification with Swin Transformer and Shifted Window Attention0
Data Augmentation Using Neural Acoustic Fields With Retrieval-Augmented Pre-training0
Beyond One-Hot Labels: Semantic Mixing for Model CalibrationCode0
LIFT+: Lightweight Fine-Tuning for Long-Tail LearningCode0
MathPhys-Guided Coarse-to-Fine Anomaly Synthesis with SQE-Driven Bi-Level Optimization for Anomaly Detection0
ACoRN: Noise-Robust Abstractive Compression in Retrieval-Augmented Language Models0
NoisyRollout: Reinforcing Visual Reasoning with Data AugmentationCode2
Crossing the Human-Robot Embodiment Gap with Sim-to-Real RL using One Human Demonstration0
Effective Dual-Region Augmentation for Reduced Reliance on Large Amounts of Labeled DataCode0
Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios0
CDUPatch: Color-Driven Universal Adversarial Patch Attack for Dual-Modal Visible-Infrared Detectors0
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual PerceptionCode1
Dual-Path Enhancements in Event-Based Eye Tracking: Augmented Robustness and Adaptive Temporal Modeling0
Data Augmentation Through Random Style Replacement0
MASSeg : 2nd Technical Report for 4th PVUW MOSE TrackCode0
Unveiling Contrastive Learning's Capability of Neighborhood Aggregation for Collaborative FilteringCode1
Decoupled Diffusion Sparks Adaptive Scene Generation0
Towards contrast- and pathology-agnostic clinical fetal brain MRI segmentation using SynthSeg0
VAE-based Feature Disentanglement for Data Augmentation and Compression in Generalized GNSS Interference Classification0
Improving In-Context Learning with Reasoning DistillationCode0
Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark0
Span-level Emotion-Cause-Category Triplet Extraction with Instruction Tuning LLMs and Data AugmentationCode0
Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference UnderstandingCode0
seg2med: a bridge from artificial anatomy to multimodal medical images0
MedRep: Medical Concept Representation for General Electronic Health Record Foundation ModelsCode0
Diffusion Models for Robotic Manipulation: A Survey0
Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical ImagingCode1
Exploring Human-Like Thinking in Search Simulations with Large Language ModelsCode0
CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color ConstancyCode1
The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound0
Heart Failure Prediction using Modal Decomposition and Masked Autoencoders for Scarce Echocardiography DatabasesCode1
Deep Learning-Based Wideband Spectrum Sensing with Dual-Representation Inputs and Subband Shuffling Augmentation0
MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection0
A Comparison of Deep Learning Methods for Cell Detection in Digital CytologyCode0
WoundAmbit: Bridging State-of-the-Art Semantic Segmentation and Real-World Wound Care0
FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM ExtractionCode0
Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation0
Data Augmentation as Free Lunch: Exploring the Test-Time Augmentation for Sequential RecommendationCode1
S^4M: Boosting Semi-Supervised Instance Segmentation with SAM0
Dynamic hysteresis model of grain-oriented ferromagnetic material using neural operators0
Attentional Graph Meta-Learning for Indoor Localization Using Extremely Sparse Fingerprints0
AdvKT: An Adversarial Multi-Step Training Framework for Knowledge Tracing0
SDAFE: A Dual-filter Stable Diffusion Data Augmentation Method for Facial Expression Recognition0
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object DetectionCode2
Reciprocity-Aware Convolutional Neural Networks for Map-Based Path Loss Prediction0
QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning0
Finding the Reflection Point: Unpadding Images to Remove Data Augmentation Artifacts in Large Open Source Image Datasets for Machine Learning0
Mind the Prompt: Prompting Strategies in Audio Generations for Improving Sound Classification0
Data Augmentation of Time-Series Data in Human Movement Biomechanics: A Scoping Review0
LinTO Audio and Textual Datasets to Train and Evaluate Automatic Speech Recognition in Tunisian Arabic Dialect0
Show:102550
← PrevPage 6 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified