SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 51100 of 8378 papers

TitleStatusHype
Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models0
ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection0
HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration0
Self-Adapting Language Models0
DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers0
Alzheimer's Dementia Detection Using Perplexity from Paired Large Language Models0
CINeMA: Conditional Implicit Neural Multi-Modal Atlas for a Spatio-Temporal Representation of the Perinatal BrainCode0
ScoreMix: Improving Face Recognition via Score Composition in Diffusion Generators0
An Explainable Deep Learning Framework for Brain Stroke and Tumor Progression via MRI Interpretation0
SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research0
scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell DataCode1
GFRIEND: Generative Few-shot Reward Inference through EfficieNt DPOCode0
SSS: Semi-Supervised SAM-2 with Efficient Prompting for Medical Imaging SegmentationCode0
MOBODY: Model Based Off-Dynamics Offline Reinforcement LearningCode0
Data-Efficient Challenges in Visual Inductive Priors: A Retrospective0
Data Augmentation For Small Object using Fast AutoAugment0
Learning to Hear Broken Motors: Signature-Guided Data Augmentation for Induction-Motor Diagnostics0
Spatiotemporal deep learning models for detection of rapid intensification in cyclones0
Heavy Lasso: sparse penalized regression under heavy-tailed noise via data-augmented soft-thresholdingCode0
Scaling Human Activity Recognition: A Comparative Evaluation of Synthetic Data Generation and Augmentation Techniques0
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO0
Dealing with the Evil Twins: Improving Random Augmentation by Addressing Catastrophic Forgetting of Diverse Augmentations0
Deep Inertial Pose: A deep learning approach for human pose estimation0
Robust sensor fusion against on-vehicle sensor staleness0
Securing Traffic Sign Recognition Systems in Autonomous Vehicles0
Geometric and Physical Constraints Synergistically Enhance Neural PDE Surrogates0
Model-based Neural Data Augmentation for sub-wavelength Radio Localization0
PixCell: A generative foundation model for digital histopathology images0
IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation0
Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference ModelsCode0
LLM-based phoneme-to-grapheme for phoneme-based speech recognition0
hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation0
Person Re-Identification System at Semantic Level based on Pedestrian Attributes Ontology0
Fine-Tuning Video Transformers for Word-Level Bangla Sign Language: A Comparative Analysis for Classification Tasks0
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions0
MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching0
Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness0
Simple, Good, Fast: Self-Supervised World Models Free of BaggageCode1
MISLEADER: Defending against Model Extraction with Ensembles of Distilled ModelsCode0
How Explanations Leak the Decision Logic: Stealing Graph Neural Networks via Explanation AlignmentCode0
OmniV2V: Versatile Video Generation and Editing via Dynamic Content ManipulationCode5
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentationCode0
3D Skeleton-Based Action Recognition: A Review0
Lightweight Convolutional Neural Networks for Retinal Disease Classification0
Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain AdaptationCode0
Leveraging Intermediate Features of Vision Transformer for Face Anti-Spoofing0
Reinforcing Video Reasoning with Focused ThinkingCode1
Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic SegmentationCode0
SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds0
Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC0
Show:102550
← PrevPage 2 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified