SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 18011850 of 8378 papers

TitleStatusHype
A Time-Series Data Augmentation Model through Diffusion and Transformer Integration0
The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them)0
Data-Driven Optical To Thermal Inference in Pool Boiling Using Generative Adversarial Networks0
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers0
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training0
Light Weight CNN for classification of Brain Tumors from MRI Images0
BrightCookies at SemEval-2025 Task 9: Exploring Data Augmentation for Food Hazard ClassificationCode0
RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library0
DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models0
Dual Attention Driven Lumbar Magnetic Resonance Image Feature Enhancement and Automatic Diagnosis of Herniation0
Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets0
ResearchCodeAgent: An LLM Multi-Agent System for Automated Codification of Research Methodologies0
ProFi-Net: Prototype-based Feature Attention with Curriculum Augmentation for WiFi-based Gesture Recognition0
Improving Generalization in MRI-Based Deep Learning Models for Total Knee Replacement Prediction0
SynLexLM: Scaling Legal LLMs with Synthetic Data and Curriculum Learning0
MediAug: Exploring Visual Augmentation in Medical ImagingCode0
Outlier-aware Tensor Robust Principal Component Analysis with Self-guided Data Augmentation0
Generative AI for Physical-Layer Authentication0
CKMDiff: A Generative Diffusion Model for CKM Construction via Inverse Problems with Learned Priors0
Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images0
DAPLSR: Data Augmentation Partial Least Squares Regression Model via Manifold Optimization0
Assessing the Feasibility of Internet-Sourced Video for Automatic Cattle Lameness Detection0
VLM-based Prompts as the Optimal Assistant for Unpaired Histopathology Virtual StainingCode0
Few-shot Hate Speech Detection Based on the MindSpore Framework0
From Reviews to Dialogues: Active Synthesis for Zero-Shot LLM-based Conversational Recommender System0
Diffusion Bridge Models for 3D Medical Image Translation0
From Dialect Gaps to Identity Maps: Tackling Variability in Speaker Verification0
Enhancing DR Classification with Swin Transformer and Shifted Window Attention0
Data Augmentation Using Neural Acoustic Fields With Retrieval-Augmented Pre-training0
Beyond One-Hot Labels: Semantic Mixing for Model CalibrationCode0
Effective Dual-Region Augmentation for Reduced Reliance on Large Amounts of Labeled DataCode0
LIFT+: Lightweight Fine-Tuning for Long-Tail LearningCode0
MathPhys-Guided Coarse-to-Fine Anomaly Synthesis with SQE-Driven Bi-Level Optimization for Anomaly Detection0
ACoRN: Noise-Robust Abstractive Compression in Retrieval-Augmented Language Models0
Crossing the Human-Robot Embodiment Gap with Sim-to-Real RL using One Human Demonstration0
Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios0
CDUPatch: Color-Driven Universal Adversarial Patch Attack for Dual-Modal Visible-Infrared Detectors0
MASSeg : 2nd Technical Report for 4th PVUW MOSE TrackCode0
VAE-based Feature Disentanglement for Data Augmentation and Compression in Generalized GNSS Interference Classification0
Decoupled Diffusion Sparks Adaptive Scene Generation0
Improving In-Context Learning with Reasoning DistillationCode0
Data Augmentation Through Random Style Replacement0
Towards contrast- and pathology-agnostic clinical fetal brain MRI segmentation using SynthSeg0
Dual-Path Enhancements in Event-Based Eye Tracking: Augmented Robustness and Adaptive Temporal Modeling0
Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark0
Span-level Emotion-Cause-Category Triplet Extraction with Instruction Tuning LLMs and Data AugmentationCode0
Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference UnderstandingCode0
seg2med: a bridge from artificial anatomy to multimodal medical images0
Diffusion Models for Robotic Manipulation: A Survey0
MedRep: Medical Concept Representation for General Electronic Health Record Foundation ModelsCode0
Show:102550
← PrevPage 37 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified