SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 19512000 of 8378 papers

TitleStatusHype
End-to-End Action Segmentation Transformer0
An Empirical Study of Causal Relation Extraction Transfer: Design and Data0
Removing Geometric Bias in One-Class Anomaly Detection with Adaptive Feature PerturbationCode0
Disconnect to Connect: A Data Augmentation Method for Improving Topology Accuracy in Image SegmentationCode0
From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning0
SAS: Segment Anything Small for Ultrasound -- A Non-Generative Data Augmentation Technique for Robust Deep Learning in Ultrasound Imaging0
Image Data Augmentation for the TAIGA-IACT Experiment with Conditional Generative Adversarial Networks0
HILGEN: Hierarchically-Informed Data Generation for Biomedical NER Using Knowledgebases and Large Language Models0
Using Test-Time Data Augmentation for Cross-Domain Atrial Fibrillation Detection from ECG Signals0
Augmentation-Based Deep Learning for Identification of Circulating Tumor Cells0
Towards Robust Universal Information Extraction: Benchmark, Evaluation, and Solution0
Effective LLM Knowledge Learning via Model Generalization0
DeepGrav: Anomalous Gravitational-Wave Detection Through Deep Latent FeaturesCode0
Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching0
Synthetic Data Augmentation for Enhancing Harmful Algal Bloom Detection with Machine LearningCode0
Automatic Drywall Analysis for Progress Tracking and Quality Control in Construction0
TopoMortar: A dataset to evaluate image segmentation methods focused on topology accuracyCode0
Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds?0
Limited Effectiveness of LLM-based Data Augmentation for COVID-19 Misinformation Stance Detection0
Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions0
BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Iterative Optimization and Diffusion Modeling0
Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic SegmentationCode0
A Generalized Theory of Mixup for Structure-Preserving Synthetic DataCode0
Data Augmentation for NeRFs in the Low Data Limit0
Random Walks in Self-supervised Learning for Triangular Meshes0
AI-Augmented Thyroid Scintigraphy for Robust Classification0
A data augmentation strategy for deep neural networks with application to epidemic modelling0
MIDAS: Mixing Ambiguous Data with Soft Labels for Dynamic Facial Expression Recognition0
Style Content Decomposition-based Data Augmentation for Domain Generalizable Medical Image SegmentationCode0
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity RepresentationCode0
Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model0
Learning with Exact Invariances in Polynomial Time0
cMIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning0
Teaching Dense Retrieval Models to Specialize with Listwise Distillation and LLM Data AugmentationCode0
Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation0
SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data AugmentationCode0
Graph Augmentation for Cross Graph Domain Generalization0
Iterative Counterfactual Data AugmentationCode0
Robust Polyp Detection and Diagnosis through Compositional Prompt-Guided Diffusion Models0
MAGE: Multi-Head Attention Guided Embeddings for Low Resource Sentiment Classification0
Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training0
Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking0
Contrastive Visual Data Augmentation0
GCC: Generative Color Constancy via Diffusing a Color Checker0
On Neural Inertial Classification Networks for Pedestrian Activity Recognition0
SQLong: Enhanced NL2SQL for Longer Contexts with LLMs0
USegMix: Unsupervised Segment Mix for Efficient Data Augmentation in Pathology Images0
Physics-Informed Gradient Estimation for Accelerating Deep Learning based AC-OPF0
Patch Stitching Data Augmentation for Cancer Classification in Pathology Images0
Enhancing LLMs for Identifying and Prioritizing Important Medical Jargons from Electronic Health Record Notes Utilizing Data Augmentation0
Show:102550
← PrevPage 40 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified