SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 401450 of 8378 papers

TitleStatusHype
Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching0
DeepGrav: Anomalous Gravitational-Wave Detection Through Deep Latent FeaturesCode0
Synthetic Data Augmentation for Enhancing Harmful Algal Bloom Detection with Machine LearningCode0
Augmentation-Based Deep Learning for Identification of Circulating Tumor Cells0
TopoMortar: A dataset to evaluate image segmentation methods focused on topology accuracyCode0
Effective LLM Knowledge Learning via Model Generalization0
Automatic Drywall Analysis for Progress Tracking and Quality Control in Construction0
Towards Robust Universal Information Extraction: Benchmark, Evaluation, and Solution0
Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic SegmentationCode0
Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions0
Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds?0
Limited Effectiveness of LLM-based Data Augmentation for COVID-19 Misinformation Stance Detection0
BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Iterative Optimization and Diffusion Modeling0
A Generalized Theory of Mixup for Structure-Preserving Synthetic DataCode0
Data Augmentation for NeRFs in the Low Data Limit0
Composed Multi-modal Retrieval: A Survey of Approaches and ApplicationsCode2
Random Walks in Self-supervised Learning for Triangular Meshes0
AI-Augmented Thyroid Scintigraphy for Robust Classification0
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity RepresentationCode0
MIDAS: Mixing Ambiguous Data with Soft Labels for Dynamic Facial Expression Recognition0
A data augmentation strategy for deep neural networks with application to epidemic modelling0
Style Content Decomposition-based Data Augmentation for Domain Generalizable Medical Image SegmentationCode0
Teaching Dense Retrieval Models to Specialize with Listwise Distillation and LLM Data AugmentationCode0
Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation0
Learning with Exact Invariances in Polynomial Time0
cMIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning0
ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic EnvironmentsCode1
Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model0
SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data AugmentationCode0
Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking0
Graph Augmentation for Cross Graph Domain Generalization0
Robust Polyp Detection and Diagnosis through Compositional Prompt-Guided Diffusion Models0
Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training0
Multi-Perspective Data Augmentation for Few-shot Object DetectionCode1
MAGE: Multi-Head Attention Guided Embeddings for Low Resource Sentiment Classification0
Iterative Counterfactual Data AugmentationCode0
Contrastive Visual Data Augmentation0
GCC: Generative Color Constancy via Diffusing a Color Checker0
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future DirectionsCode2
On Neural Inertial Classification Networks for Pedestrian Activity Recognition0
SQLong: Enhanced NL2SQL for Longer Contexts with LLMs0
Patch Stitching Data Augmentation for Cancer Classification in Pathology Images0
Single Domain Generalization with Model-aware Parametric Batch-wise Mixup0
Physics-Informed Gradient Estimation for Accelerating Deep Learning based AC-OPF0
USegMix: Unsupervised Segment Mix for Efficient Data Augmentation in Pathology Images0
Enhancing LLMs for Identifying and Prioritizing Important Medical Jargons from Electronic Health Record Notes Utilizing Data Augmentation0
SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-trainingCode1
Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear InterpolationCode0
Inverse Surrogate Model of a Soft X-Ray Spectrometer using Domain Adaptation0
Steganographic Embeddings as an Effective Data AugmentationCode0
Show:102550
← PrevPage 9 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified