SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 251300 of 8378 papers

TitleStatusHype
Robust image representations with counterfactual contrastive learningCode1
From FDG to PSMA: A Hitchhiker's Guide to Multitracer, Multicenter Lesion Segmentation in PET/CT ImagingCode1
Effective Pre-Training of Audio Transformers for Sound Event DetectionCode1
GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map ConstructionCode1
Data Augmentation via Latent Diffusion for Saliency PredictionCode1
EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image ClassificationCode1
Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image SegmentationCode1
OCMG-Net: Neural Oriented Normal Refinement for Unstructured Point CloudsCode1
Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce ClassificationCode1
Legilimens: Practical and Unified Content Moderation for Large Language Model ServicesCode1
GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small DatasetsCode1
DIAGen: Diverse Image Augmentation with Generative ModelsCode1
GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language ModelsCode1
SenPa-MAE: Sensor Parameter Aware Masked Autoencoder for Multi-Satellite Self-Supervised PretrainingCode1
Wave-Mask/Mix: Exploring Wavelet-Based Augmentations for Time Series ForecastingCode1
TDNetGen: Empowering Complex Network Resilience Prediction with Generative Augmentation of Topology and DynamicsCode1
Generative Dataset Distillation Based on Diffusion ModelCode1
Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text GuidanceCode1
VulScribeR: Exploring RAG-based Vulnerability Augmentation with LLMsCode1
Training-Free Condition Video Diffusion Models for single frame Spatial-Semantic Echocardiogram SynthesisCode1
HINER: Neural Representation for Hyperspectral ImageCode1
Leveraging Foundation Models for Zero-Shot IoT SensingCode1
SMPISD-MTPNet: Scene Semantic Prior-Assisted Infrared Ship Detection Using Multi-Task Perception NetworksCode1
Self-Supervision Improves Diffusion Models for Tabular Data ImputationCode1
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language ModelsCode1
Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action RecognitionCode1
Multi-Modal and Multi-Attribute Generation of Single Cells with CFGenCode1
SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions -- An EndoVis'24 ChallengeCode1
Augmented Neural Fine-Tuning for Efficient Backdoor PurificationCode1
CAR-MFL: Cross-Modal Augmentation by Retrieval for Multimodal Federated Learning with Missing ModalitiesCode1
SUMix: Mixup with Semantic and Uncertain InformationCode1
Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and SynthesisCode1
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-InstructCode1
On the power of data augmentation for head pose estimationCode1
Fine-Grained and Interpretable Neural Speech EditingCode1
Diffusion as Sound Propagation: Physics-inspired Model for Ultrasound Image GenerationCode1
Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect DetectionCode1
LLMAEL: Large Language Models are Good Context Augmenters for Entity LinkingCode1
HRSAM: Efficient Interactive Segmentation in High-Resolution ImagesCode1
MedMNIST-C: Comprehensive benchmark and improved classifier robustness by simulating realistic image corruptionsCode1
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language ModelsCode1
RuleR: Improving LLM Controllability by Rule-based Data RecyclingCode1
Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding BoxesCode1
Voice Disorder Analysis: a Transformer-based ApproachCode1
Advancing Fine-Grained Classification by Structure and Subject Preserving AugmentationCode1
Zero-Shot Image Denoising for High-Resolution Electron MicroscopyCode1
QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQLCode1
DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS CameraCode1
Dataset Enhancement with Instance-Level AugmentationsCode1
MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword SpottingCode1
Show:102550
← PrevPage 6 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified