SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 401425 of 8378 papers

TitleStatusHype
SeiT++: Masked Token Modeling Improves Storage-efficient TrainingCode1
Fusion of Audio and Visual Embeddings for Sound Event Localization and DetectionCode1
Semi-Supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMixCode1
Toward Robustness in Multi-label Classification: A Data Augmentation Strategy against Imbalance and NoiseCode1
Progressive Multi-Modality Learning for Inverse Protein FoldingCode1
SimPSI: A Simple Strategy to Preserve Spectral Information in Time Series Data AugmentationCode1
D3A-TS: Denoising-Driven Data Augmentation in Time SeriesCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation PerspectiveCode1
Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series ClassificationCode1
Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic SegmentationCode1
GeNIe: Generative Hard Negative Images Through DiffusionCode1
Steerers: A framework for rotation equivariant keypoint descriptorsCode1
Toward Improving Robustness of Object Detectors Against Domain ShiftCode1
Dataset Distillation via Curriculum Data Synthesis in Large Data EraCode1
A Simple Recipe for Language-guided Domain Generalized SegmentationCode1
Alternate Diverse Teaching for Semi-supervised Medical Image SegmentationCode1
CLAP: Isolating Content from Style through Contrastive Learning with Augmented PromptsCode1
OpusCleaner and OpusTrainer, open source toolkits for training Machine Translation and Large language modelsCode1
Unified Domain Adaptive Semantic SegmentationCode1
Feature Extraction for Generative Medical Imaging Evaluation: New Evidence Against an Evolving TrendCode1
Immunohistochemistry guided segmentation of benign epithelial cells, in situ lesions, and invasive epithelial cells in breast cancer slidesCode1
Generating Progressive Images from Pathological Transitions via Diffusion ModelCode1
Adapting pretrained speech model for Mandarin lyrics transcription and alignmentCode1
NNG-Mix: Improving Semi-supervised Anomaly Detection with Pseudo-anomaly GenerationCode1
Show:102550
← PrevPage 17 of 336Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified