SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 14011450 of 8378 papers

TitleStatusHype
PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose EstimationCode1
Pose-disentangled Contrastive Learning for Self-supervised Facial RepresentationCode1
NCAGC: A Neighborhood Contrast Framework for Attributed Graph ClusteringCode1
Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive LearningCode1
Dual Contrastive Learning: Text Classification via Label-Aware Data AugmentationCode1
PRIME: A few primitives can boost robustness to common corruptionsCode1
ID-Blau: Image Deblurring by Implicit Diffusion-based reBLurring AUgmentationCode1
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language ModelsCode1
AutoDC: Automated data-centric processingCode1
EEG-Inception: An Accurate and Robust End-to-End Neural Network for EEG-based Motor Imagery ClassificationCode1
Acoustic echo cancellation with the dual-signal transformation LSTM networkCode1
Easter2.0: Improving convolutional models for handwritten text recognitionCode1
EC-GAN: Low-Sample Classification using Semi-Supervised Algorithms and GANsCode1
PromptMix: A Class Boundary Augmentation Method for Large Language Model DistillationCode1
IDA: Improved Data Augmentation Applied to Salient Object DetectionCode1
EEG-GAN: Generative adversarial networks for electroencephalograhic (EEG) brain signalsCode1
ECNU-SenseMaker at SemEval-2020 Task 4: Leveraging Heterogeneous Knowledge Resources for Commonsense Validation and ExplanationCode1
Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from PixelsCode1
Hyperspectral Image Super-Resolution with Spectral Mixup and Heterogeneous DatasetsCode1
Overcoming challenges in leveraging GANs for few-shot data augmentationCode1
A Probabilistic Framework for Knowledge Graph Data AugmentationCode1
HyperTab: Hypernetwork Approach for Deep Learning on Small Tabular DatasetsCode1
Capturing complex hand movements and object interactions using machine learning-powered stretchable smart textile glovesCode1
AADG: Automatic Augmentation for Domain Generalization on Retinal Image SegmentationCode1
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation PerspectiveCode1
EEG Synthetic Data Generation Using Probabilistic Diffusion ModelsCode1
AutoCLINT: The Winning Method in AutoCV Challenge 2019Code1
AutoBalance: Optimized Loss Functions for Imbalanced DataCode1
Effective Pre-Training of Audio Transformers for Sound Event DetectionCode1
RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic ObservationsCode1
HybridAugment++: Unified Frequency Spectra Perturbations for Model RobustnessCode1
CarveMix: A Simple Data Augmentation Method for Brain Lesion SegmentationCode1
HypMix: Hyperbolic Interpolative Data AugmentationCode1
Raindrops on Windshield: Dataset and Lightweight Gradient-Based Detection AlgorithmCode1
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge DistillationCode1
Cascaded deep monocular 3D human pose estimation with evolutionary training dataCode1
EfficientDeRain: Learning Pixel-wise Dilation Filtering for High-Efficiency Single-Image DerainingCode1
Efficient Contrastive Learning via Novel Data Augmentation and Curriculum LearningCode1
ChimeraMix: Image Classification on Small Datasets via Masked Feature MixingCode1
Efficiently Modeling Long Sequences with Structured State SpacesCode1
AAPL: Adding Attributes to Prompt Learning for Vision-Language ModelsCode1
End-to-end lyrics Recognition with Voice to Singing Style TransferCode1
How to trust unlabeled data? Instance Credibility Inference for Few-Shot LearningCode1
Efficient Model for Image Classification With Regularization TricksCode1
HRSAM: Efficient Interactive Segmentation in High-Resolution ImagesCode1
CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color ConstancyCode1
CellMix: A General Instance Relationship based Method for Data Augmentation Towards Pathology Image ClassificationCode1
Causal Action Influence Aware Counterfactual Data AugmentationCode1
Entailment as Few-Shot LearnerCode1
CCGL: Contrastive Cascade Graph LearningCode1
Show:102550
← PrevPage 29 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified