SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 68516900 of 8378 papers

TitleStatusHype
I Prefer not to Say: Protecting User Consent in Models with Optional Personal DataCode0
DocEmul: a Toolkit to Generate Structured Historical DocumentsCode0
Can Synthetic Faces Undo the Damage of Dataset Bias to Face Recognition and Facial Landmark Detection?Code0
In-Contextual Gender Bias Suppression for Large Language ModelsCode0
Harnessing Data Augmentation to Quantify Uncertainty in the Early Estimation of Single-Photon Source QualityCode0
Incipient Fault Detection in Power Distribution System: A Time-Frequency Embedded Deep Learning Based ApproachCode0
IMSurReal Too: IMS in the Surface Realization Shared Task 2020Code0
Improving the U-Net Configuration for Automated Delineation of Head and Neck Cancer on MRICode0
Adversarial Graph Contrastive Learning with Information RegularizationCode0
Improving the Training of Data-Efficient GANs via Quality Aware Dynamic Discriminator Rejection SamplingCode0
DMix: Adaptive Distance-aware Interpolative MixupCode0
Improving the Robustness of Question Answering Systems to Question ParaphrasingCode0
Improving the Robustness of Dense Retrievers Against Typos via Multi-Positive Contrastive LearningCode0
Isometric Transformations for Image Augmentation in Mueller Matrix PolarimetryCode0
DLCRec: A Novel Approach for Managing Diversity in LLM-Based Recommender SystemsCode0
Singing Voice Separation Using a Deep Convolutional Neural Network Trained by Ideal Binary Mask and Cross EntropyCode0
You Only Need Half: Boosting Data Augmentation by Using Partial ContentCode0
Unsupervised hard Negative Augmentation for contrastive learningCode0
Improving Systematic Generalization Through Modularity and AugmentationCode0
DisturbLabel: Regularizing CNN on the Loss LayerCode0
ARM: Augment-REINFORCE-Merge Gradient for Stochastic Binary NetworksCode0
Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised LearningCode0
Probabilistic Spatial Transformer NetworksCode0
Iterative Counterfactual Data AugmentationCode0
Probabilistic Structural Latent Representation for Unsupervised EmbeddingCode0
Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion ModelsCode0
Probabilistic Structural Latent Representation for Unsupervised EmbeddingCode0
Investigating Prior Knowledge for Challenging Chinese Machine Reading ComprehensionCode0
Iterative Pseudo-Labeling for Speech RecognitionCode0
ARIEL: Adversarial Graph Contrastive LearningCode0
Improving SSVEP BCI Spellers With Data Augmentation and Language ModelsCode0
Improving Socratic Question Generation using Data Augmentation and Preference OptimizationCode0
Improving Skeleton-based Action Recognition with Interactive Object InformationCode0
Probing the Robustness of Pre-trained Language Models for Entity MatchingCode0
Distributional Data Augmentation Methods for Low Resource LanguageCode0
BrightCookies at SemEval-2025 Task 9: Exploring Data Augmentation for Food Hazard ClassificationCode0
Improving singing voice separation with the Wave-U-Net using Minimum Hyperspherical EnergyCode0
IUST at ClimateActivism 2024: Towards Optimal Stance Detection: A Systematic Study of Architectural Choices and Data Cleaning TechniquesCode0
Bridging the Gaps: Utilizing Unlabeled Face Recognition Datasets to Boost Semi-Supervised Facial Expression RecognitionCode0
Transfer learning and subword sampling for asymmetric-resource one-to-many neural translationCode0
Single-stage uav detection and classification with yolov5: Mosaic data augmentation and panetCode0
Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning MethodsCode0
Transfer Learning for Low-Resource Sentiment AnalysisCode0
Distinguishing rule- and exemplar-based generalization in learning systemsCode0
Distinguishing Non-natural from Natural Adversarial Samples for More Robust Pre-trained Language ModelCode0
JoB-VS: Joint Brain-Vessel Segmentation in TOF-MRA ImagesCode0
Progressive Fusion for Unsupervised Binocular Depth Estimation using Cycled NetworksCode0
Improving satellite imagery segmentation using multiple Sentinel-2 revisitsCode0
The Effectiveness of Data Augmentation in Image Classification using Deep LearningCode0
Distillation Learning Guided by Image Reconstruction for One-Shot Medical Image SegmentationCode0
Show:102550
← PrevPage 138 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified