SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 22512300 of 8378 papers

TitleStatusHype
Combining Weakly Supervised ML Techniques for Low-Resource NLU0
A Spatio-Temporal Neural Network Forecasting Approach for Emulation of Firefront Models0
AFSC: Adaptive Fourier Space Compression for Anomaly Detection0
Combining Transformer Generators with Convolutional Discriminators0
Combining Pyramid Pooling and Attention Mechanism for Pelvic MR Image Semantic Segmentaion0
A Span-based Model for Extracting Overlapping PICO Entities from RCT Publications0
Combining Noise-to-Image and Image-to-Image GANs: Brain MR Image Augmentation for Tumor Detection0
Combining Multi-Sequence and Synthetic Images for Improved Segmentation of Late Gadolinium Enhancement Cardiac MRI0
ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection0
A breakthrough in Speech emotion recognition using Deep Retinal Convolution Neural Networks0
Domain-Agnostic Clustering with Self-Distillation0
Combining Image Features and Patient Metadata to Enhance Transfer Learning0
A Smartphone-Based Skin Disease Classification Using MobileNet CNN0
Combining High-Level Features of Raw Audio Waves and Mel-Spectrograms for Audio Tagging0
Combining Euclidean Alignment and Data Augmentation for BCI decoding0
A Small Claims Court for the NLP: Judging Legal Text Classification Strategies With Small Datasets0
AfriNames: Most ASR models "butcher" African Names0
Combining Ensembles and Data Augmentation can Harm your Calibration0
Ask-n-Learn: Active Learning via Reliable Gradient Representations for Image Classification0
A Free Lunch for Unsupervised Domain Adaptive Object Detection without Source Data0
Combination of multiple neural networks using transfer learning and extensive geometric data augmentation for assessing cellularity scores in histopathology images0
PixCell: A generative foundation model for digital histopathology images0
Combination of Domain Knowledge and Deep Learning for Sentiment Analysis of Short and Informal Messages on Social Media0
Combating COVID-19 using Generative Adversarial Networks and Artificial Intelligence for Medical Images: A Scoping Review0
Color Variants Identification in Fashion e-commerce via Contrastive Self-Supervised Representation Learning0
ColorUNet: A convolutional classification approach to colorization0
A Simplified Framework for Contrastive Learning for Node Representations0
A Framework for Supervised and Unsupervised Segmentation and Classification of Materials Microstructure Images0
Supervised Graph Contrastive Learning for Few-shot Node Classification0
Adapting Multilingual Models for Code-Mixed Translation using Back-to-Back Translation0
ColMix -- A Simple Data Augmentation Framework to Improve Object Detector Performance and Robustness in Aerial Images0
A Simple Strategy to Provable Invariance via Orbit Mapping0
Learning Visual Robotic Control Efficiently with Contrastive Pre-training and Data Augmentation0
Does Synthetic Data Make Large Language Models More Efficient?0
Does VLN Pretraining Work with Nonsensical or Irrelevant Instructions?0
Domain-adaptive and Subgroup-specific Cascaded Temperature Regression for Out-of-distribution Calibration0
Domain Disentanglement with Interpolative Data Augmentation for Dual-Target Cross-Domain Recommendation0
Domain Generalized Recaptured Screen Image Identification Using SWIN Transformer0
IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation0
Does Data Augmentation Lead to Positive Margin?0
Cold Start Streaming Learning for Deep Networks0
Does enhanced shape bias improve neural network robustness to common corruptions?0
Cognitive Biases in Large Language Models for News Recommendation0
CoDo: Contrastive Learning with Downstream Background Invariance for Detection0
A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI0
Code-Switching without Switching: Language Agnostic End-to-End Speech Translation0
Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition0
A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes0
Does equivariance matter at scale?0
A Fourier Perspective on Model Robustness in Computer Vision0
Show:102550
← PrevPage 46 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified