SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 55015550 of 8378 papers

TitleStatusHype
Pulling Up by the Causal Bootstraps: Causal Data Augmentation for Pre-training DebiasingCode1
StackMix and Blot Augmentations for Handwritten Text RecognitionCode1
TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured ScenariosCode1
When Do Contrastive Learning Signals Help Spatio-Temporal Graph Forecasting?Code1
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning0
ChessMix: Spatial Context Data Augmentation for Remote Sensing Semantic SegmentationCode0
Data Augmentation for Low-Resource Named Entity Recognition Using BacktranslationCode0
StyleAugment: Learning Texture De-biased Representations by Style Augmentation without Pre-defined Textures0
OOWL500: Overcoming Dataset Collection Bias in the Wild0
Contrastive Learning of User Behavior Sequence for Context-Aware Document RankingCode1
Self-Supervised Graph Co-Training for Session-based RecommendationCode1
Jointly Learnable Data Augmentations for Self-Supervised GNNsCode1
Sarcasm Detection in Twitter -- Performance Impact while using Data Augmentation: Word EmbeddingsCode0
Influence-guided Data Augmentation for Neural Tensor CompletionCode0
Deploying a BERT-based Query-Title Relevance Classifier in a Production System: a View from the Trenches0
A Unified Transformer-based Framework for Duplex Text Normalization0
DTWSSE: Data Augmentation with a Siamese Encoder for Time Series0
Data Augmentation Using Many-To-Many RNNs for Session-Aware Recommender SystemsCode0
SemiFed: Semi-supervised Federated Learning with Consistency and Pseudo-Labeling0
Mitigating Greenhouse Gas Emissions Through Generative Adversarial Networks Based Wildfire Prediction0
Exploring Data Aggregation and Transformations to Generalize across Visual DomainsCode1
Segmentation of Lungs COVID Infected Regions by Attention Mechanism and Synthetic Data0
Neural TMDlayer: Modeling Instantaneous flow of features via SDE GeneratorsCode0
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency DomainCode1
Perturb, Predict & Paraphrase: Semi-Supervised Learning using Noisy Student for Image CaptioningCode0
Augmenting Slot Values and Contexts for Spoken Language Understanding with Pretrained ModelsCode0
Directing the violence or admonishing it? A survey of contronymy and androcentrism in Google Translate and some recommendationsCode0
Practical X-ray Gastric Cancer Diagnostic Support Using Refined Stochastic Data Augmentation and Hard Boundary Box TrainingCode0
Scarce Data Driven Deep Learning of Drones via Generalized Data Distribution Space0
Semantic Perturbations with Normalizing Flows for Improved GeneralizationCode1
Tailor: Generating and Perturbing Text with Semantic Controls0
An Empirical Survey of Data Augmentation \ Limited Data Learning in NLP0
Adapting Multilingual Models for Code-Mixed Translation using Back-to-Back Translation0
A Comparison of Strategies for Source-Free Domain Adaptation0
Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNsCode1
KCNet: An Insect-Inspired Single-Hidden-Layer Neural Network with Randomized Binary Weights for Prediction and Classification TasksCode0
Data Efficient Human Intention Prediction: Leveraging Neural Network Verification and Expert Guidance0
Data Augmentation for Scene Text RecognitionCode1
Data Augmentation and CNN Classification For Automatic COVID-19 Diagnosis From CT-Scan Images On Small Dataset0
CarveMix: A Simple Data Augmentation Method for Brain Lesion SegmentationCode1
SSH: A Self-Supervised Framework for Image HarmonizationCode1
ST3D++: Denoised Self-training for Unsupervised Domain Adaptation on 3D Object Detection0
FlipDA: Effective and Robust Data Augmentation for Few-Shot LearningCode1
SimCVD: Simple Contrastive Voxel-Wise Representation Distillation for Semi-Supervised Medical Image Segmentation0
Cascade Bagging for Accuracy Prediction with Few Training SamplesCode0
Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation)Code1
Log-Spectral Matching GAN: PPG-based Atrial Fibrillation Detection can be Enhanced by GAN-based Data Augmentation with Integration of Spectral LossCode0
An empirical investigation into audio pipeline approaches for classifying bird species0
InfoGAN-MSF: a data augmentation approach for correlative bridge monitoring factors0
Towards artificially intelligent recycling Improving image processing for waste classification0
Show:102550
← PrevPage 111 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified