SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 69517000 of 8378 papers

TitleStatusHype
The Penalty Imposed by Ablated Data Augmentation0
Learning Diagnosis of COVID-19 from a Single Radiological ImageCode0
Enhancing Facial Data Diversity with Style-based Face Aging0
An Empirical Analysis of the Impact of Data Augmentation on Knowledge Distillation0
Data Augmentation using Generative Adversarial Networks (GANs) for GAN-based Detection of Pneumonia and COVID-19 in Chest X-ray Images0
Learning Neural Light Transport0
Data Augmentation for Enhancing EEG-based Emotion Recognition with Deep Generative Models0
Handling missing data in model-based clustering0
End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT20200
Self-supervised Training of Graph Convolutional NetworksCode1
SaliencyMix: A Saliency Guided Data Augmentation Strategy for Better RegularizationCode1
SmoothMix: A Simple Yet Effective Data Augmentation to Train Robust Classifiers0
Augmenting DL with Adversarial Training for Robust Prediction of Epilepsy SeizuresCode1
A U-Net Based Discriminator for Generative Adversarial Networks0
Probabilistic Structural Latent Representation for Unsupervised EmbeddingCode0
Composing Good Shots by Exploiting Mutual RelationsCode1
Learning Augmentation Network via Influence Functions0
Bayesian Adversarial Human Motion SynthesisCode1
A Comprehensive Study of Data Augmentation Strategies for Prostate Cancer Detection in Diffusion-weighted MRI using Convolutional Neural Networks0
Symbol Spotting on Digital Architectural Floor Plans Using a Deep Learning-based Framework0
Automatic classification between COVID-19 pneumonia, non-COVID-19 pneumonia, and the healthy on chest X-ray image: combination of data augmentation methodsCode0
Pseudo-Representation Labeling Semi-Supervised Learning0
Data Augmentation with Unsupervised Machine Translation Improves the Structural Similarity of Cross-lingual Word Embeddings0
A Comparative Study of Lexical Substitution Approaches based on Neural Language Models0
Synthesizing lesions using contextual GANs improves breast cancer classification on mammograms0
ACGAN-based Data Augmentation Integrated with Long-term Scalogram for Acoustic Scene Classification0
Generative Adversarial Networks for Bitcoin Data Augmentation0
Learning Robust Feature Representations for Scene Text Detection0
Towards a Robust WiFi-based Fall Detection with Adversarial Data Augmentation0
A Joint Pixel and Feature Alignment Framework for Cross-dataset Palmprint RecognitionCode1
ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 20200
Networks with pixels embedding: a method to improve noise resistance in images classificationCode0
DeltaPy: A Framework for Tabular Data Augmentation in PythonCode1
Microphone Array Based Surveillance Audio Classification0
Graph Random Neural Network for Semi-Supervised Learning on GraphsCode1
Multistream CNN for Robust Acoustic Modeling0
Fluent Response Generation for Conversational Question AnsweringCode1
Training Keyword Spotting Models on Non-IID Data with Federated Learning0
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition0
Cross-Domain Few-Shot Learning with Meta Fine-Tuning0
What Makes for Good Views for Contrastive Learning?0
Lung Segmentation from Chest X-rays using Variational Data ImputationCode1
AutoML Segmentation for 3D Medical Image Data: Contribution to the MSD Challenge 2018Code1
Improving Deep Learning Models via Constraint-Based Domain Knowledge: a Brief Survey0
Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets0
Iterative Pseudo-Labeling for Speech RecognitionCode0
The NTNU System at the Interspeech 2020 Non-Native Children's Speech ASR Challenge0
Feature Transformation Ensemble Model with Batch Spectral Regularization for Cross-Domain Few-Shot Classification0
On the effectiveness of GAN generated cardiac MRIs for segmentation0
Throwing Darts in the Dark? Detecting Bots with Limited Data using Neural Data Augmentation0
Show:102550
← PrevPage 140 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified