SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 63516400 of 8378 papers

TitleStatusHype
Data InStance Prior (DISP) in Generative Adversarial Networks0
Understanding How Dimension Reduction Tools Work: An Empirical Approach to Deciphering t-SNE, UMAP, TriMAP, and PaCMAP for Data VisualizationCode1
Frame-level SpecAugment for Deep Convolutional Neural Networks in Hybrid ASR Systems0
Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting SummarizationCode1
Unsupervised Pre-training for Person Re-identificationCode1
VideoMix: Rethinking Data Augmentation for Video ClassificationCode1
GenScan: A Generative Method for Populating Parametric 3D Scan Datasets0
A Pseudo-labelling Auto-Encoder for unsupervised image classification0
Dataset of Random Relaxations for Crystal Structure Search of Li-Si System0
What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical PerspectiveCode1
Enhanced Offensive Language Detection Through Data Augmentation0
Generating Synthetic Multispectral Satellite Imagery from Sentinel-20
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation0
Data-Efficient Methods for Dialogue Systems0
Kernel-convoluted Deep Neural Networks with Data AugmentationCode0
Delexicalized Paraphrase Generation0
Boosting offline handwritten text recognition in historical documents with few labeled lines0
Localization of Malaria Parasites and White Blood Cells in Thick Blood Smears0
Aerial Imagery Pixel-level SegmentationCode1
Intervention Design for Effective Sim2Real TransferCode0
Multi-Label Contrastive Learning for Abstract Visual ReasoningCode0
Learning Two-Stream CNN for Multi-Modal Age-related Macular Degeneration CategorizationCode1
FenceBox: A Platform for Defeating Adversarial Examples with Data Augmentation TechniquesCode0
Red Blood Cell Segmentation with Overlapping Cell Separation and Classification on Imbalanced DatasetCode1
Improved Contrastive Divergence Training of Energy Based ModelsCode1
How Robust are Randomized Smoothing based Defenses to Data Poisoning?0
A Self-Supervised Feature Map Augmentation (FMA) Loss and Combined Augmentations Finetuning to Efficiently Improve the Robustness of CNNs0
The ADAPT Centre’s Neural MT Systems for the WAT 2020 Document-Level Translation Task0
TMU Japanese-English Multimodal Machine Translation System for WAT 20200
Chinese Grammatical Error Correction Based on Hybrid Models with Data Augmentation0
Twitter Data Augmentation for Monitoring Public Opinion on COVID-19 Intervention Measures0
FiNLP at FinCausal 2020 Task 1: Mixture of BERTs for Causal Sentence Identification in Financial TextsCode0
PhraseOut: A Code Mixed Data Augmentation Method for MultilingualNeural Machine Tranlsation0
Arabic dialect identification: An Arabic-BERT model with data augmentation and ensembling strategy0
Parallel resources for Tunisian Arabic Dialect Translation0
SMM4H Shared Task 2020 - A Hybrid Pipeline for Identifying Prescription Drug Abuse from Twitter: Machine Learning, Deep Learning, and Post-Processing0
Medication Mention Detection in Tweets Using ELECTRA Transformers and Decision Trees0
IMSurReal Too: IMS in the Surface Realization Shared Task 2020Code0
ADAPT at SR’20: How Preprocessing and Data Augmentation Help to Improve Surface Realization0
AraBench: Benchmarking Dialectal Arabic-English Machine Translation0
Data Augmentation for Multiclass Utterance Classification -- A Systematic Study0
Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer's Disease Detection0
Augmenting NLP models using Latent Feature Interpolations0
Towards building a Robust Industry-scale Question Answering System0
Data Selection for Bilingual Lexicon Induction from Specialized Comparable Corpora0
Scalable Cross-lingual Treebank Synthesis for Improved Production Dependency Parsers0
Data Augmentation via Subtree Swapping for Dependency Parsing of Low-Resource Languages0
HateGAN: Adversarial Generative-Based Data Augmentation for Hate Speech Detection0
Domain Transfer based Data Augmentation for Neural Query Translation0
Unifying Input and Output Smoothing in Neural Machine Translation0
Show:102550
← PrevPage 128 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified