Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5451–5500 of 8378 papers

Title	Date	Tasks	Status	Hype
SanitAIs: Unsupervised Data Augmentation to Sanitize Trojaned Neural Networks	Sep 9, 2021	Data AugmentationSelf-Supervised Learning	—Unverified	0
Table-based Fact Verification with Salience-aware Learning	Sep 9, 2021	counterfactualData Augmentation	CodeCode Available	0
HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints	Sep 9, 2021	Data AugmentationDecoder	—Unverified	0
ESimCSE: Enhanced Sample Building Method for Contrastive Learning of Unsupervised Sentence Embedding	Sep 9, 2021	Contrastive LearningData Augmentation	CodeCode Available	1
Learning with Different Amounts of Annotation: From Zero to Many Labels	Sep 9, 2021	Data AugmentationEntity Typing	CodeCode Available	0
Translate & Fill: Improving Zero-Shot Multilingual Semantic Parsing with Synthetic Data	Sep 9, 2021	Data AugmentationSemantic Parsing	—Unverified	0
Smelting Gold and Silver for Improved Multilingual AMR-to-Text Generation	Sep 8, 2021	AMR-to-Text GenerationData Augmentation	CodeCode Available	0
It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books	Sep 8, 2021	Answer GenerationData Augmentation	CodeCode Available	1
Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach	Sep 8, 2021	Data AugmentationDecoder	CodeCode Available	0
Generatively Augmented Neural Network Watchdog for Image Classification Networks	Sep 7, 2021	ClassificationData Augmentation	—Unverified	0
GANSER: A Self-supervised Data Augmentation Framework for EEG-based Emotion Recognition	Sep 7, 2021	Data AugmentationEEG	—Unverified	0
CRNNTL: convolutional recurrent neural network and transfer learning for QSAR modelling	Sep 7, 2021	Data AugmentationTransfer Learning	—Unverified	0
Self-supervised Tumor Segmentation through Layer Decomposition	Sep 7, 2021	Brain Tumor SegmentationData Augmentation	—Unverified	0
GOLD: Improving Out-of-Scope Detection in Dialogues using Data Augmentation	Sep 7, 2021	Data Augmentation	CodeCode Available	1
Evaluation of Convolutional Neural Networks for COVID-19 Classification on Chest X-Rays	Sep 6, 2021	Data AugmentationSpecificity	—Unverified	0
Sensor Data Augmentation by Resampling for Contrastive Learning in Human Activity Recognition	Sep 5, 2021	Activity RecognitionContrastive Learning	—Unverified	0
Deep Person Generation: A Survey from the Perspective of Face, Pose and Cloth Synthesis	Sep 5, 2021	Data AugmentationSurvey	—Unverified	0
Robust Mitosis Detection Using a Cascade Mask-RCNN Approach With Domain-Specific Residual Cycle-GAN Data Augmentation	Sep 4, 2021	Data AugmentationMitosis Detection	—Unverified	0
Self-Supervised Detection of Contextual Synonyms in a Multi-Class Setting: Phenotype Annotation Use Case	Sep 4, 2021	Data AugmentationWord Embeddings	—Unverified	0
Data Augmentation for Cross-Domain Named Entity Recognition	Sep 4, 2021	Cross-Domain Named Entity RecognitionData Augmentation	CodeCode Available	1
Self-supervised Pseudo Multi-class Pre-training for Unsupervised Anomaly Detection and Segmentation in Medical Images	Sep 3, 2021	Anomaly DetectionContrastive Learning	CodeCode Available	1
Learning from Multiple Noisy Augmented Data Sets for Better Cross-Lingual Spoken Language Understanding	Sep 3, 2021	Data AugmentationDenoising	—Unverified	0
MitoDet: Simple and robust mitosis detection	Sep 2, 2021	Data AugmentationDomain Generalization	—Unverified	0
Generative Models for Multi-Illumination Color Constancy	Sep 2, 2021	Color ConstancyData Augmentation	—Unverified	0
Transformer Networks for Data Augmentation of Human Physical Activity Recognition	Sep 2, 2021	Activity RecognitionData Augmentation	CodeCode Available	1
Rotation Invariance and Extensive Data Augmentation: a strategy for the Mitosis Domain Generalization (MIDOG) Challenge	Sep 2, 2021	Data AugmentationDomain Generalization	—Unverified	0
Application of Mix-Up Method in Document Classification Task Using BERT	Sep 1, 2021	ClassificationData Augmentation	—Unverified	0
Application of Deep Learning Methods to SNOMED CT Encoding of Clinical Texts: From Data Collection to Extreme Multi-Label Text-Based Classification	Sep 1, 2021	ClassificationData Augmentation	—Unverified	0
Solving SCAN Tasks with Data Augmentation and Input Embeddings	Sep 1, 2021	Data Augmentation	CodeCode Available	0
Precog-LTRC-IIITH at GermEval 2021: Ensembling Pre-Trained Language Models with Feature Engineering	Sep 1, 2021	Data AugmentationFeature Engineering	CodeCode Available	0
DFKI SLT at GermEval 2021: Multilingual Pre-training and Data Augmentation for the Classification of Toxicity in Social Media Comments	Sep 1, 2021	Data Augmentation	CodeCode Available	0
Multi-Sample based Contrastive Loss for Top-k Recommendation	Sep 1, 2021	Contrastive LearningData Augmentation	CodeCode Available	1
Domain Adaptive Cascade R-CNN for MItosis DOmain Generalization (MIDOG) Challenge	Sep 1, 2021	Data AugmentationDomain Generalization	—Unverified	0
Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds	Sep 1, 2021	3D Object Detection3D Point Cloud Classification	CodeCode Available	1
Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification	Sep 1, 2021	Bayesian OptimizationClassification	CodeCode Available	1
Maximum F1-score training for end-to-end mispronunciation detection and diagnosis of L2 English speech	Aug 31, 2021	Data Augmentation	—Unverified	0
Using convolutional neural networks for the classification of breast cancer images	Aug 31, 2021	Data AugmentationTransfer Learning	CodeCode Available	0
MELM: Data Augmentation with Masked Entity Language Modeling for Low-Resource NER	Aug 31, 2021	Cross-Lingual NERData Augmentation	CodeCode Available	1
Cross-Lingual Text Classification of Transliterated Hindi and Malayalam	Aug 31, 2021	BenchmarkingClassification	CodeCode Available	0
Detecting Mitosis against Domain Shift using a Fused Detector and Deep Ensemble Classification Model for MIDOG Challenge	Aug 31, 2021	Data AugmentationPrognosis	—Unverified	0
ScatSimCLR: self-supervised contrastive learning with pretext task regularization for small-scale datasets	Aug 31, 2021	ClassificationContrastive Learning	CodeCode Available	1
Europarl-ASR: A Large Corpus of Parliamentary Debates for Streaming ASR Benchmarking and Speech Data Filtering/Verbatimization	Aug 30, 2021	BenchmarkingData Augmentation	—Unverified	0
AEDA: An Easier Data Augmentation Technique for Text Classification	Aug 30, 2021	ClassificationData Augmentation	CodeCode Available	1
Open Set RF Fingerprinting using Generative Outlier Augmentation	Aug 30, 2021	ClassificationData Augmentation	—Unverified	0
InSE-NET: A Perceptually Coded Audio Quality Model based on CNN	Aug 30, 2021	Audio Quality AssessmentData Augmentation	—Unverified	0
3DStyleNet: Creating 3D Shapes with Geometric and Texture Style Variations	Aug 30, 2021	3D ReconstructionData Augmentation	—Unverified	0
ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding	Aug 30, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Towards Fine-grained Image Classification with Generative Adversarial Networks and Facial Landmark Detection	Aug 28, 2021	ClassificationData Augmentation	CodeCode Available	1
High performing ensemble of convolutional neural networks for insect pest image detection	Aug 28, 2021	Data Augmentation	—Unverified	0
Improving Semi-Supervised and Domain-Adaptive Semantic Segmentation with Self-Supervised Depth Estimation	Aug 28, 2021	Data AugmentationDepth Estimation	CodeCode Available	1

Show:10 25 50

← PrevPage 110 of 168Next →

All datasets ImageNet CIFAR-10 GA1457

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	DeiT-B (+MixPro)	Accuracy (%)	82.9	—	Unverified
2	ResNet-200 (DeepAA)	Accuracy (%)	81.32	—	Unverified
3	DeiT-S (+MixPro)	Accuracy (%)	81.3	—	Unverified
4	ResNet-200 (Fast AA)	Accuracy (%)	80.6	—	Unverified
5	ResNet-200 (UA)	Accuracy (%)	80.4	—	Unverified
6	ResNet-200 (AA)	Accuracy (%)	80	—	Unverified
7	ResNet-50 (DeepAA)	Accuracy (%)	78.3	—	Unverified
8	ResNet-50 (TA wide)	Accuracy (%)	78.07	—	Unverified
9	ResNet-50 (LoRot-E)	Accuracy (%)	77.72	—	Unverified
10	ResNet-50 (LoRot-I)	Accuracy (%)	77.71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WideResNet-40-2 (Faster AA)	Percentage error	3.7	—	Unverified
2	Shake-Shake (26 2×32d) (Faster AA)	Percentage error	2.7	—	Unverified
3	WideResNet-28-10 (Faster AA)	Percentage error	2.6	—	Unverified
4	Shake-Shake (26 2×112d) (Faster AA)	Percentage error	2	—	Unverified
5	Shake-Shake (26 2×96d) (Faster AA)	Percentage error	2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DiffAug	Classification Accuracy	92.7	—	Unverified
2	PaCMAP	Classification Accuracy	85.3	—	Unverified
3	hNNE	Classification Accuracy	77.4	—	Unverified
4	TopoAE	Classification Accuracy	74.6	—	Unverified