Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4851–4900 of 8378 papers

Title	Date	Tasks	Status	Hype
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training	Feb 8, 2022	Data AugmentationSpeech Separation	CodeCode Available	1
Robust Hybrid Learning With Expert Augmentation	Feb 8, 2022	Data Augmentationvalid	CodeCode Available	1
DeepSSN: a deep convolutional neural network to assess spatial scene similarity	Feb 7, 2022	Data AugmentationInformation Retrieval	CodeCode Available	0
Field-of-View IoU for Object Detection in 360° Images	Feb 7, 2022	Data AugmentationERP	—Unverified	0
SODA: Self-organizing data augmentation in deep neural networks -- Application to biomedical image segmentation tasks	Feb 7, 2022	Data AugmentationImage Segmentation	—Unverified	0
Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study	Feb 7, 2022	Data AugmentationEvent Detection	—Unverified	0
SimGRACE: A Simple Framework for Graph Contrastive Learning without Data Augmentation	Feb 7, 2022	Contrastive LearningData Augmentation	CodeCode Available	1
Multi-modal data generation with a deep metric variational autoencoder	Feb 7, 2022	Data AugmentationTriplet	—Unverified	0
Data set creation and empirical analysis for detecting signs of depression from social media postings	Feb 7, 2022	Data Augmentation	CodeCode Available	1
LiDAR dataset distillation within bayesian active learning framework: Understanding the effect of data augmentation	Feb 6, 2022	Active LearningAutonomous Driving	—Unverified	0
TTS-GAN: A Transformer-based Time-Series Generative Adversarial Network	Feb 6, 2022	Data AugmentationDimensionality Reduction	CodeCode Available	2
Exemplar-Based Contrastive Self-Supervised Learning with Few-Shot Class Incremental Learning	Feb 5, 2022	class-incremental learningClass Incremental Learning	—Unverified	0
Fairness for Text Classification Tasks with Identity Information Data Augmentation Methods	Feb 4, 2022	counterfactualData Augmentation	—Unverified	0
Multi-Output Gaussian Process-Based Data Augmentation for Multi-Building and Multi-Floor Indoor Localization	Feb 4, 2022	Data AugmentationIndoor Localization	—Unverified	0
Deep invariant networks with differentiable augmentation layers	Feb 4, 2022	Bilevel OptimizationData Augmentation	CodeCode Available	1
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge	Feb 4, 2022	Action DetectionActivity Detection	—Unverified	0
Supervised Contrastive Learning for Product Matching	Feb 4, 2022	Contrastive LearningData Augmentation	CodeCode Available	1
Bootstrapped Representation Learning for Skeleton-Based Action Recognition	Feb 4, 2022	Action RecognitionData Augmentation	—Unverified	0
A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes	Feb 3, 2022	Data AugmentationEvent Detection	—Unverified	0
Learning Mechanically Driven Emergent Behavior with Message Passing Neural Networks	Feb 3, 2022	BIG-bench Machine LearningData Augmentation	CodeCode Available	0
The RoyalFlush System of Speech Recognition for M2MeT Challenge	Feb 3, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
NoisyMix: Boosting Model Robustness to Common Corruptions	Feb 2, 2022	Data Augmentationmodel	—Unverified	0
Generalizability of Machine Learning Models: Quantitative Evaluation of Three Methodological Pitfalls	Feb 1, 2022	BIG-bench Machine LearningData Augmentation	—Unverified	0
Deep Learning in fNIRS: A review	Jan 31, 2022	Brain Computer InterfaceClassification	—Unverified	0
Compositionality as Lexical Symmetry	Jan 30, 2022	Data AugmentationInductive Bias	CodeCode Available	0
Improving Robustness by Enhancing Weak Subnets	Jan 30, 2022	Adversarial RobustnessData Augmentation	CodeCode Available	0
Graph Representation Learning via Aggregation Enhancement	Jan 30, 2022	Data AugmentationGraph Representation Learning	CodeCode Available	1
FedMed-ATL: Misaligned Unpaired Brain Image Synthesis via Affine Transform Loss	Jan 29, 2022	Data AugmentationImage Generation	CodeCode Available	1
Improving End-to-End Models for Set Prediction in Spoken Language Understanding	Jan 28, 2022	Data AugmentationDecoder	—Unverified	0
You Only Cut Once: Boosting Data Augmentation with a Single Cut	Jan 28, 2022	Data AugmentationDiversity	CodeCode Available	1
Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation	Jan 28, 2022	Data AugmentationReinforcement Learning (RL)	—Unverified	0
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition	Jan 27, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Systematic Investigation of Strategies Tailored for Low-Resource Settings for Low-Resource Dependency Parsing	Jan 27, 2022	Data AugmentationDependency Parsing	CodeCode Available	0
Arrhythmia Classification using CGAN-augmented ECG Signals	Jan 26, 2022	Arrhythmia DetectionClassification	CodeCode Available	1
Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques	Jan 26, 2022	Data AugmentationMachine Translation	CodeCode Available	0
Recency Dropout for Recurrent Recommender Systems	Jan 26, 2022	Data AugmentationRecommendation Systems	—Unverified	0
Challenges and Opportunities for Machine Learning Classification of Behavior and Mental State from Images	Jan 26, 2022	Active LearningBIG-bench Machine Learning	—Unverified	0
Cardiac Disease Diagnosis on Imbalanced Electrocardiography Data Through Optimal Transport Augmentation	Jan 25, 2022	Data Augmentation	—Unverified	0
ViT-HGR: Vision Transformer-based Hand Gesture Recognition from High Density Surface EMG Signals	Jan 25, 2022	Data AugmentationGesture Recognition	CodeCode Available	1
Neural Manifold Clustering and Embedding	Jan 24, 2022	ClusteringData Augmentation	CodeCode Available	1
On-Device Learning with Cloud-Coordinated Data Augmentation for Extreme Model Personalization in Recommender Systems	Jan 24, 2022	Data AugmentationRecommendation Systems	—Unverified	0
Synthetic speech detection using meta-learning with prototypical loss	Jan 24, 2022	Data AugmentationMeta-Learning	—Unverified	0
Feature transforms for image data augmentation	Jan 24, 2022	Data Augmentationimage-classification	CodeCode Available	0
A Novel Mix-normalization Method for Generalizable Multi-source Person Re-identification	Jan 24, 2022	Data AugmentationPerson Re-Identification	—Unverified	0
VIPriors 2: Visual Inductive Priors for Data-Efficient Deep Learning Challenges	Jan 21, 2022	Data AugmentationDeep Learning	—Unverified	0
Fair Node Representation Learning via Adaptive Data Augmentation	Jan 21, 2022	Contrastive LearningData Augmentation	—Unverified	0
Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation	Jan 21, 2022	ClassificationContrastive Learning	CodeCode Available	1
Learning-by-Novel-View-Synthesis for Full-Face Appearance-Based 3D Gaze Estimation	Jan 20, 2022	3D Face ReconstructionData Augmentation	CodeCode Available	0
Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction	Jan 20, 2022	Data AugmentationDecoder	—Unverified	0
Lung Swapping Autoencoder: Learning a Disentangled Structure-texture Representation of Chest Radiographs	Jan 18, 2022	Data Augmentation	CodeCode Available	0

Show:10 25 50

← PrevPage 98 of 168Next →

All datasets ImageNet CIFAR-10 GA1457

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	DeiT-B (+MixPro)	Accuracy (%)	82.9	—	Unverified
2	ResNet-200 (DeepAA)	Accuracy (%)	81.32	—	Unverified
3	DeiT-S (+MixPro)	Accuracy (%)	81.3	—	Unverified
4	ResNet-200 (Fast AA)	Accuracy (%)	80.6	—	Unverified
5	ResNet-200 (UA)	Accuracy (%)	80.4	—	Unverified
6	ResNet-200 (AA)	Accuracy (%)	80	—	Unverified
7	ResNet-50 (DeepAA)	Accuracy (%)	78.3	—	Unverified
8	ResNet-50 (TA wide)	Accuracy (%)	78.07	—	Unverified
9	ResNet-50 (LoRot-E)	Accuracy (%)	77.72	—	Unverified
10	ResNet-50 (LoRot-I)	Accuracy (%)	77.71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WideResNet-40-2 (Faster AA)	Percentage error	3.7	—	Unverified
2	Shake-Shake (26 2×32d) (Faster AA)	Percentage error	2.7	—	Unverified
3	WideResNet-28-10 (Faster AA)	Percentage error	2.6	—	Unverified
4	Shake-Shake (26 2×112d) (Faster AA)	Percentage error	2	—	Unverified
5	Shake-Shake (26 2×96d) (Faster AA)	Percentage error	2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DiffAug	Classification Accuracy	92.7	—	Unverified
2	PaCMAP	Classification Accuracy	85.3	—	Unverified
3	hNNE	Classification Accuracy	77.4	—	Unverified
4	TopoAE	Classification Accuracy	74.6	—	Unverified