Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4901–4950 of 8378 papers

Title	Date	Tasks	Status	Hype
Syntax-based data augmentation for Hungarian-English machine translation	Jan 18, 2022	Data AugmentationMachine Translation	CodeCode Available	0
Boosting Robustness of Image Matting with Context Assembling and Strong Data Augmentation	Jan 18, 2022	Data AugmentationDecoder	—Unverified	0
Accelerating Representation Learning with View-Consistent Dynamics in Data-Efficient Reinforcement Learning	Jan 18, 2022	Data AugmentationDeep Reinforcement Learning	—Unverified	0
MODALS: Data augmentation that works for everyone	Jan 17, 2022	Data Augmentation	—Unverified	0
AugLy: Data Augmentations for Robustness	Jan 17, 2022	Adversarial RobustnessData Augmentation	CodeCode Available	5
Retrieval Enhanced Data Augmentation for Question Answering on Privacy Policies	Jan 16, 2022	Data AugmentationDiversity	—Unverified	0
Improving Robustness in Multilingual Machine Translation via Data Augmentation	Jan 16, 2022	Data AugmentationMachine Translation	—Unverified	0
SUBS: Subtree Substitution for Compositional Semantic Parsing	Jan 16, 2022	Data AugmentationSemantic Parsing	—Unverified	0
Data Augmentation for Low-Resource Dialogue Summarization	Jan 16, 2022	Data AugmentationMeeting Summarization	—Unverified	0
Sentence-Level Resampling for Named Entity Recognition	Jan 16, 2022	Data Augmentationnamed-entity-recognition	—Unverified	0
Label-guided Data Augmentation for Prompt-based Few Shot Learners	Jan 16, 2022	Data AugmentationFew-Shot Learning	—Unverified	0
Product Answer Generation from Heterogeneous Sources: A New Benchmark and Best Practices	Jan 16, 2022	Answer GenerationData Augmentation	—Unverified	0
Improving Data Augmentation in Low-resource Question Answering with Active Learning in Multiple Stages	Jan 16, 2022	Active LearningAnswer Generation	—Unverified	0
Enhancing Robustness in Aspect-based Sentiment Analysis by Better Exploiting Data Augmentation	Jan 16, 2022	Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA)	—Unverified	0
Non-Autoregressive Neural Machine Translation with Consistency Regularization Optimized Variational Framework	Jan 16, 2022	Data Augmentationde-en	—Unverified	0
Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning	Jan 16, 2022	Data AugmentationGraph Attention	—Unverified	0
Data Augmentation for Biomedical Factoid Question Answering	Jan 16, 2022	Data AugmentationInformation Retrieval	—Unverified	0
DG2: Data Augmentation Through Document Grounded Dialogue Generation	Jan 16, 2022	Data AugmentationDialogue Generation	—Unverified	0
Improving negation detection with negation-focused pre-training	Jan 16, 2022	Data AugmentationDiversity	—Unverified	0
SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples	Jan 16, 2022	Contrastive LearningData Augmentation	CodeCode Available	1
Recent Progress in the CUHK Dysarthric Speech Recognition System	Jan 15, 2022	Audio-Visual Speech RecognitionAutomatic Speech Recognition	—Unverified	0
Time Series Generation with Masked Autoencoder	Jan 14, 2022	Data AugmentationDecoder	CodeCode Available	1
ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization	Jan 14, 2022	Abstractive Text SummarizationData Augmentation	—Unverified	0
Investigation of Data Augmentation Techniques for Disordered Speech Recognition	Jan 14, 2022	Data Augmentationspeech-recognition	—Unverified	0
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition	Jan 14, 2022	Data Augmentationspeech-recognition	—Unverified	0
Making a (Counterfactual) Difference One Rationale at a Time	Jan 13, 2022	counterfactualData Augmentation	CodeCode Available	0
Multi-task Pre-training Language Model for Semantic Network Completion	Jan 13, 2022	Contrastive LearningData Augmentation	CodeCode Available	0
On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles	Jan 13, 2022	Adversarial AttackAdversarial Robustness	CodeCode Available	1
VoLux-GAN: A Generative Model for 3D Face Synthesis with HDRI Relighting	Jan 13, 2022	3D-Aware Image SynthesisData Augmentation	—Unverified	0
Data augmentation through multivariate scenario forecasting in Data Centers using Generative Adversarial Networks	Jan 12, 2022	Data AugmentationTime Series	CodeCode Available	0
Motion-Focused Contrastive Learning of Video Representations	Jan 11, 2022	Contrastive LearningData Augmentation	CodeCode Available	1
Learning Fair Node Representations with Graph Counterfactual Fairness	Jan 10, 2022	Attributecounterfactual	CodeCode Available	1
MyoPS: A Benchmark of Myocardial Pathology Segmentation Combining Three-Sequence Cardiac Magnetic Resonance Images	Jan 10, 2022	Data AugmentationManagement	—Unverified	0
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks	Jan 10, 2022	Data Augmentationimage-classification	—Unverified	0
Model-Based Image Signal Processors via Learnable Dictionaries	Jan 10, 2022	Color ConstancyData Augmentation	—Unverified	0
Iterative training of robust k-space interpolation networks for improved image reconstruction with limited scan specific training samples	Jan 10, 2022	Data AugmentationImage Reconstruction	—Unverified	0
A study on cross-corpus speech emotion recognition and data augmentation	Jan 10, 2022	Cross-corpusData Augmentation	—Unverified	0
Invariance encoding in sliced-Wasserstein space for image classification with limited training data	Jan 9, 2022	Data Augmentationimage-classification	CodeCode Available	0
Semantic-based Data Augmentation for Math Word Problems	Jan 7, 2022	Data AugmentationMath	—Unverified	0
Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition	Jan 7, 2022	Data AugmentationLanguage Modeling	—Unverified	0
GenLabel: Mixup Relabeling using Generative Models	Jan 7, 2022	Adversarial RobustnessData Augmentation	—Unverified	0
Uncertainty-Aware Cascaded Dilation Filtering for High-Efficiency Deraining	Jan 7, 2022	Data AugmentationRain Removal	CodeCode Available	1
A 1D CNN for high accuracy classification and transfer learning in motor imagery EEG-based brain-computer interface	Jan 6, 2022	Brain Computer InterfaceData Augmentation	CodeCode Available	1
EM-driven unsupervised learning for efficient motion segmentation	Jan 6, 2022	Data AugmentationMotion Segmentation	CodeCode Available	1
Multi-Grid Redundant Bounding Box Annotation for Accurate Object Detection	Jan 5, 2022	Data AugmentationObject	—Unverified	0
FROTE: Feedback Rule-Driven Oversampling for Editing Models	Jan 4, 2022	Data AugmentationManagement	—Unverified	0
AutoBalance: Optimized Loss Functions for Imbalanced Data	Jan 4, 2022	Data AugmentationFairness	CodeCode Available	1
Data Augmentation for Depression Detection Using Skeleton-Based Gait Information	Jan 4, 2022	Data AugmentationDepression Detection	—Unverified	0
Quantifying Uncertainty in Deep Learning Approaches to Radio Galaxy Classification	Jan 4, 2022	ClassificationData Augmentation	CodeCode Available	0
Learning to Generate Novel Classes for Deep Metric Learning	Jan 4, 2022	Data AugmentationMetric Learning	—Unverified	0

Show:10 25 50

← PrevPage 99 of 168Next →

All datasets ImageNet CIFAR-10 GA1457

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	DeiT-B (+MixPro)	Accuracy (%)	82.9	—	Unverified
2	ResNet-200 (DeepAA)	Accuracy (%)	81.32	—	Unverified
3	DeiT-S (+MixPro)	Accuracy (%)	81.3	—	Unverified
4	ResNet-200 (Fast AA)	Accuracy (%)	80.6	—	Unverified
5	ResNet-200 (UA)	Accuracy (%)	80.4	—	Unverified
6	ResNet-200 (AA)	Accuracy (%)	80	—	Unverified
7	ResNet-50 (DeepAA)	Accuracy (%)	78.3	—	Unverified
8	ResNet-50 (TA wide)	Accuracy (%)	78.07	—	Unverified
9	ResNet-50 (LoRot-E)	Accuracy (%)	77.72	—	Unverified
10	ResNet-50 (LoRot-I)	Accuracy (%)	77.71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WideResNet-40-2 (Faster AA)	Percentage error	3.7	—	Unverified
2	Shake-Shake (26 2×32d) (Faster AA)	Percentage error	2.7	—	Unverified
3	WideResNet-28-10 (Faster AA)	Percentage error	2.6	—	Unverified
4	Shake-Shake (26 2×112d) (Faster AA)	Percentage error	2	—	Unverified
5	Shake-Shake (26 2×96d) (Faster AA)	Percentage error	2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DiffAug	Classification Accuracy	92.7	—	Unverified
2	PaCMAP	Classification Accuracy	85.3	—	Unverified
3	hNNE	Classification Accuracy	77.4	—	Unverified
4	TopoAE	Classification Accuracy	74.6	—	Unverified