Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5901–5950 of 8378 papers

Title	Date	Tasks	Status
Avoiding Overlap in Data Augmentation for AMR-to-Text Generation	Aug 1, 2021	AMR-to-Text GenerationData Augmentation	—Unverified
Awareness of uncertainty in classification using a multivariate model and multi-views	Apr 16, 2024	Data Augmentation	—Unverified
Backdoor Attack and Defense in Federated Generative Adversarial Network-based Medical Image Synthesis	Oct 19, 2022	Backdoor AttackData Augmentation	—Unverified
Background Mixup Data Augmentation for Hand and Object-in-Contact Detection	Feb 28, 2022	Contact DetectionData Augmentation	—Unverified
Back-Translation-Style Data Augmentation for End-to-End ASR	Jul 28, 2018	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation	Nov 17, 2022	Data AugmentationMachine Translation	—Unverified
Bag of Tricks for Developing Diabetic Retinopathy Analysis Framework to Overcome Data Scarcity	Oct 18, 2022	Data AugmentationEnsemble Learning	—Unverified
Bag of Tricks for Long-Tailed Multi-Label Classification on Chest X-Rays	Aug 17, 2023	Data AugmentationMulti-Label Classification	—Unverified
Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data	Dec 19, 2024	AutoMLcross-modal alignment	—Unverified
Baidu Neural Machine Translation Systems for WMT19	Aug 1, 2019	Data AugmentationDomain Adaptation	—Unverified
BaIT: Barometer for Information Trustworthiness	Jun 15, 2022	Data AugmentationNatural Language Inference	—Unverified
Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance	Feb 18, 2024	Data Augmentation	—Unverified
Balanced Masked and Standard Face Recognition	Oct 4, 2021	Data AugmentationFace Detection	—Unverified
Balancing Accuracy and Efficiency in Multi-Turn Intent Classification for LLM-Powered Dialog Systems in Production	Nov 19, 2024	ClassificationData Augmentation	—Unverified
Balancing Act: Distribution-Guided Debiasing in Diffusion Models	Feb 28, 2024	AttributeData Augmentation	—Unverified
Balancing Data through Data Augmentation Improves the Generality of Transfer Learning for Diabetic Retinopathy Classification	May 25, 2022	Data AugmentationTransfer Learning	—Unverified
Banana Sub-Family Classification and Quality Prediction using Computer Vision	Apr 6, 2022	ClassificationData Augmentation	—Unverified
BanglaNLP at BLP-2023 Task 1: Benchmarking different Transformer Models for Violence Inciting Text Detection in Bengali	Oct 16, 2023	BenchmarkingData Augmentation	—Unverified
Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation	Mar 14, 2024	Data AugmentationMachine Translation	—Unverified
Bayes-enhanced Multi-view Attention Networks for Robust POI Recommendation	Nov 1, 2023	Data AugmentationRepresentation Learning	—Unverified
Bayesian Analysis of Dynamic Linear Topic Models	Nov 12, 2015	Data AugmentationDynamic Topic Modeling	—Unverified
Bayesian CART models for insurance claims frequency	Mar 3, 2023	Data AugmentationModel Selection	—Unverified
Bayesian estimation of finite mixtures of Tobit models	Nov 14, 2024	Data Augmentation	—Unverified
Bayesian Generative Active Deep Learning	Apr 26, 2019	Active LearningData Augmentation	—Unverified
Bayesian Inference for Gamma Models	Jun 3, 2021	Bayesian InferenceData Augmentation	—Unverified
Data Augementation with Polya Inverse Gamma	May 29, 2019	Bayesian InferenceData Augmentation	—Unverified
Bayesian Mixed-Frequency Quantile Vector Autoregression: Eliciting tail risks of Monthly US GDP	Sep 5, 2022	Data Augmentationquantile regression	—Unverified
Bayesian Non-Homogeneous Markov Models via Polya-Gamma Data Augmentation with Applications to Rainfall Modeling	Jan 11, 2017	Data Augmentationspeech-recognition	—Unverified
Bayesian Semi-supervised Multi-category Classification under Nonparanormality	Jan 11, 2020	Binary ClassificationClassification	—Unverified
Bayesian Uncertainty Estimation by Hamiltonian Monte Carlo: Applications to Cardiac MRI Segmentation	Mar 4, 2024	Data AugmentationImage Segmentation	—Unverified
BBoxCut: A Targeted Data Augmentation Technique for Enhancing Wheat Head Detection Under Occlusions	Mar 31, 2025	Data AugmentationHead Detection	—Unverified
BCNet: A Deep Convolutional Neural Network for Breast Cancer Grading	Jul 11, 2021	Breast Cancer DetectionData Augmentation	—Unverified
Bemba Speech Translation: Exploring a Low-Resource African Language	May 5, 2025	Data AugmentationTranslation	—Unverified
Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios	Apr 16, 2025	Audio Deepfake DetectionBenchmarking	—Unverified
Benchmarking CNN on 3D Anatomical Brain MRI: Architectures, Data Augmentation and Deep Ensemble Learning	Jun 2, 2021	BenchmarkingData Augmentation	—Unverified
Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise	Jan 3, 2023	BenchmarkingClassification	—Unverified
Benchmarking Machine Learning Methods for Distributed Acoustic Sensing	Mar 26, 2025	BenchmarkingData Augmentation	—Unverified
Benchmarking Robustness in Neural Radiance Fields	Jan 10, 2023	BenchmarkingCamera Calibration	—Unverified
Benchmarking the Reliability of Post-training Quantization: a Particular Focus on Worst-case Performance	Mar 23, 2023	BenchmarkingData Augmentation	—Unverified
Benchmarking zero-shot and few-shot approaches for tokenization, tagging, and dependency parsing of Tagalog text	Aug 3, 2022	BenchmarkingData Augmentation	—Unverified
Benefits of Data Augmentation for NMT-based Text Normalization of User-Generated Content	Nov 1, 2019	Data AugmentationDecoder	—Unverified
Bengali Document Layout Analysis -- A YOLOV8 Based Ensembling Approach	Sep 2, 2023	Data AugmentationDocument Layout Analysis	—Unverified
Bengali Document Layout Analysis with Detectron2	Aug 26, 2023	Data AugmentationDocument Layout Analysis	—Unverified
“Be nice to your wife! The restaurants are closed”: Can Gender Stereotype Detection Improve Sexism Classification?	Nov 1, 2021	ClassificationData Augmentation	—Unverified
Benign Adversarial Attack: Tricking Models for Goodness	Jul 26, 2021	Adversarial AttackAttribute	—Unverified
BERT is Robust! A Case Against Synonym-Based Adversarial Examples in Text Classification	Sep 15, 2021	Data Augmentationtext-classification	—Unverified
BERT is Robust! A Case Against Synonym-Based Adversarial Examples in Text Classification	Nov 16, 2021	Data Augmentationtext-classification	—Unverified
Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems	Apr 18, 2021	Adversarial AttackAutomatic Speech Recognition	—Unverified
Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems	Jun 16, 2021	Data AugmentationEmotion Recognition	—Unverified
Best Practices in Pool-based Active Learning for Image Classification	Sep 29, 2021	Active LearningBenchmarking	—Unverified

Show:10 25 50

← PrevPage 119 of 168Next →

All datasets ImageNet CIFAR-10 GA1457

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	DeiT-B (+MixPro)	Accuracy (%)	82.9	—	Unverified
2	ResNet-200 (DeepAA)	Accuracy (%)	81.32	—	Unverified
3	DeiT-S (+MixPro)	Accuracy (%)	81.3	—	Unverified
4	ResNet-200 (Fast AA)	Accuracy (%)	80.6	—	Unverified
5	ResNet-200 (UA)	Accuracy (%)	80.4	—	Unverified
6	ResNet-200 (AA)	Accuracy (%)	80	—	Unverified
7	ResNet-50 (DeepAA)	Accuracy (%)	78.3	—	Unverified
8	ResNet-50 (TA wide)	Accuracy (%)	78.07	—	Unverified
9	ResNet-50 (LoRot-E)	Accuracy (%)	77.72	—	Unverified
10	ResNet-50 (LoRot-I)	Accuracy (%)	77.71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WideResNet-40-2 (Faster AA)	Percentage error	3.7	—	Unverified
2	Shake-Shake (26 2×32d) (Faster AA)	Percentage error	2.7	—	Unverified
3	WideResNet-28-10 (Faster AA)	Percentage error	2.6	—	Unverified
4	Shake-Shake (26 2×112d) (Faster AA)	Percentage error	2	—	Unverified
5	Shake-Shake (26 2×96d) (Faster AA)	Percentage error	2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DiffAug	Classification Accuracy	92.7	—	Unverified
2	PaCMAP	Classification Accuracy	85.3	—	Unverified
3	hNNE	Classification Accuracy	77.4	—	Unverified
4	TopoAE	Classification Accuracy	74.6	—	Unverified