Data Augmentation

Data augmentation refers to techniques that increase the amount of training data by applying modifications to examples in the original dataset. Besides enlarging the dataset, it also increases its diversity. When training machine learning models, data augmentation acts as a regularizer and helps reduce overfitting.
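As a rough illustration of how modifications expand a dataset, the sketch below keeps every original example and appends randomly modified copies with the same label. The names `expand_dataset`, `jitter`, and `scale`, the noise level, and the scaling range are all hypothetical choices made for this example. Whether a given modification preserves the label depends on the task.

```python
import random

def expand_dataset(examples, modifications, copies_per_example=2, seed=0):
    """Return the original examples plus randomly modified copies.

    examples: list of (features, label) pairs.
    modifications: functions taking (features, rng) and returning new features;
        they are assumed to preserve the label.
    """
    rng = random.Random(seed)
    augmented = list(examples)  # keep every original example
    for features, label in examples:
        for _ in range(copies_per_example):
            modify = rng.choice(modifications)
            augmented.append((modify(features, rng), label))
    return augmented

def jitter(features, rng):
    # Hypothetical modification: add small Gaussian noise to each value.
    return [x + rng.gauss(0.0, 0.05) for x in features]

def scale(features, rng):
    # Hypothetical modification: rescale all values by a factor near 1.
    factor = rng.uniform(0.9, 1.1)
    return [x * factor for x in features]

data = [([1.0, 2.0], "a"), ([3.0, 4.0], "b")]
bigger = expand_dataset(data, [jitter, scale])
print(len(data), "->", len(bigger))  # 2 -> 6
```

The modified copies are typically used only for training, so that evaluation still reflects the original data distribution.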

Data augmentation techniques have proven useful in domains such as NLP and computer vision. In computer vision, common transformations include cropping, flipping, and rotation. In NLP, techniques include swapping, deletion, and random insertion, among others.
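The operations named above can be sketched in a few lines of plain Python. This is a minimal sketch under stated assumptions: an image is a list of pixel rows, a sentence is a list of tokens, and random insertion draws from a caller-supplied `vocabulary` (a hypothetical stand-in for whatever word source a real pipeline would use). All function names are illustrative and do not come from any particular library.

```python
import random

# Computer vision: an image is assumed to be a list of pixel rows.
def horizontal_flip(image):
    return [row[::-1] for row in image]

def crop(image, top, left, height, width):
    return [row[left:left + width] for row in image[top:top + height]]

def rotate_90_clockwise(image):
    # Reverse the row order, then transpose.
    return [list(row) for row in zip(*image[::-1])]

# NLP: a sentence is assumed to be a list of tokens.
def random_swap(tokens, rng):
    tokens = list(tokens)
    if len(tokens) >= 2:
        i, j = rng.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return tokens

def random_deletion(tokens, rng, p=0.2):
    # Drop each token with probability p, but never return an empty
    # sentence when the input was non-empty.
    if not tokens:
        return []
    kept = [t for t in tokens if rng.random() >= p]
    return kept if kept else [rng.choice(tokens)]

def random_insertion(tokens, rng, vocabulary):
    # Insert one word from the (hypothetical) vocabulary at a random position.
    tokens = list(tokens)
    tokens.insert(rng.randint(0, len(tokens)), rng.choice(vocabulary))
    return tokens

image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(horizontal_flip(image))      # [[3, 2, 1], [6, 5, 4], [9, 8, 7]]
print(crop(image, 0, 1, 2, 2))     # [[2, 3], [5, 6]]
print(rotate_90_clockwise(image))  # [[7, 4, 1], [8, 5, 2], [9, 6, 3]]

rng = random.Random(0)
tokens = "data augmentation acts as a regularizer".split()
print(random_swap(tokens, rng))
print(random_deletion(tokens, rng))
print(random_insertion(tokens, rng, vocabulary=["also", "often"]))
```

In practice these operations are usually applied at random during training rather than precomputed, so the model sees a different variant of each example in every epoch.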

Papers

Showing 1001–1025 of 8378 papers

Title	Date	Tasks	Status	Hype
Correlation-Aware Select and Merge Attention for Efficient Fine-Tuning and Context Length Extension	Oct 5, 2024	16k, Data Augmentation	Unverified	0
RFBoost: Understanding and Boosting Deep WiFi Sensing via Physical Data Augmentation	Oct 4, 2024	Data Augmentation	Code Available	1
CUDLE: Learning Under Label Scarcity to Detect Cannabis Use in Uncontrolled Environments	Oct 4, 2024	Contrastive Learning, Data Augmentation	Unverified	0
Comparative Analysis and Ensemble Enhancement of Leading CNN Architectures for Breast Cancer Classification	Oct 4, 2024	Cancer Classification, Classification	Unverified	0
Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models	Oct 4, 2024	counterfactual, Data Augmentation	Code Available	0
AlzhiNet: Traversing from 2DCNN to 3DCNN, Towards Early Detection and Diagnosis of Alzheimer's Disease	Oct 3, 2024	Data Augmentation	Unverified	0
Capturing complex hand movements and object interactions using machine learning-powered stretchable smart textile gloves	Oct 3, 2024	Data Augmentation	Code Available	1
Cognitive Biases in Large Language Models for News Recommendation	Oct 3, 2024	Data Augmentation, Misinformation	Unverified	0
QDGset: A Large Scale Grasping Dataset Generated with Quality-Diversity	Oct 3, 2024	Data Augmentation, Diversity	Unverified	0
Can Language Models Take A Hint? Prompting for Controllable Contextualized Commonsense Inference	Oct 3, 2024	Data Augmentation, Text Generation	Unverified	0
A Novel Method for Accurate & Real-time Food Classification: The Synergistic Integration of EfficientNetB7, CBAM, Transfer Learning, and Data Augmentation	Oct 3, 2024	Data Augmentation, Transfer Learning	Unverified	0
SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation	Oct 3, 2024	Bilevel Optimization, Data Augmentation	Unverified	0
Generate then Refine: Data Augmentation for Zero-shot Intent Detection	Oct 2, 2024	Data Augmentation, Diversity	Code Available	0
TAEGAN: Generating Synthetic Tabular Data For Data Augmentation	Oct 2, 2024	Data Augmentation, Generative Adversarial Network	Unverified	0
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data	Oct 2, 2024	Audio Classification, Caption Generation	Code Available	1
Intent Detection in the Age of LLMs	Oct 2, 2024	Data Augmentation, In-Context Learning	Unverified	0
PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation	Oct 2, 2024	Data Augmentation, Diversity	Unverified	0
Formula-Driven Data Augmentation and Partial Retinal Layer Copying for Retinal Layer Segmentation	Oct 2, 2024	Data Augmentation, Segmentation	Unverified	0
ProxiMix: Enhancing Fairness with Proximity Samples in Subgroups	Oct 2, 2024	Data Augmentation, Fairness	Unverified	0
Data Extrapolation for Text-to-image Generation on Small Datasets	Oct 2, 2024	Data Augmentation, Image Generation	Code Available	1
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models	Oct 2, 2024	Data Augmentation, Knowledge Distillation	Code Available	1
Equivariant score-based generative models provably learn distributions with symmetries efficiently	Oct 2, 2024	Data Augmentation, Generalization Bounds	Unverified	0
Ensembles provably learn equivariance through data augmentation	Oct 2, 2024	Data Augmentation	Code Available	0
Pseudo-Non-Linear Data Augmentation via Energy Minimization	Oct 1, 2024	Data Augmentation, Dimensionality Reduction	Unverified	0
Augmentation through Laundering Attacks for Audio Spoof Detection	Oct 1, 2024	Data Augmentation, Face Swapping	Unverified	0

Benchmark Results

Dataset: ImageNet

#	Model	Metric	Claimed	Verified	Status
1	DeiT-B (+MixPro)	Accuracy (%)	82.9	—	Unverified
2	ResNet-200 (DeepAA)	Accuracy (%)	81.32	—	Unverified
3	DeiT-S (+MixPro)	Accuracy (%)	81.3	—	Unverified
4	ResNet-200 (Fast AA)	Accuracy (%)	80.6	—	Unverified
5	ResNet-200 (UA)	Accuracy (%)	80.4	—	Unverified
6	ResNet-200 (AA)	Accuracy (%)	80	—	Unverified
7	ResNet-50 (DeepAA)	Accuracy (%)	78.3	—	Unverified
8	ResNet-50 (TA wide)	Accuracy (%)	78.07	—	Unverified
9	ResNet-50 (LoRot-E)	Accuracy (%)	77.72	—	Unverified
10	ResNet-50 (LoRot-I)	Accuracy (%)	77.71	—	Unverified

Dataset: CIFAR-10

#	Model	Metric	Claimed	Verified	Status
1	WideResNet-40-2 (Faster AA)	Percentage error	3.7	—	Unverified
2	Shake-Shake (26 2×32d) (Faster AA)	Percentage error	2.7	—	Unverified
3	WideResNet-28-10 (Faster AA)	Percentage error	2.6	—	Unverified
4	Shake-Shake (26 2×112d) (Faster AA)	Percentage error	2	—	Unverified
5	Shake-Shake (26 2×96d) (Faster AA)	Percentage error	2	—	Unverified

Dataset: GA1457

#	Model	Metric	Claimed	Verified	Status
1	DiffAug	Classification Accuracy	92.7	—	Unverified
2	PaCMAP	Classification Accuracy	85.3	—	Unverified
3	hNNE	Classification Accuracy	77.4	—	Unverified
4	TopoAE	Classification Accuracy	74.6	—	Unverified