Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1101–1150 of 8378 papers

Title	Date	Tasks	Status	Hype
SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition	Sep 16, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks?	Sep 16, 2024	Data Augmentationimage-classification	—Unverified	0
oboVox Far Field Speaker Recognition: A Novel Data Augmentation Approach with Pretrained Models	Sep 16, 2024	Data AugmentationSpeaker Recognition	—Unverified	0
Data-Centric Strategies for Overcoming PET/CT Heterogeneity: Insights from the AutoPET III Lesion Segmentation Challenge	Sep 16, 2024	Data AugmentationLesion Segmentation	CodeCode Available	0
MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion	Sep 16, 2024	Action UnderstandingContrastive Learning	—Unverified	0
Enhancing Image Classification in Small and Unbalanced Datasets through Synthetic Data Augmentation	Sep 16, 2024	Data AugmentationDiagnostic	—Unverified	0
Contrastive Learning for Character Detection in Ancient Greek Papyri	Sep 16, 2024	Contrastive LearningData Augmentation	CodeCode Available	0
Deep-Wide Learning Assistance for Insect Pest Classification	Sep 16, 2024	ClassificationData Augmentation	CodeCode Available	1
Robust image representations with counterfactual contrastive learning	Sep 16, 2024	Contrastive Learningcounterfactual	CodeCode Available	1
Enhancing Lesion Segmentation in PET/CT Imaging with Deep Learning and Advanced Data Preprocessing Techniques	Sep 15, 2024	Data AugmentationDiagnostic	CodeCode Available	0
Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild	Sep 15, 2024	3D Hand Pose EstimationContrastive Learning	—Unverified	0
Enhancing EEG Signal Generation through a Hybrid Approach Integrating Reinforcement Learning and Diffusion Models	Sep 14, 2024	Brain Computer InterfaceData Augmentation	—Unverified	0
Effective Pre-Training of Audio Transformers for Sound Event Detection	Sep 14, 2024	Data AugmentationEvent Detection	CodeCode Available	1
From FDG to PSMA: A Hitchhiker's Guide to Multitracer, Multicenter Lesion Segmentation in PET/CT Imaging	Sep 14, 2024	Data AugmentationLesion Segmentation	CodeCode Available	1
NBBOX: Noisy Bounding Box Improves Remote Sensing Object Detection	Sep 14, 2024	Data AugmentationObject	CodeCode Available	0
An Augmentation-based Model Re-adaptation Framework for Robust Image Segmentation	Sep 14, 2024	Data AugmentationImage Segmentation	—Unverified	0
Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages	Sep 13, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Test-time Training for Hyperspectral Image Super-resolution	Sep 13, 2024	Data AugmentationHyperspectral Image Super-Resolution	—Unverified	0
GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction	Sep 13, 2024	Autonomous DrivingData Augmentation	CodeCode Available	1
FPMT: Enhanced Semi-Supervised Model for Traffic Incident Detection	Sep 12, 2024	Data Augmentation	—Unverified	0
AutoPET Challenge: Tumour Synthesis for Data Augmentation	Sep 12, 2024	Data AugmentationLesion Segmentation	—Unverified	0
Multi-scale decomposition of sea surface height snapshots using machine learning	Sep 11, 2024	Data AugmentationImage-to-Image Translation	CodeCode Available	0
Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models	Sep 11, 2024	Data AugmentationTask 2	—Unverified	0
Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy	Sep 11, 2024	Data AugmentationImage Generation	—Unverified	0
Data Augmentation via Latent Diffusion for Saliency Prediction	Sep 11, 2024	Data AugmentationDiversity	CodeCode Available	1
Deep Learning Techniques for Hand Vein Biometrics: A Comprehensive Review	Sep 11, 2024	Data AugmentationDeep Learning	—Unverified	0
Synthetic continued pretraining	Sep 11, 2024	Data AugmentationLanguage Modelling	CodeCode Available	2
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Sep 10, 2024	Data AugmentationDepth Estimation	CodeCode Available	0
Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models	Sep 10, 2024	Audio captioningAudio Question Answering	—Unverified	0
Automated Data Augmentation for Few-Shot Time Series Forecasting: A Reinforcement Learning Approach Guided by a Model Zoo	Sep 10, 2024	Data AugmentationDiversity	—Unverified	0
Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking	Sep 10, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification	Sep 10, 2024	Data Augmentationimage-classification	CodeCode Available	1
Efficient Training of Self-Supervised Speech Foundation Models on a Compute Budget	Sep 9, 2024	Data AugmentationSelf-Supervised Learning	—Unverified	0
A Small Claims Court for the NLP: Judging Legal Text Classification Strategies With Small Datasets	Sep 9, 2024	Data AugmentationLanguage Modelling	—Unverified	0
Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance	Sep 9, 2024	Data AugmentationSegmentation	CodeCode Available	0
Graffin: Stand for Tails in Imbalanced Node Classification	Sep 9, 2024	ClassificationData Augmentation	—Unverified	0
Towards Building a Robust Knowledge Intensive Question Answering Model with Large Language Models	Sep 9, 2024	Contrastive LearningData Augmentation	—Unverified	0
AD-Net: Attention-based dilated convolutional residual network with guided decoder for robust skin lesion segmentation	Sep 9, 2024	Data AugmentationDecoder	—Unverified	0
Efficient Classification of Histopathology Images	Sep 8, 2024	ClassificationData Augmentation	—Unverified	0
EdaCSC: Two Easy Data Augmentation Methods for Chinese Spelling Correction	Sep 8, 2024	Data AugmentationSpelling Correction	CodeCode Available	0
GS-PT: Exploiting 3D Gaussian Splatting for Comprehensive Point Cloud Understanding via Self-supervised Learning	Sep 8, 2024	3DGS3D Object Classification	—Unverified	0
A Survey on Diffusion Models for Recommender Systems	Sep 8, 2024	Data AugmentationRecommendation Systems	CodeCode Available	2
Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection	Sep 8, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Untie the Knots: An Efficient Data Augmentation Strategy for Long-Context Pre-Training in Language Models	Sep 7, 2024	Data Augmentation	—Unverified	0
FreeAugment: Data Augmentation Search Across All Degrees of Freedom	Sep 7, 2024	AllData Augmentation	CodeCode Available	0
A Quantitative Approach for Evaluating Disease Focus and Interpretability of Deep Learning Models for Alzheimer's Disease Classification	Sep 7, 2024	Data Augmentation	CodeCode Available	0
Phrase-Level Adversarial Training for Mitigating Bias in Neural Network-based Automatic Essay Scoring	Sep 7, 2024	Data Augmentation	—Unverified	0
Medical Image Segmentation via Single-Source Domain Generalization with Random Amplitude Spectrum Synthesis	Sep 7, 2024	Data AugmentationDomain Generalization	CodeCode Available	0
Low-Complexity Own Voice Reconstruction for Hearables with an In-Ear Microphone	Sep 6, 2024	Data Augmentation	—Unverified	0
Bi-modality Images Transfer with a Discrete Process Matching Method	Sep 6, 2024	Data AugmentationDiagnostic	—Unverified	0

Show:10 25 50

← PrevPage 23 of 168Next →

All datasets ImageNet CIFAR-10 GA1457

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	DeiT-B (+MixPro)	Accuracy (%)	82.9	—	Unverified
2	ResNet-200 (DeepAA)	Accuracy (%)	81.32	—	Unverified
3	DeiT-S (+MixPro)	Accuracy (%)	81.3	—	Unverified
4	ResNet-200 (Fast AA)	Accuracy (%)	80.6	—	Unverified
5	ResNet-200 (UA)	Accuracy (%)	80.4	—	Unverified
6	ResNet-200 (AA)	Accuracy (%)	80	—	Unverified
7	ResNet-50 (DeepAA)	Accuracy (%)	78.3	—	Unverified
8	ResNet-50 (TA wide)	Accuracy (%)	78.07	—	Unverified
9	ResNet-50 (LoRot-E)	Accuracy (%)	77.72	—	Unverified
10	ResNet-50 (LoRot-I)	Accuracy (%)	77.71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WideResNet-40-2 (Faster AA)	Percentage error	3.7	—	Unverified
2	Shake-Shake (26 2×32d) (Faster AA)	Percentage error	2.7	—	Unverified
3	WideResNet-28-10 (Faster AA)	Percentage error	2.6	—	Unverified
4	Shake-Shake (26 2×112d) (Faster AA)	Percentage error	2	—	Unverified
5	Shake-Shake (26 2×96d) (Faster AA)	Percentage error	2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DiffAug	Classification Accuracy	92.7	—	Unverified
2	PaCMAP	Classification Accuracy	85.3	—	Unverified
3	hNNE	Classification Accuracy	77.4	—	Unverified
4	TopoAE	Classification Accuracy	74.6	—	Unverified