SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 81518200 of 8378 papers

TitleStatusHype
Optimization Dynamics of Equivariant and Augmented Neural NetworksCode0
Optimization of Artificial Neural Networks models applied to the identification of images of asteroids' resonant argumentsCode0
Optimized Automated Cardiac MR Scar Quantification with GAN-Based Data AugmentationCode0
A Two-Stage Method for Text Line Detection in Historical DocumentsCode0
SemiCurv: Semi-Supervised Curvilinear Structure SegmentationCode0
G-PECNet: Towards a Generalizable Pedestrian Trajectory Prediction SystemCode0
ChessMix: Spatial Context Data Augmentation for Remote Sensing Semantic SegmentationCode0
Optimizing Heat Alert Issuance with Reinforcement LearningCode0
Empirical Study of Text Augmentation on Social Media Text in VietnameseCode0
NeuroMixGDP: A Neural Collapse-Inspired Random Mixup for Private Data ReleaseCode0
Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural NetworkCode0
Optimizing Synthetic Data for Enhanced Pancreatic Tumor SegmentationCode0
Cheap and Good? Simple and Effective Data Augmentation for Low Resource Machine ReadingCode0
Empirical Study of Mix-based Data Augmentation Methods in Physiological Time Series DataCode0
Empirical Advocacy of Bio-inspired Models for Robust Image RecognitionCode0
A Dataset of Laryngeal Endoscopic Images with Comparative Study on Convolution Neural Network Based Semantic SegmentationCode0
Order-preserving Consistency Regularization for Domain Adaptation and GeneralizationCode0
Analyzing Data Augmentation for Medical Images: A Case Study in Ultrasound ImagesCode0
Semi-supervised 3D Object Detection with PatchTeacher and PillarMixCode0
ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMsCode0
Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd CountingCode0
UniPSDA: Unsupervised Pseudo Semantic Data Augmentation for Zero-Shot Cross-Lingual Natural Language UnderstandingCode0
Embedding Hallucination for Few-Shot Language Fine-tuningCode0
ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent DetectionCode0
AugGPT: Leveraging ChatGPT for Text Data AugmentationCode0
Analysis and Optimization of Convolutional Neural Network ArchitecturesCode0
Effortless Deep Training for Traffic Sign Detection Using Templates and Arbitrary Natural ImagesCode0
Efficient Training Under Limited ResourcesCode0
Analysing the Robustness of Dual Encoders for Dense Retrieval Against MisspellingsCode0
Towards Understanding Gender Bias in Relation ExtractionCode0
CHARD: Clinical Health-Aware Reasoning Across Dimensions for Text Generation ModelsCode0
Efficient Topology-aware Data Augmentation for High-Degree Graph Neural NetworksCode0
Efficient Gaussian Process Classification Using Polya-Gamma Data AugmentationCode0
T-CGAN: Conditional Generative Adversarial Network for Data Augmentation in Noisy Time Series with Irregular SamplingCode0
Towards Understanding How Data Augmentation Works with Imbalanced DataCode0
OverPrompt: Enhancing ChatGPT through Efficient In-Context LearningCode0
Efficient Diffusion-Driven Corruption Editor for Test-Time AdaptationCode0
VideoDG: Generalizing Temporal Relations in Videos to Novel DomainsCode0
Semi-Supervised Few-Shot Learning via Multi-Factor ClusteringCode0
TdAttenMix: Top-Down Attention Guided MixupCode0
A Data Cartography based MixUp for Pre-trained Language ModelsCode0
PAGANDA: An Adaptive Task-Independent Automatic Data AugmentationCode0
Efficient Deep Learning Architectures for Fast Identification of Bacterial Strains in Resource-Constrained DevicesCode0
Attack-Augmentation Mixing-Contrastive Skeletal Representation LearningCode0
A Transductive Multi-Head Model for Cross-Domain Few-Shot LearningCode0
Semi-Supervised Learning by Disentangling and Self-Ensembling Over Stochastic Latent SpaceCode0
A Tale Of Two Long TailsCode0
AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised RankingCode0
Character-Level Question Answering with AttentionCode0
Teaching Dense Retrieval Models to Specialize with Listwise Distillation and LLM Data AugmentationCode0
Show:102550
← PrevPage 164 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified