SOTAVerified

Data Augmentation

Data augmentation comprises techniques that increase the amount of training data by applying modifications to examples in the original dataset. Besides growing the dataset, it also increases its diversity. When training machine learning models, data augmentation acts as a regularizer and helps reduce overfitting.

Data augmentation techniques have proven useful in domains such as NLP and computer vision. In computer vision, common transformations include cropping, flipping, and rotation. In NLP, techniques include swapping, deletion, and random insertion, among others.
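
The transformations named above can be sketched in a few lines of Python. This is a minimal illustration, not any particular library's implementation: the function names, the square crop size, the restriction to 90-degree rotations, and the candidate-word list used for insertion are all assumptions made for this example.

```python
import random
import numpy as np

def augment_image(img, rng, crop=24):
    """Random square crop, horizontal flip, and 90-degree rotation of an HxWxC array.

    Hypothetical helper for illustration; assumes both H and W are >= crop.
    """
    h, w = img.shape[:2]
    top = int(rng.integers(0, h - crop + 1))    # upper bound is exclusive
    left = int(rng.integers(0, w - crop + 1))
    out = img[top:top + crop, left:left + crop]
    if rng.random() < 0.5:
        out = out[:, ::-1]                      # flip along the width axis
    k = int(rng.integers(0, 4))                 # 0, 90, 180, or 270 degrees
    return np.rot90(out, k=k, axes=(0, 1))

def random_swap(tokens, rnd):
    """Swap two randomly chosen positions in a token list."""
    tokens = list(tokens)
    if len(tokens) >= 2:
        i, j = rnd.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return tokens

def random_deletion(tokens, rnd, p=0.1):
    """Drop each token with probability p, keeping at least one token."""
    if not tokens:
        return []
    kept = [t for t in tokens if rnd.random() >= p]
    return kept or [rnd.choice(tokens)]

def random_insertion(tokens, rnd, candidates):
    """Insert one word from `candidates` (an assumed word list) at a random position."""
    tokens = list(tokens)
    tokens.insert(rnd.randrange(len(tokens) + 1), rnd.choice(candidates))
    return tokens

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = np.zeros((32, 32, 3), dtype=np.uint8)
    print(augment_image(image, rng).shape)

    rnd = random.Random(0)
    sentence = "data augmentation acts as a regularizer".split()
    print(random_swap(sentence, rnd))
    print(random_deletion(sentence, rnd, p=0.3))
    print(random_insertion(sentence, rnd, ["model", "training"]))
```

Each call produces a new variant of the same example, so applying these functions repeatedly during training yields a larger and more varied effective dataset. Whether a given transformation preserves the label (for instance, rotating digits or deleting a negation word) depends on the task and should be checked per dataset.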

(Image credit: Albumentations)

Papers

Showing 5051-5100 of 8378 papers

| Title | Status | Hype |
|---|---|---|
| Universal Adaptive Data Augmentation |  | 0 |
| Attention, Filling in The Gaps for Generalization in Routing Problems |  | 0 |
| Data Augmentation for Low-Resource Quechua ASR Improvement |  | 0 |
| Neural Data-to-Text Generation Based on Small Datasets: Comparing the Added Value of Two Semi-Supervised Learning Approaches on Top of a Large Language Model |  | 0 |
| Deepfake Video Detection with Spatiotemporal Dropout Transformer |  | 0 |
| Developing a Component Comment Extractor from Product Reviews on E-Commerce Sites |  | 0 |
| Fine-tuning Partition-aware Item Similarities for Efficient and Scalable Recommendation | Code | 0 |
| Efficient Augmentation for Imbalanced Deep Learning | Code | 0 |
| Building Korean Sign Language Augmentation (KoSLA) Corpus with Data Augmentation Technique |  | 0 |
| Label-Efficient Self-Supervised Speaker Verification With Information Maximization and Contrastive Learning |  | 0 |
| Know Your Space: Inlier and Outlier Construction for Calibrating Medical OOD Detectors |  | 0 |
| Brain-Aware Replacements for Supervised Contrastive Learning in Detection of Alzheimer's Disease | Code | 0 |
| Bootstrapping a User-Centered Task-Oriented Dialogue System |  | 0 |
| Automating Detection of Papilledema in Pediatric Fundus Images with Explainable Machine Learning | Code | 0 |
| Training Robust Deep Models for Time-Series Domain: Novel Algorithms and Theoretical Analysis | Code | 0 |
| Unsupervised Joint Image Transfer and Uncertainty Quantification Using Patch Invariant Networks | Code | 0 |
| UDRN: Unified Dimensional Reduction Neural Network for Feature Selection and Feature Projection |  | 0 |
| On Improving the Performance of Glitch Classification for Gravitational Wave Detection by using Generative Adversarial Networks |  | 0 |
| How many perturbations break this model? Evaluating robustness beyond adversarial accuracy | Code | 0 |
| Models Out of Line: A Fourier Lens on Distribution Shift Robustness |  | 0 |
| StatMix: Data augmentation method that relies on image statistics in federated learning |  | 0 |
| Harnessing Out-Of-Distribution Examples via Augmenting Content and Style | Code | 0 |
| Supervised Contrastive Learning Approach for Contextual Ranking |  | 0 |
| Don't overfit the history -- Recursive time series data augmentation |  | 0 |
| Monkeypox Skin Lesion Detection Using Deep Learning Models: A Feasibility Study |  | 0 |
| Towards Length-Versatile and Noise-Robust Radio Frequency Fingerprint Identification |  | 0 |
| Generalization to translation shifts: a study in architectures and augmentations |  | 0 |
| Predicting Out-of-Domain Generalization with Neighborhood Invariance |  | 0 |
| DBN-Mix: Training Dual Branch Network Using Bilateral Mixup Augmentation for Long-Tailed Visual Recognition |  | 0 |
| TractoFormer: A Novel Fiber-level Whole Brain Tractography Analysis Framework Using Spectral Embedding and Vision Transformers |  | 0 |
| Block-SCL: Blocking Matters for Supervised Contrastive Learning in Product Matching |  | 0 |
| A Robust Ensemble Model for Patasitic Egg Detection and Classification |  | 0 |
| FakeNews: GAN-based generation of realistic 3D volumetric data -- A systematic review and taxonomy |  | 0 |
| Efficient Semi-supervised Consistency Training for Natural Language Understanding |  | 0 |
| SPDB Innovation Lab at SemEval-2022 Task 10: A Novel End-to-End Structured Sentiment Analysis Model based on the ERNIE-M |  | 0 |
| SemEval-2022 Task 3: PreTENS-Evaluating Neural Networks on Presuppositional Semantic Knowledge |  | 0 |
| CASIA at SemEval-2022 Task 11: Chinese Named Entity Recognition for Complex and Ambiguous Entities |  | 0 |
| Learning to Generate Examples for Semantic Processing Tasks |  | 0 |
| Amsqr at SemEval-2022 Task 4: Towards AutoNLP via Meta-Learning and Adversarial Data Augmentation for PCL Detection |  | 0 |
| Infrrd.ai at SemEval-2022 Task 11: A system for named entity recognition using data augmentation, transformer-based sequence labeling model, and EnsembleCRF |  | 0 |
| Compositional Generalization for Kinship Prediction through Data Augmentation |  | 0 |
| IIIT-MLNS at SemEval-2022 Task 8: Siamese Architecture for Modeling Multilingual News Similarity |  | 0 |
| I2C at SemEval-2022 Task 4: Patronizing and Condescending Language Detection using Deep Learning Techniques |  | 0 |
| Improving Classification of Infrequent Cognitive Distortions: Domain-Specific Model vs. Data Augmentation |  | 0 |
| Visual Transformer Meets CutMix for Improved Accuracy, Communication Efficiency, and Data Privacy in Split Learning |  | 0 |
| Retrieval Based Response Letter Generation For a Customer Care Setting |  | 0 |
| Data Augmentation for Low-Resource Dialogue Summarization |  | 0 |
| UA-KO at SemEval-2022 Task 11: Data Augmentation and Ensembles for Korean Named Entity Recognition |  | 0 |
| BIT-Xiaomi’s System for AutoSimTrans 2022 |  | 0 |
| Fast Bilingual Grapheme-To-Phoneme Conversion |  | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DeiT-B (+MixPro) | Accuracy (%) | 82.9 |  | Unverified |
| 2 | ResNet-200 (DeepAA) | Accuracy (%) | 81.32 |  | Unverified |
| 3 | DeiT-S (+MixPro) | Accuracy (%) | 81.3 |  | Unverified |
| 4 | ResNet-200 (Fast AA) | Accuracy (%) | 80.6 |  | Unverified |
| 5 | ResNet-200 (UA) | Accuracy (%) | 80.4 |  | Unverified |
| 6 | ResNet-200 (AA) | Accuracy (%) | 80 |  | Unverified |
| 7 | ResNet-50 (DeepAA) | Accuracy (%) | 78.3 |  | Unverified |
| 8 | ResNet-50 (TA wide) | Accuracy (%) | 78.07 |  | Unverified |
| 9 | ResNet-50 (LoRot-E) | Accuracy (%) | 77.72 |  | Unverified |
| 10 | ResNet-50 (LoRot-I) | Accuracy (%) | 77.71 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | WideResNet-40-2 (Faster AA) | Percentage error | 3.7 |  | Unverified |
| 2 | Shake-Shake (26 2×32d) (Faster AA) | Percentage error | 2.7 |  | Unverified |
| 3 | WideResNet-28-10 (Faster AA) | Percentage error | 2.6 |  | Unverified |
| 4 | Shake-Shake (26 2×112d) (Faster AA) | Percentage error | 2 |  | Unverified |
| 5 | Shake-Shake (26 2×96d) (Faster AA) | Percentage error | 2 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DiffAug | Classification Accuracy | 92.7 |  | Unverified |
| 2 | PaCMAP | Classification Accuracy | 85.3 |  | Unverified |
| 3 | hNNE | Classification Accuracy | 77.4 |  | Unverified |
| 4 | TopoAE | Classification Accuracy | 74.6 |  | Unverified |