SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 37513800 of 8378 papers

TitleStatusHype
CrowNER at Rocling 2022 Shared Task: NER using MacBERT and Adversarial Training0
Why is Winoground Hard? Investigating Failures in Visuolinguistic CompositionalityCode1
Towards Better Out-of-Distribution Generalization of Neural Algorithmic Reasoning TasksCode1
Augmentation Invariant Manifold Learning0
Self-supervised Character-to-Character Distillation for Text RecognitionCode1
SADT: Combining Sharpness-Aware Minimization with Self-Distillation for Improved Model GeneralizationCode0
Exploring Train and Test-Time Augmentations for Audio-Language Learning0
1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position SelectorCode0
SAGE: Saliency-Guided Mixup with Optimal Rearrangements0
Embedding Space Augmentation for Weakly Supervised Learning in Whole-Slide Images0
Counterfactual Data Augmentation via Perspective Transition for Open-Domain DialoguesCode0
Speaker Representation Learning via Contrastive Loss with Maximal Speaker SeparabilityCode1
Differentiable Data Augmentation for Contrastive Sentence Representation LearningCode1
Interpretable CNN-Multilevel Attention Transformer for Rapid Recognition of Pneumonia from Chest X-Ray Images0
Diverse Parallel Data Synthesis for Cross-Database Adaptation of Text-to-SQL Parsers0
A Comparative Study of Graph Neural Networks for Shape Classification in NeuroimagingCode0
Anomaly Detection in Additive Manufacturing Processes using Supervised Classification with Imbalanced Sensor Data based on Generative Adversarial Network0
RoChBert: Towards Robust BERT Fine-tuning for ChineseCode1
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition0
FUSSL: Fuzzy Uncertain Self Supervised Learning0
Latent Space is Feature Space: Regularization Term for GANs Training on Limited DatasetCode0
Rawgment: Noise-Accounted RAW Augmentation Enables Recognition in a Wide Variety of Environments0
U-Net-based Models for Skin Lesion Segmentation: More Attention and AugmentationCode1
GraphMAD: Graph Mixup for Data Augmentation using Data-Driven Convex ClusteringCode0
A Generic Shared Attention Mechanism for Various Backbone Neural Networks0
An Adversarial Active Sampling-based Data Augmentation Framework for Manufacturable Chip Design0
ScoreMix: A Scalable Augmentation Strategy for Training GANs with Limited Data0
A knowledge-driven vowel-based approach of depression classification from speech using data augmentationCode0
FreeVC: Towards High-Quality Text-Free One-Shot Voice ConversionCode2
Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation0
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion0
Bridging the visual gap in VLN via semantically richer instructions0
Domain Adaptive Object Detection for Autonomous Driving under Foggy WeatherCode1
Dictionary-Assisted Supervised Contrastive LearningCode0
Addressing Heterogeneity in Federated Learning via Distributional TransformationCode0
TuneUp: A Simple Improved Training Strategy for Graph Neural Networks0
In search of strong embedding extractors for speaker diarisation0
Long-tailed Food Classification0
Pretrained audio neural networks for Speech emotion recognition in PortugueseCode0
Automatic Diagnosis of Myocarditis Disease in Cardiac MRI Modality using Deep Transformers and Explainable Artificial Intelligence0
Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding0
Learning to Augment via Implicit Differentiation for Domain Generalization0
The Curious Case of Benign Memorization0
On Robust Incremental Learning over Many Multilingual Steps0
I Prefer not to Say: Protecting User Consent in Models with Optional Personal DataCode0
'A net for everyone': fully personalized and unsupervised neural networks trained with longitudinal data from a single patient0
Multi-Domain Long-Tailed Learning by Augmenting Disentangled RepresentationsCode0
Efficiently Trained Low-Resource Mongolian Text-to-Speech System Based On FullConv-TTS0
Provably Learning Diverse Features in Multi-View Data with Midpoint MixupCode0
Sufficient Invariant Learning for Distribution Shift0
Show:102550
← PrevPage 76 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified