SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 55515600 of 8378 papers

TitleStatusHype
NeuroMixGDP: A Neural Collapse-Inspired Random Mixup for Private Data ReleaseCode0
Adversarial Graph Contrastive Learning with Information RegularizationCode0
LMN at SemEval-2022 Task 11: A Transformer-based System for English Named Entity Recognition0
Distribution augmentation for low-resource expressive text-to-speech0
A Data Augmentation Method for Fully Automatic Brain Tumor Segmentation0
Fast Adversarial Training with Noise Augmentation: A Unified Perspective on RandStart and GradAlign0
FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals0
Audio Defect Detection in Music with Deep Networks0
HaT5: Hate Language Identification using Text-to-Text Transfer Transformer0
A Deep Learning Approach for Digital Color Reconstruction of Lenticular Films0
Adults as Augmentations for Children in Facial Emotion Recognition with Contrastive Learning0
Feature-level augmentation to improve robustness of deep neural networks to affine transformations0
Cross-speaker style transfer for text-to-speech using data augmentation0
A multiscale spatiotemporal approach for smallholder irrigation detection0
Fault Detection and Diagnosis with Imbalanced and Noisy Data: A Hybrid Framework for Rotating Machinery0
Social Media as an Instant Source of Feedback on Water Quality0
The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
TransformNet: Self-supervised representation learning through predicting geometric transformationsCode0
Equivariance versus Augmentation for Spherical ImagesCode0
DeepSSN: a deep convolutional neural network to assess spatial scene similarityCode0
Field-of-View IoU for Object Detection in 360° Images0
SODA: Self-organizing data augmentation in deep neural networks -- Application to biomedical image segmentation tasks0
Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study0
Multi-modal data generation with a deep metric variational autoencoder0
LiDAR dataset distillation within bayesian active learning framework: Understanding the effect of data augmentation0
Exemplar-Based Contrastive Self-Supervised Learning with Few-Shot Class Incremental Learning0
Fairness for Text Classification Tasks with Identity Information Data Augmentation Methods0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
Multi-Output Gaussian Process-Based Data Augmentation for Multi-Building and Multi-Floor Indoor Localization0
Bootstrapped Representation Learning for Skeleton-Based Action Recognition0
A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes0
The RoyalFlush System of Speech Recognition for M2MeT Challenge0
Learning Mechanically Driven Emergent Behavior with Message Passing Neural NetworksCode0
NoisyMix: Boosting Model Robustness to Common Corruptions0
Generalizability of Machine Learning Models: Quantitative Evaluation of Three Methodological Pitfalls0
Deep Learning in fNIRS: A review0
Compositionality as Lexical SymmetryCode0
Improving Robustness by Enhancing Weak SubnetsCode0
Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation0
Improving End-to-End Models for Set Prediction in Spoken Language Understanding0
Systematic Investigation of Strategies Tailored for Low-Resource Settings for Low-Resource Dependency ParsingCode0
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition0
Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniquesCode0
Recency Dropout for Recurrent Recommender Systems0
Challenges and Opportunities for Machine Learning Classification of Behavior and Mental State from Images0
Cardiac Disease Diagnosis on Imbalanced Electrocardiography Data Through Optimal Transport Augmentation0
Feature transforms for image data augmentationCode0
Synthetic speech detection using meta-learning with prototypical loss0
A Novel Mix-normalization Method for Generalizable Multi-source Person Re-identification0
On-Device Learning with Cloud-Coordinated Data Augmentation for Extreme Model Personalization in Recommender Systems0
Show:102550
← PrevPage 112 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified