SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 58015850 of 8378 papers

TitleStatusHype
Data augmentation to improve robustness of image captioning solutions0
Tensor feature hallucination for few-shot learningCode0
Neighborhood Contrastive Learning Applied to Online Patient MonitoringCode1
Grounding inductive biases in natural images:invariance stems from variations in dataCode1
A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition0
Offline Inverse Reinforcement Learning0
AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT0
A multi-stage GAN for multi-organ chest X-ray image generation and segmentation0
Joint System-Wise Optimization for Pipeline Goal-Oriented Dialog System0
It Takes Two to Tango: Mixup for Deep Metric LearningCode1
Theoretically Motivated Data Augmentation and Regularization for Portfolio ConstructionCode0
Self-Supervised Learning with Data Augmentations Provably Isolates Content from StyleCode1
Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question AnsweringCode1
Data-Efficient Instance Generation from Instance DiscriminationCode1
RobustNav: Towards Benchmarking Robustness in Embodied NavigationCode1
Generative adversarial network with object detector discriminator for enhanced defect detection on ultrasonic B-scans0
Cheap and Good? Simple and Effective Data Augmentation for Low Resource Machine ReadingCode0
EventDrop: data augmentation for event-based learningCode0
Rotating spiders and reflecting dogs: a class conditional approach to learning data augmentation distributions0
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages StudyCode0
RegMix: Data Mixing Augmentation for Regression0
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios0
On the Language Coverage Bias for Neural Machine Translation0
CAiRE in DialDoc21: Data Augmentation for Information-Seeking Dialogue SystemCode1
Go with the Flows: Mixtures of Normalizing Flows for Point Cloud Generation and Reconstruction0
Feature-based Style Randomization for Domain Generalization0
Training Robust Graph Neural Networks with Topology Adaptive Edge Dropping0
AOSLO-net: A deep learning-based method for automatic segmentation of retinal microaneurysms from adaptive optics scanning laser ophthalmoscope images0
Cross-language Sentence Selection via Data Augmentation and Rationale Training0
Bi-Granularity Contrastive Learning for Post-Training in Few-Shot Scene0
Self-Guided Contrastive Learning for BERT Sentence RepresentationsCode1
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis0
Learning from Counterfactual Links for Link PredictionCode1
Finding and Fixing Spurious Patterns with Explanations0
LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification0
Bayesian Inference for Gamma Models0
Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children's mindreading ability0
Pathology-Aware Generative Adversarial Networks for Medical Image Augmentation0
Semantic Palette: Guiding Scene Generation with Class ProportionsCode1
Noisy student-teacher training for robust keyword spotting0
Long Term Object Detection and Tracking in Collaborative Learning Environments0
SemiFL: Semi-Supervised Federated Learning for Unlabeled Clients with Alternate TrainingCode1
Data augmentation and pre-trained networks for extremely low data regimes unsupervised visual inspection0
Knowing More About Questions Can Help: Improving Calibration in Question AnsweringCode1
Benchmarking CNN on 3D Anatomical Brain MRI: Architectures, Data Augmentation and Deep Ensemble Learning0
Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and BeyondCode1
Automatic Classification of Attributes in German Adjective-Noun PhrasesCode0
TopGuNN: Fast NLP Training Data Augmentation using Large CorporaCode0
IIITN NLP at SMM4H 2021 Tasks: Transformer Models for Classification on Health-Related Imbalanced Twitter Datasets0
Joint Summarization-Entailment Optimization for Consumer Health Question UnderstandingCode1
Show:102550
← PrevPage 117 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified