SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 74017450 of 8378 papers

TitleStatusHype
Music Source Separation in the Waveform Domain0
PanDA: Panoptic Data Augmentation0
Enhancing Out-Of-Domain Utterance Detection with Data Augmentation Based on Word Embeddings0
DeepSmartFuzzer: Reward Guided Test Generation For Deep LearningCode0
Unsupervised Neural Sensor Models for Synthetic LiDAR Data Augmentation0
Visualizing Point Cloud Classifiers by Curvature SmoothingCode0
Computational Ceramicology0
GANkyoku: a Generative Adversarial Network for Shakuhachi MusicCode0
Improving N-gram Language Models with Pre-trained Deep Transformer0
Improving Conditioning in Context-Aware Sequence to Sequence Models0
Generating Diverse Translation by Manipulating Multi-Head Attention0
On Using SpecAugment for End-to-End Speech Translation0
The Origins and Prevalence of Texture Bias in Convolutional Neural Networks0
DermGAN: Synthetic Generation of Clinical Skin Images with Pathology0
Action Recognition Using Volumetric Motion RepresentationsCode0
Constructing Multiple Tasks for Augmentation: Improving Neural Image Classification With K-means FeaturesCode0
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling0
Faster AutoAugment: Learning Augmentation Strategies using BackpropagationCode0
Signed Input Regularization0
Robustness to Capitalization Errors in Named Entity Recognition0
A Smartphone-Based Skin Disease Classification Using MobileNet CNN0
Learning from Data-Rich Problems: A Case Study on Genetic Variant Calling0
Improving Robustness of Task Oriented Dialog Systems0
Logo-2K+: A Large-Scale Logo Dataset for Scalable Logo ClassificationCode0
Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation0
XceptionTime: A Novel Deep Architecture based on Depthwise Separable Convolutions for Hand Gesture ClassificationCode0
Towards Understanding Gender Bias in Relation ExtractionCode0
Transforming Wikipedia into Augmented Data for Query-Focused Summarization0
Not Enough Data? Deep Learning to the Rescue!0
Microsoft Research Asia's Systems for WMT190
SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic KnowledgeCode0
SRINet: Learning Strictly Rotation-Invariant Representations for Point Cloud Classification and Segmentation0
An "augmentation-free" rotation invariant classification scheme on point-cloud and its application to neuroimaging0
Scalable Deep Generative Relational Models with High-Order Node Dependence0
Learning from Explanations with Neural Execution TreeCode0
Enhanced Convolutional Neural Tangent Kernels0
Training Data Augmentation for Detecting Adverse Drug Reactions in User-Generated Content0
Data augmentation using back-translation for context-aware neural machine translation0
KNU-HYUNDAI's NMT system for Scientific Paper and Patent Tasks onWAT 20190
End-to-end Speech Translation System Description of LIT for IWSLT 20190
Understanding Data Augmentation in Neural Machine Translation: Two Perspectives towards Generalization0
Improving Language Generation from Feature-Rich Tree-Structured Data with Relational Graph Convolutional Encoders0
SYSTRAN @ WAT 2019: Russian-Japanese News Commentary task0
Improving Neural Machine Translation Robustness via Data Augmentation: Beyond Back-TranslationCode0
Benefits of Data Augmentation for NMT-based Text Normalization of User-Generated Content0
Supervised neural machine translation based on data augmentation and improved training \& inference process0
Character-Based Models for Adversarial Phone Extraction: Preventing Human Sex Trafficking0
Abstract Text Summarization: A Low Resource Challenge0
Enhanced Transformer Model for Data-to-Text Generation0
Cost-Sensitive BERT for Generalisable Sentence Classification on Imbalanced Data0
Show:102550
← PrevPage 149 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified