SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 65016550 of 8378 papers

TitleStatusHype
How Should Markup Tags Be Translated?Code0
PATQUEST: Papago Translation Quality Estimation0
Tencent Neural Machine Translation Systems for the WMT20 News Translation Task0
Noising Scheme for Data Augmentation in Automatic Post-Editing0
Facebook AI’s WMT20 News Translation Task Submission0
ColloQL: Robust Text-to-SQL Over Search QueriesCode0
A multi-source approach for Breton–French hybrid machine translation0
TextAttack: Lessons learned in designing Python frameworks for NLP0
Linguist Geeks on WNUT-2020 Task 2: COVID-19 Informative Tweet Identification using Progressive Trained Language Models and Data Augmentation0
Quantifying the Evaluation of Heuristic Methods for Textual Data Augmentation0
Detecting Entailment in Code-Mixed Hindi-English ConversationsCode0
Advancing Seq2seq with Joint Paraphrase Learning0
Reinforcement Learning with Imbalanced Dataset for Data-to-Text Medical Report Generation0
Learning Physical Common Sense as Knowledge Graph Completion via BERT Data Augmentation and Constrained Tucker Factorization0
How to Make Neural Natural Language Generation as Reliable as Templates in Task-Oriented Dialogue0
Textual Data Augmentation for Efficient Active Learning on Tiny Datasets0
Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection0
Simple Data Augmentation with the Mask Token Improves Domain Adaptation for Dialog Act Tagging0
Controllable Meaning Representation to Text Generation: Linearization and Data Augmentation Strategies0
MixKD: Towards Efficient Distillation of Large-scale Language Models0
BLT: Balancing Long-Tailed Datasets with Adversarially-Perturbed ImagesCode0
On the Performance of Convolutional Neural Networks under High and Low Frequency Information0
COVID-FACT: A Fully-Automated Capsule Network-based Framework for Identification of COVID-19 Cases from Chest CT scansCode0
Improving Dialogue Breakdown Detection with Semi-Supervised Learning0
All-Weather Object Recognition Using Radar and Infrared Sensing0
A Model of Two Tales: Dual Transfer Learning Framework for Improved Long-tail Item Recommendation0
Self-paced Data Augmentation for Training Neural Networks0
Identification of complex mixtures for Raman spectroscopy using a novel scheme based on a new multi-label deep neural network0
Conversation Graph: Data Augmentation, Training and Evaluation for Non-Deterministic Dialogue ManagementCode0
Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation LeaningCode1
Augmenting transferred representations for stock classification0
Evaluating data augmentation for financial time series classificationCode0
Classification Beats Regression: Counting of Cells from Greyscale Microscopic Images based on Annotation-free Training SamplesCode0
Improving Text Relationship Modeling with Artificial Data0
Acoustic echo cancellation with the dual-signal transformation LSTM networkCode1
Graph Contrastive Learning with Adaptive AugmentationCode1
CopyPaste: An Augmentation Method for Speech Emotion Recognition0
Perception for Autonomous Systems (PAZ)Code1
Impact of Spherical Coordinates Transformation Pre-processing in Deep Convolution Neural Networks for Brain Tumor Segmentation and Survival Prediction0
Cross-directional Feature Fusion Network for Building Damage Assessment from Satellite Imagery0
Contrastive Learning for Sequential RecommendationCode1
Restrained Generative Adversarial Network against Overfitting in Numeric Data Augmentation0
Hierarchical Metadata-Aware Document Categorization under Weak SupervisionCode1
Method and Dataset Entity Mining in Scientific Literature: A CNN + Bi-LSTM Model with Self-attention0
P^2 Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation0
Exploiting Neural Query Translation into Cross Lingual Information Retrieval0
Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach0
What is the best data augmentation for 3D brain tumor segmentation?Code1
Multi-stream Attention-based BLSTM with Feature Segmentation for Speech Emotion Recognition0
Two-stage Textual Knowledge Distillation for End-to-End Spoken Language UnderstandingCode0
Show:102550
← PrevPage 131 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified