SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 58015850 of 8378 papers

TitleStatusHype
Extraction of Medication Names from Twitter Using Augmentation and an Ensemble of Language Models0
Character-level HyperNetworks for Hate Speech DetectionCode0
Improving Novelty Detection using the Reconstructions of Nearest NeighboursCode0
Graph Transplant: Node Saliency-Guided Graph Mixup with Local Structure Preservation0
Data Augmentation Can Improve Robustness0
Real-time Instance Segmentation of Surgical Instruments using Attention and Multi-scale Feature Fusion0
Procurements with Bidder Asymmetry in Cost and Risk-Aversion0
A Relational Model for One-Shot Classification0
Off-policy Imitation Learning from Visual Inputs0
LiMoSeg: Real-time Bird's Eye View based LiDAR Motion Segmentation0
Developing neural machine translation models for Hungarian-English0
Solving the Class Imbalance Problem Using a Counterfactual Method for Data AugmentationCode0
A Syntax-Guided Grammatical Error Correction Model with Dependency Tree Correction0
Sexism Identification in Tweets and Gabs using Deep Neural Networks0
CGCL: Collaborative Graph Contrastive Learning without Handcrafted Graph Data AugmentationsCode0
Augmentations in Graph Contrastive Learning: Current Methodological Flaws & Towards Better Practices0
Human Age Estimation from Gene Expression Data using Artificial Neural Networks0
Voice Conversion Can Improve ASR in Very Low-Resource Settings0
A PubMedBERT-based Classifier with Data Augmentation Strategy for Detecting Medication Mentions in Tweets0
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics0
A Hybrid Approach for Learning to Shift and Grasp with Elaborate Motion Primitives0
Meta-Learning to Improve Pre-Training0
ISP-Agnostic Image Reconstruction for Under-Display Cameras0
Data Augmentation of Incorporating Real Error Patterns and Linguistic Knowledge for Grammatical Error Correction0
DMix: Distance Constrained Interpolative Mixup0
SpanAlign: Efficient Sequence Tagging Annotation Projection into Translated Data applied to Cross-Lingual Opinion Mining0
TermMind: Alibaba’s WMT21 Machine Translation Using Terminologies Task Submission0
Contrastive Learning for Context-aware Neural Machine Translation Using Coreference Information0
The LMU Munich System for the WMT 2021 Large-Scale Multilingual Machine Translation Shared Task0
PDALN: Progressive Domain Adaptation over a Pre-trained Model for Low-Resource Cross-Domain Named Entity Recognition0
How to Select One Among All ? An Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding0
A New Tool for Efficiently Generating Quality Estimation Datasets0
Can We Improve Model Robustness through Secondary Attribute Counterfactuals?0
Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis0
Are Factuality Checkers Reliable? Adversarial Meta-evaluation of Factuality in SummarizationCode0
Transductive Data Augmentation with Relational Path Rule Mining for Knowledge Graph Embedding0
Gradient-Based Adversarial Factual Consistency Evaluation for Abstractive Summarization0
CVAE-based Re-anchoring for Implicit Discourse Relation Classification0
NVIDIA NeMo’s Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT210
“Be nice to your wife! The restaurants are closed”: Can Gender Stereotype Detection Improve Sexism Classification?0
Learning Data Augmentation Schedules for Natural Language ProcessingCode0
Improving Dialogue State Tracking with Turn-based Loss Function and Sequential Data AugmentationCode0
A Semantic Feature-Wise Transformation Relation Network for Automatic Short Answer Grading0
Sequence Mixup for Zero-Shot Cross-Lingual Part-Of-Speech Tagging0
HW-TSC’s Participation at WMT 2021 Quality Estimation Shared Task0
Unsupervised Paraphrasing Consistency Training for Low Resource Named Entity Recognition0
Can Question Generation Debias Question Answering Models? A Case Study on Question–Context Lexical Overlap0
Simulating Realistic MRI variations to Improve Deep Learning model and visual explanations using GradCAMCode0
Long Term Stock Prediction based on Financial StatementsCode0
Reinforced Counterfactual Data Augmentation for Dual Sentiment ClassificationCode0
Show:102550
← PrevPage 117 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified