SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 51015150 of 8378 papers

TitleStatusHype
Single-stage uav detection and classification with yolov5: Mosaic data augmentation and panetCode0
Retrieval-guided Counterfactual Generation for QA0
When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation0
QA Domain Adaptation using Data Augmentation and Contrastive Adaptation0
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models0
Learning to Ignore Adversarial Attacks0
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems0
Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation0
Retrieval Data Augmentation Informed by Downstream Question Answering Performance0
EveMRC: A Two-stage Evidence Modeling For Multi-choice Machine Reading Comprehension0
Contrastive Learning for Low Resource Machine Translation0
Data Augmentation with Sentence Recombination Method for Semi-supervised Text Classification0
PESTO: A Post-User Fusion Network for Rumour Detection on Social Media0
UNICON: Unsupervised Intent Discovery via Semantic-level Contrastive Learning0
TransSGAN: GAN based semi-superivsed learning for text classification with Transformer Encoder0
Continual Few-shot Relation Learning via Embedding Space Regularization and Data Augmentation0
Text Smoothing: Enhance Various Data Augmentation Methods on Text Classification Tasks0
Explicit Modeling the Context for Chinese NER0
LINDA: Unsupervised Learning to Interpolate in Natural Language Processing0
DAML: Chinese Named Entity Recognition with a fusion method of data-augmentation and meta-learning0
CST5: Data augmentation for Code-Switched Semantic Parsing0
DAWSON: Data Augmentation using Weak Supervision On Natural Language0
Towards Better Citation Intent Classification0
Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text0
BERT is Robust! A Case Against Synonym-Based Adversarial Examples in Text Classification0
Quantum-inspired Representation for Long-tail Senses of Word Sense Disambiguation0
Data Augmentation for Intent Classification with Generic Large Language Models0
Data Augmentation and Learned Layer Aggregation for Improved Multilingual Language Understanding in Dialogue0
Global Mixup: Eliminating Ambiguity with Clustering Relationships0
Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations0
Compositional Data Augmentation for Abstractive Conversation Summarization0
Addressing Resource and Privacy Constraints in Semantic Parsing Through Data Augmentation0
SegMix: A Simple Structure-Aware Data Augmentation Method0
Vec2Node: Self-training with Tensor Augmentation for Text Classification with Few Labels0
Do We Need to Differentiate Negative Candidates Before Training a Neural Ranker?0
Improving Robustness of Language Models from a Geometry-aware Perspective0
TextMosaic: A New Data Augmentation Method for Named Entity Recognition Using Document-Level Contexts0
MoRe-Fi: Motion-robust and Fine-grained Respiration Monitoring via Deep-Learning UWB Radar0
NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT210
Data Augmentation using Random Image Cropping for High-resolution Virtual Try-On (VITON-CROP)0
Analysis of Data Augmentation Methods for Low-Resource Maltese ASR0
T-AutoML: Automated Machine Learning for Lesion Segmentation using Transformers in 3D Medical Imaging0
Reinforcement Learning of Self Enhancing Camera Image and Signal ProcessingCode0
Evaluating Contrastive Learning on Wearable Timeseries for Downstream Clinical Outcomes0
Extraction of Medication Names from Twitter Using Augmentation and an Ensemble of Language Models0
AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer SummarizationCode1
Character-level HyperNetworks for Hate Speech DetectionCode0
Improving Novelty Detection using the Reconstructions of Nearest NeighboursCode0
Towards Domain-Independent and Real-Time Gesture Recognition Using mmWave SignalCode1
Graph Transplant: Node Saliency-Guided Graph Mixup with Local Structure Preservation0
Show:102550
← PrevPage 103 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified