SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 53015350 of 8378 papers

TitleStatusHype
Few-shot Mining of Naturally Occurring Inputs and Outputs0
Improving negation detection with negation-focused pre-training0
How Does Frequency Bias Affect the Robustness of Neural Image Classifiers against Common Corruption and Adversarial Perturbations?0
Alternative Data Augmentation for Industrial Monitoring using Adversarial Learning0
MixAugment & Mixup: Augmentation Methods for Facial Expression Recognition0
Data Augmentation with Paraphrase Generation and Entity Extraction for Multimodal Dialogue System0
SAN-Net: Learning Generalization to Unseen Sites for Stroke Lesion Segmentation with Self-Adaptive NormalizationCode0
High-Resolution UAV Image Generation for Sorghum Panicle Detection0
A Data Cartography based MixUp for Pre-trained Language ModelsCode0
Text Detection on Technical Drawings for the Digitization of Brown-field Processes0
Building Brains: Subvolume Recombination for Data Augmentation in Large Vessel Occlusion Detection0
Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects0
M2R2: Missing-Modality Robust emotion Recognition framework with iterative data augmentation0
GAN Inversion for Data Augmentation to Improve Colonoscopy Lesion Classification0
Analysing the Robustness of Dual Encoders for Dense Retrieval Against MisspellingsCode0
Embedding Hallucination for Few-Shot Language Fine-tuningCode0
Assessing Dataset Bias in Computer Vision0
SUBS: Subtree Substitution for Compositional Semantic ParsingCode0
Effect of Random Histogram Equalization on Breast Calcification Analysis Using Deep Learning0
Assessing unconstrained surgical cuttings in VR using CNNs0
Positive-Unlabeled Learning with Adversarial Data Augmentation for Knowledge Graph Completion0
FastGCL: Fast Self-Supervised Learning on Graphs via Contrastive Neighborhood Aggregation0
Improving Machine Translation Formality Control with Weakly-Labelled Data Augmentation and Post Editing Strategies0
BpHigh@TamilNLP-ACL2022: Effects of Data Augmentation on Indic-Transformer based classifier for Abusive Comments Detection in TamilCode0
Continuing Pre-trained Model with Multiple Training Strategies for Emotional Classification0
Retrieval Data Augmentation Informed by Downstream Question Answering Performance0
Traffic Context Aware Data Augmentation for Rare Object Detection in Autonomous Driving0
Learning with Limited Text Data0
FilipN@LT-EDI-ACL2022-Detecting signs of Depression from Social Media: Examining the use of summarization methods as data augmentation for text classificationCode0
Improving Chinese Grammatical Error Detection via Data augmentation by Conditional Error Generation0
Decoding Part-of-Speech from Human EEG Signals0
A Simple Approach to Improve Single-Model Deep Uncertainty via Distance-Awareness0
Disambiguation of morpho-syntactic features of African American English – the case of habitual be0
One Wug, Two Wug+s Transformer Inflection Models Hallucinate Affixes0
A Comparison of Strategies for Source-Free Domain AdaptationCode0
Resnet18 Model With Sequential Layer For Computing Accuracy On Image Classification Dataset0
Product Answer Generation from Heterogeneous Sources: A New Benchmark and Best Practices0
DMix: Adaptive Distance-aware Interpolative MixupCode0
Nozza@LT-EDI-ACL2022: Ensemble Modeling for Homophobia and Transphobia Detection0
Towards Better Characterization of ParaphrasesCode0
DD-TIG at Constraint@ACL2022: Multimodal Understanding and Reasoning for Role Labeling of Entities in Hateful Memes0
AugStatic - A Light-Weight Image Augmentation LibraryCode0
On the Impact of Data Augmentation on Downstream Performance in Natural Language Processing0
Horses to Zebras: Ontology-Guided Data Augmentation and Synthesis for ICD-9 Coding0
Clozer”:" Adaptable Data Augmentation for Cloze-style Reading Comprehension0
The YiTrans Speech Translation System for IWSLT 2022 Offline Shared Task0
The Xiaomi Text-to-Text Simultaneous Speech Translation System for IWSLT 20220
Augmented Balanced Image Dataset Generator Using AugStatic LibraryCode0
Data Augmentation for Rare Symptoms in Vaccine Side-Effect Detection0
Seq2Path: Generating Sentiment Tuples as Paths of a Tree0
Show:102550
← PrevPage 107 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified