SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 39013950 of 8378 papers

TitleStatusHype
In What Ways Are Deep Neural Networks Invariant and How Should We Measure This?0
Evaluating the Performance of StyleGAN2-ADA on Medical Images0
UU-Tax at SemEval-2022 Task 3: Improving the generalizability of language models for taxonomy classification through data augmentationCode0
Trustworthiness of Laser-Induced Breakdown Spectroscopy Predictions via Simulation-based Synthetic Data Augmentation and Multitask Learning0
Automated segmentation and morphological characterization of placental histology images based on a single labeled imageCode1
On the Effectiveness of Hybrid Pooling in Mixup-Based Graph Learning for Language ProcessingCode0
A ResNet is All You Need? Modeling A Strong Baseline for Detecting Referable Diabetic Retinopathy in Fundus ImagesCode0
BootAug: Boosting Text Augmentation via Hybrid Instance Filtering FrameworkCode1
MIXCODE: Enhancing Code Classification by Mixup-Based Data AugmentationCode1
Data-driven Approaches to Surrogate Machine Learning Model Development0
Data Augmentation-free Unsupervised Learning for 3D Point Cloud UnderstandingCode1
GT-GAN: General Purpose Time Series Synthesis with Generative Adversarial Networks0
TC-SKNet with GridMask for Low-complexity Classification of Acoustic scene0
The Vendi Score: A Diversity Evaluation Metric for Machine LearningCode1
The Calibration Generalization GapCode1
Transformer-based conditional generative adversarial network for multivariate time series generationCode1
Adaptively Weighted Data Augmentation Consistency Regularization for Robust Optimization under Concept Shift0
Nuisances via Negativa: Adjusting for Spurious Correlations via Data Augmentation0
Code-Switching without Switching: Language Agnostic End-to-End Speech Translation0
PlaneDepth: Self-supervised Depth Estimation via Orthogonal PlanesCode1
rPPG-Toolbox: Deep Remote PPG ToolboxCode2
MultiLoad-GAN: A GAN-Based Synthetic Load Group Generation Method Considering Spatial-Temporal Correlations0
Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems0
Random Data Augmentation based Enhancement: A Generalized Enhancement Approach for Medical DatasetsCode0
Smooth image-to-image translations with latent space interpolationsCode0
Deep-OCTA: Ensemble Deep Learning Approaches for Diabetic Retinopathy Analysis on OCTA ImagesCode1
Pseudo-Label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object DetectionCode0
Probing the Robustness of Pre-trained Language Models for Entity MatchingCode0
Augmented Bio-SBERT: Improving Performance for Pairwise Sentence Tasks in Bio-medical Domain0
KUL@SMM4H’22: Template Augmented Adaptive Pre-training for Tweet Classification0
Low-Resource Neural Machine Translation: A Case Study of CantoneseCode1
The Only Chance to Understand: Machine Translation of the Severely Endangered Low-resource Languages of Eurasia0
CAISA@SMM4H’22: Robust Cross-Lingual Detection of Disease Mentions on Social Media with Adversarial Methods0
Lightweight Contextual Logical Structure Recovery0
BioInfo@UAVR@SMM4H’22: Classification and Extraction of Adverse Event mentions in Tweets using Transformer Models0
Data Augmentation for Improving the Prediction of Validity and Novelty of Argumentative Conclusions0
Data Augmentation for Few-Shot Knowledge Graph Completion from Hierarchical Perspective0
Coordination Generation via Synchronized Text-Infilling0
ParaZh-22M: A Large-Scale Chinese Parabank via Machine Translation0
Rethinking Data Augmentation in Text-to-text Paradigm0
Unsupervised Data Augmentation for Aspect Based Sentiment Analysis0
Evaluating and Mitigating Inherent Linguistic Bias of African American English through Inference0
Enhancing Task-Specific Distillation in Small Data Regimes through Language Generation0
Table-based Fact Verification with Self-labeled Keypoint Alignment0
Towards Summarizing Healthcare Questions in Low-Resource Setting0
Effective Data Augmentation for Sentence Classification Using One VAE per Class0
Dynamic Nonlinear Mixup with Distance-based Sample Selection0
Document-level Event Factuality Identification via Machine Reading Comprehension Frameworks with Transfer Learning0
BRCC and SentiBahasaRojak: The First Bahasa Rojak Corpus for Pretraining and Sentiment Analysis Dataset0
Summarizing Patients’ Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models0
Show:102550
← PrevPage 79 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified