SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 73517400 of 8378 papers

TitleStatusHype
Low-resource neural machine translation with morphological modelingCode0
Low Resource Text Classification with ULMFit and BacktranslationCode0
Research Trends and Applications of Data Augmentation AlgorithmsCode0
Use of What-if Scenarios to Help Explain Artificial Intelligence Models for Neonatal HealthCode0
Turning Flowchart into Dialog: Augmenting Flowchart-grounded Troubleshooting Dialogs via Synthetic Data GenerationCode0
Multi-task Pre-training Language Model for Semantic Network CompletionCode0
Turning Waste into Wealth: Leveraging Low-Quality Samples for Enhancing Continuous Conditional Generative Adversarial NetworksCode0
Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks?Code0
TwinCL: A Twin Graph Contrastive Learning Model for Collaborative FilteringCode0
Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for ClassificationCode0
Time Series Data Augmentation as an Imbalanced Learning ProblemCode0
Beyond Deterministic Translation for Unsupervised Domain AdaptationCode0
LUMix: Improving Mixup by Better Modelling Label UncertaintyCode0
Lund jet images from generative and cycle-consistent adversarial networksCode0
A Comparison of Deep Learning Methods for Cell Detection in Digital CytologyCode0
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASRCode0
GuidedMixup: An Efficient Mixup Strategy Guided by Saliency MapsCode0
Data, Depth, and Design: Learning Reliable Models for Skin Lesion AnalysisCode0
Lung Swapping Autoencoder: Learning a Disentangled Structure-texture Representation of Chest RadiographsCode0
Data-Centric Strategies for Overcoming PET/CT Heterogeneity: Insights from the AutoPET III Lesion Segmentation ChallengeCode0
SRE-Conv: Symmetric Rotation Equivariant Convolution for Biomedical Image ClassificationCode0
m2caiSeg: Semantic Segmentation of Laparoscopic Images using Convolutional Neural NetworksCode0
Data Augmented 3D Semantic Scene Completion with 2D Segmentation PriorsCode0
Data Augmentation with Variational Autoencoder for Imbalanced DatasetCode0
Appearance and Pose-Conditioned Human Image Generation using Deformable GANsCode0
Aplicación de redes neuronales convolucionales profundas al diagnóstico asistido de la enfermedad de AlzheimerCode0
Aggression Identification Using Deep Learning and Data AugmentationCode0
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual GroundingCode0
Machine learning approaches for automatic defect detection in photovoltaic systemsCode0
Use the Detection Transformer as a Data AugmenterCode0
APAR: Modeling Irregular Target Functions in Tabular Regression via Arithmetic-Aware Pre-Training and Adaptive-Regularized Fine-TuningCode0
Data Augmentation with Atomic Templates for Spoken Language UnderstandingCode0
Machine learning for rapid discovery of laminar flow channel wall modifications that enhance heat transferCode0
Better May Not Be Fairer: A Study on Subgroup Discrepancy in Image ClassificationCode0
Machine Learning Models that Remember Too MuchCode0
QuestGen: Effectiveness of Question Generation Methods for Fact-Checking ApplicationsCode0
SSL-DG: Rethinking and Fusing Semi-supervised Learning and Domain Generalization in Medical Image SegmentationCode0
GSDFuse: Capturing Cognitive Inconsistencies from Multi-Dimensional Weak Signals in Social Media SteganalysisCode0
Data Augmentation View on Graph Convolutional Network and the Proposal of Monte Carlo Graph LearningCode0
Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning ApproachCode0
Data Augmentation via Levy ProcessesCode0
Greedy AutoAugmentCode0
Data Augmentation via Dependency Tree Morphing for Low-Resource LanguagesCode0
GraphVICRegHSIC: Towards improved self-supervised representation learning for graphs with a hyrbid loss functionCode0
GraphMAD: Graph Mixup for Data Augmentation using Data-Driven Convex ClusteringCode0
Graph Contrastive Learning for Connectome ClassificationCode0
Data augmentation using synthetic data for time series classification with deep residual networksCode0
Data Augmentation using Random Image Cropping and Patching for Deep CNNsCode0
Exploring Inconsistent Knowledge Distillation for Object Detection with Data AugmentationCode0
Better Language Models of Code through Self-ImprovementCode0
Show:102550
← PrevPage 148 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified