SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 29513000 of 8378 papers

TitleStatusHype
SASMU: boost the performance of generalized recognition model using synthetic face dataset0
Quantifying Sample Anonymity in Score-Based Generative Models with Adversarial Fingerprinting0
EPIC: Graph Augmentation with Edit Path Interpolation via Learnable Cost0
Simple Data Augmentation Techniques for Chinese Disease NormalizationCode0
Improving the Robustness of Summarization Systems with Dual AugmentationCode1
Addressing Discrepancies in Semantic and Visual Alignment in Neural Networks0
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint ModelsCode1
Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression0
AfriNames: Most ASR models "butcher" African Names0
CAISA at SemEval-2023 Task 8: Counterfactual Data Augmentation for Mitigating Class Imbalance in Causal Claim Identification0
A Novel Driver Distraction Behavior Detection Method Based on Self-supervised Learning with Masked Image ModelingCode0
Geo-Tiles for Semantic Segmentation of Earth Observation ImageryCode1
Provable Benefit of Mixup for Finding Optimal Decision Boundaries0
ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NERCode1
Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations0
On the Limitations of Temperature Scaling for Distributions with OverlapsCode0
Fish-TViT: A novel fish species classification method in multi water areas based on transfer learning and vision transformer0
Building Manufacturing Deep Learning Models with Minimal and Imbalanced Training Data Using Domain Adaptation and Data Augmentation0
Simple yet Effective Code-Switching Language Identification with Multitask Pre-Training and Transfer Learning0
Source Code Data Augmentation for Deep Learning: A SurveyCode1
Multi-Epoch Learning for Deep Click-Through Rate Prediction Models0
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges0
A Survey of Label-Efficient Deep Learning for 3D Point CloudsCode1
Breast Cancer Detection and Diagnosis: A comparative study of state-of-the-arts deep learning architectures0
MSMix:An Interpolation-Based Text Data Augmentation Method Manifold Swap Mixup0
Cross Encoding as Augmentation: Towards Effective Educational Text Classification0
A Shapelet-based Framework for Unsupervised Multivariate Time Series Representation LearningCode1
Data Augmentation Methods of Dynamic Model Identification for Harbor Maneuvers using Feedforward Neural Network0
A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation0
ShuffleMix: Improving Representations via Channel-Wise Shuffle of Interpolated Hidden StatesCode0
Simulation-Aided Deep Learning for Laser Ultrasonic Visualization Testing0
Joint Optimization of Class-Specific Training- and Test-Time Data Augmentation in SegmentationCode0
Improving Generalization for Multimodal Fake News DetectionCode0
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-TuningCode1
Conditional Diffusion Models for Semantic 3D Brain MRI SynthesisCode2
Extrinsic Factors Affecting the Accuracy of Biomedical NER0
Data Augmentation for Low-Resource Keyphrase GenerationCode0
On Counterfactual Data Augmentation Under Confounding0
Improved Probabilistic Image-Text RepresentationsCode1
Augmenting Character Designers Creativity Using Generative Adversarial Networks0
Targeted Data Generation: Finding and Fixing Model Weaknesses0
Spot keywords from very noisy and mixed speech0
Disambiguated Lexically Constrained Neural Machine Translation0
GIMM: InfoMin-Max for Automated Graph Contrastive Learning0
SQL-PaLM: Improved Large Language Model Adaptation for Text-to-SQL (extended)0
Double Descent and Overfitting under Noisy Inputs and Distribution Shift for Linear Denoisers0
CREST: A Joint Framework for Rationalization and Counterfactual Text GenerationCode0
Leveraging Domain Knowledge for Inclusive and Bias-aware Humanitarian Response Entry ClassificationCode0
TADA: Task-Agnostic Dialect Adapters for EnglishCode0
GDA: Generative Data Augmentation Techniques for Relation Extraction Tasks0
Show:102550
← PrevPage 60 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified