SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 29012950 of 8378 papers

TitleStatusHype
Graph Structure and Feature Extrapolation for Out-of-Distribution Generalization0
Revisiting and Advancing Adversarial Training Through A Simple Baseline0
Generated Graph DetectionCode0
Parametric Implicit Face Representation for Audio-Driven Facial Reenactment0
Rethink the Effectiveness of Text Data Augmentation: An Empirical Analysis0
Time-aware Graph Structure Learning via Sequence Prediction on Temporal GraphsCode1
Robustness and Generalization Performance of Deep Learning Models on Cyber-Physical Systems: A Comparative StudyCode0
Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of Swahili0
Gender-Inclusive Grammatical Error Correction through AugmentationCode0
Rotational augmentation techniques: a new perspective on ensemble learning for image classification0
Underwater Acoustic Target Recognition based on Smoothness-inducing Regularization and Spectrogram-based Data Augmentation0
AROID: Improving Adversarial Robustness Through Online Instance-Wise Data AugmentationCode0
Graph Mixup with Soft Alignments0
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach0
Medical Data Augmentation via ChatGPT: A Case Study on Medication Identification and Medication Event Classification0
Attention-stacked Generative Adversarial Network (AS-GAN)-empowered Sensor Data Augmentation for Online Monitoring of Manufacturing System0
Emotion Detection from EEG using Transfer Learning0
Improving LiDAR 3D Object Detection via Range-based Point Cloud Density Optimization0
LOST: A Mental Health Dataset of Low Self-esteem in Reddit Posts0
Enhancing Robustness of AI Offensive Code Generators via Data AugmentationCode0
KIT's Multilingual Speech Translation System for IWSLT 2023Code0
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RLCode1
Data Augmentation for Improving Tail-traffic Robustness in Skill-routing for Dialogue Systems0
Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper CalibrationCode1
Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data AugmentationCode0
Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairsCode0
T-ADAF: Adaptive Data Augmentation Framework for Image Classification Network based on Tensor T-product Operator0
Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental HealthCode0
Rec4Ad: A Free Lunch to Mitigate Sample Selection Bias for Ads CTR Prediction in Taobao0
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline DataCode1
Towards Adaptable and Interactive Image Captioning with Data Augmentation and Episodic Memory0
Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!Code1
An Empirical Analysis of Parameter-Efficient Methods for Debiasing Pre-Trained Language ModelsCode0
Synthesizing Affective Neurophysiological Signals Using Generative Models: A Review Paper0
Learning to Substitute Spans towards Improving Compositional GeneralizationCode0
PULSAR: Pre-training with Extracted Healthcare Terms for Summarising Patients' Problems and Data Augmentation with Black-box Large Language ModelsCode0
Conformal Prediction with Missing ValuesCode1
Improving Conversational Recommendation Systems via Counterfactual Data SimulationCode1
R-Mixup: Riemannian Mixup for Biological Networks0
Graph Transformer for RecommendationCode1
Large Language Model Augmented Narrative Driven RecommendationsCode0
An Improved Model for Diabetic Retinopathy Detection by using Transfer Learning and Ensemble Learning0
Low-Complexity Acoustic Scene Classification Using Data Augmentation and Lightweight ResNet0
Generative Adversarial Networks for Data Augmentation0
Conditional Generation from Unconditional Diffusion Models using Denoiser RepresentationsCode0
DiffECG: A Versatile Probabilistic Diffusion Model for ECG Signals Synthesis0
EPIC: Graph Augmentation with Edit Path Interpolation via Learnable Cost0
Affinity Clustering Framework for Data Debiasing Using Pairwise Distribution DiscrepancyCode0
ChatGPT for Zero-shot Dialogue State Tracking: A Solution or an Opportunity?0
Quantifying Sample Anonymity in Score-Based Generative Models with Adversarial Fingerprinting0
Show:102550
← PrevPage 59 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified