SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 19512000 of 8378 papers

TitleStatusHype
Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical StudyCode0
GenCode: A Generic Data Augmentation Framework for Boosting Deep Learning-Based Code Understanding0
Morphological Symmetries in RoboticsCode2
Entity-level Factual Adaptiveness of Fine-tuning based Abstractive Summarization Models0
ChildAugment: Data Augmentation Methods for Zero-Resource Children's Speaker VerificationCode0
On Evaluation Protocols for Data Augmentation in a Limited Data Scenario0
Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening0
CyberDemo: Augmenting Simulated Human Demonstration for Real-World Dexterous Manipulation0
LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition0
Noise-BERT: A Unified Perturbation-Robust Framework with Noise Alignment Pre-training for Noisy Slot Filling Task0
Dynamic Evaluation of Large Language Models by Meta Probing AgentsCode7
Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks0
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems0
A Survey on Knowledge Distillation of Large Language ModelsCode5
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation0
FormulaReasoning: A Dataset for Formula-Based Numerical ReasoningCode0
Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network0
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual ExamplesCode1
Regularization by denoising: Bayesian model and Langevin-within-split Gibbs sampling0
Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training0
Rock Classification Based on Residual Networks0
Revisiting Data Augmentation in Deep Reinforcement LearningCode0
ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMsCode0
HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?Code0
Team QUST at SemEval-2024 Task 8: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting AI-generated TextCode0
Neighborhood-Enhanced Supervised Contrastive Learning for Collaborative FilteringCode2
Boosting Semi-Supervised 2D Human Pose Estimation by Revisiting Data Augmentation and Consistency TrainingCode0
Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance0
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data AugmentationCode1
Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation StrategiesCode1
A Practical Method for Generating String CounterfactualsCode0
On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models0
Parametric Augmentation for Time Series Contrastive LearningCode1
Data Augmentation and Transfer Learning Approaches Applied to Facial Expressions Recognition0
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-TuningCode3
Towards Precision Cardiovascular Analysis in Zebrafish: The ZACAF Paradigm0
Affine transformation estimation improves visual self-supervised learning0
WERank: Towards Rank Degradation Prevention for Self-Supervised Learning Using Weight Regularization0
Domain-adaptive and Subgroup-specific Cascaded Temperature Regression for Out-of-distribution Calibration0
Evaluation of Activated Sludge Settling Characteristics from Microscopy Images with Deep Convolutional Neural Networks and Transfer LearningCode0
Improving Generalization in Semantic Parsing by Increasing Natural Language Variation0
Advancing Data-driven Weather Forecasting: Time-Sliding Data Augmentation of ERA50
Inference Stage Denoising for Undersampled MRI ReconstructionCode0
MAFIA: Multi-Adapter Fused Inclusive LanguAge Models0
One Train for Two Tasks: An Encrypted Traffic Classification Framework Using Supervised Contrastive LearningCode2
Enhancing Multi-field B2B Cloud Solution Matching via Contrastive Pre-training0
Generalizing Conversational Dense Retrieval via LLM-Cognition Data AugmentationCode0
Understanding Test-Time Augmentation0
For Better or For Worse? Learning Minimum Variance Features With Label Augmentation0
CultureLLM: Incorporating Cultural Differences into Large Language ModelsCode1
Show:102550
← PrevPage 40 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified