SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 13511400 of 8378 papers

TitleStatusHype
Invariance Principle Meets Vicinal Risk Minimization0
Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised LearningCode0
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 LanguagesCode2
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-InstructCode1
Fine-Grained and Interpretable Neural Speech EditingCode1
Diffusion as Sound Propagation: Physics-inspired Model for Ultrasound Image GenerationCode1
On the power of data augmentation for head pose estimationCode1
SmurfCat at PAN 2024 TextDetox: Alignment of Multilingual Transformers for Text DetoxificationCode0
Enhanced Long-Tailed Recognition with Contrastive CutMix AugmentationCode0
Conditional Semi-Supervised Data Augmentation for Spam Message Detection with Low Resource Data0
Synthetic Data Aided Federated Learning Using Foundation Models0
TimeLDM: Latent Diffusion Model for Unconditional Time Series Generation0
LLMAEL: Large Language Models are Good Context Augmenters for Entity LinkingCode1
Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis0
DSMix: Distortion-Induced Sensitivity Map Based Pre-training for No-Reference Image Quality AssessmentCode0
Query-oriented Data Augmentation for Session Search0
Generative Technology for Human Emotion Recognition: A Scope Review0
Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect DetectionCode1
A Survey of Data Synthesis ApproachesCode0
Advances in Diffusion Models for Image Data Augmentation: A Review of Methods, Models, Evaluation Metrics and Future Research Directions0
DACB-Net: Dual Attention Guided Compact Bilinear Convolution Neural Network for Skin Disease Classification0
Boosting Biomedical Concept Extraction by Rule-Based Data Augmentation0
Self-supervised Vision Transformer are Scalable Generative Models for Domain GeneralizationCode0
HRSAM: Efficient Interactive Segmentation in High-Resolution ImagesCode1
Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse WeatherCode2
Ensemble of pre-trained language models and data augmentation for hate speech detection from Arabic tweets0
Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation?Code0
Counterfactual Data Augmentation with Denoising Diffusion for Graph Anomaly DetectionCode0
Evaluating the Role of Data Enrichment Approaches Towards Rare Event Analysis in Manufacturing0
Robust and Reliable Early-Stage Website Fingerprinting Attacks via Spatial-Temporal Distribution AnalysisCode2
Improving Trip Mode Choice Modeling Using Ensemble Synthesizer (ENSY)0
SGCCNet: Single-Stage 3D Object Detector With Saliency-Guided Data Augmentation and Confidence Correction Mechanism0
Analyzing Persuasive Strategies in Meme Texts: A Fusion of Language Models with Paraphrase Enrichment0
Gloss2Text: Sign Language Gloss translation using LLMs and Semantically Aware Label SmoothingCode0
Channel Modeling Aided Dataset Generation for AI-Enabled CSI Feedback: Advances, Challenges, and Solutions0
SAFE: a SAR Feature Extractor based on self-supervised learning and masked Siamese ViTsCode0
Heterogeneous Graph Contrastive Learning with Spectral Augmentation0
Iterative Data Generation with Large Language Models for Aspect-based Sentiment Analysis0
LLM-Generated Natural Language Meets Scaling Laws: New Explorations and Data Augmentation Methods0
A Survey on Deep Clustering: From the Prior Perspective0
Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation0
Exact Bayesian Gaussian Cox Processes Using Random IntegralCode0
Multi-Epoch learning with Data Augmentation for Deep Click-Through Rate Prediction0
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language ModelsCode2
RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation0
Zero-shot domain adaptation based on dual-level mix and contrast0
Sequential Disentanglement by Extracting Static Information From A Single Sequence Element0
Effects of Using Synthetic Data on Deep Recommender Models' Performance0
VIPriors 4: Visual Inductive Priors for Data-Efficient Deep Learning Challenges0
RouteLLM: Learning to Route LLMs with Preference DataCode7
Show:102550
← PrevPage 28 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified