SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 351400 of 8378 papers

TitleStatusHype
7T MRI Synthesization from 3T AcquisitionsCode1
Dataset Condensation for Time Series Classification via Dual Domain MatchingCode1
CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization PerspectiveCode1
Repeated Padding for Sequential RecommendationCode1
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource LanguagesCode1
PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via PromptsCode1
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code GeneratorsCode1
FedHCDR: Federated Cross-Domain Recommendation with Hypergraph Signal DecouplingCode1
Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image ClassificationCode1
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language ModelsCode1
WHU-Synthetic: A Synthetic Perception Dataset for 3-D Multitask Model ResearchCode1
3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labellingCode1
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual ExamplesCode1
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data AugmentationCode1
Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation StrategiesCode1
Parametric Augmentation for Time Series Contrastive LearningCode1
ExaRanker-Open: Synthetic Explanation for IR using Open-Source LLMsCode1
CultureLLM: Incorporating Cultural Differences into Large Language ModelsCode1
SPARQL Generation: an analysis on fine-tuning OpenLLaMA for Question Answering over a Life Science Knowledge GraphCode1
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning DatasetCode1
CodeIt: Self-Improving Language Models with Prioritized Hindsight ReplayCode1
Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced SegmentationCode1
TimeSiam: A Pre-Training Framework for Siamese Time-Series ModelingCode1
Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapesCode1
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of ViewCode1
Learning High-Quality and General-Purpose Phrase RepresentationsCode1
Trapped in texture bias? A large scale comparison of deep instance segmentationCode1
SymTC: A Symbiotic Transformer-CNN Net for Instance Segmentation of Lumbar Spine MRICode1
Phase-shifted remote photoplethysmography for estimating heart rate and blood pressure from facial videoCode1
Detection and Classification of Diabetic Retinopathy using Deep Learning Algorithms for Segmentation to Facilitate Referral Recommendation for Test and Treatment PredictionCode1
Bi-level Learning of Task-Specific Decoders for Joint Registration and One-Shot Medical Image SegmentationCode1
Generating Handwritten Mathematical Expressions From Symbol Graphs: An End-to-End PipelineCode1
SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent DetectionCode1
Data Augmentation for Supervised Graph Outlier Detection via Latent Diffusion ModelsCode1
DiffKG: Knowledge Graph Diffusion Model for RecommendationCode1
Generalizable Visual Reinforcement Learning with Segment Anything ModelCode1
MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement LearningCode1
MonoLSS: Learnable Sample Selection For Monocular 3D DetectionCode1
Controllable 3D Face Generation with Conditional Style Code DiffusionCode1
Video Recognition in Portrait ModeCode1
AdvST: Revisiting Data Augmentations for Single Domain GeneralizationCode1
Calibrating Wireless Ray Tracing for Digital Twinning using Local Phase Error EstimatesCode1
Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimesCode1
TAO-Amodal: A Benchmark for Tracking Any Object AmodallyCode1
Object-Aware Domain Generalization for Object DetectionCode1
Time-Transformer: Integrating Local and Global Features for Better Time Series GenerationCode1
ID-Blau: Image Deblurring by Implicit Diffusion-based reBLurring AUgmentationCode1
Compositional Generalization for Multi-label Text Classification: A Data-Augmentation ApproachCode1
A Study on Transferability of Deep Learning Models for Network Intrusion DetectionCode1
SeiT++: Masked Token Modeling Improves Storage-efficient TrainingCode1
Show:102550
← PrevPage 8 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified