SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 551600 of 8378 papers

TitleStatusHype
Diversify Your Vision Datasets with Automatic Diffusion-Based AugmentationCode1
Training on Thin Air: Improve Image Classification with Generated DataCode1
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs CollaborationCode1
Tied-Augment: Controlling Representation Similarity Improves Data AugmentationCode1
PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMsCode1
Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical ReasoningCode1
Adaptive Graph Contrastive Learning for RecommendationCode1
Cross-modality Data Augmentation for End-to-End Sign Language TranslationCode1
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data AugmentationCode1
Rethinking Data Augmentation for Tabular Data in Deep LearningCode1
Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment AnalysisCode1
Learning Better Contrastive View from Radiologist's GazeCode1
DAC-MR: Data Augmentation Consistency Based Meta-Regularization for Meta-LearningCode1
Target-Side Augmentation for Document-Level Machine TranslationCode1
Graph Masked Autoencoder for Sequential RecommendationCode1
Semantic-aware Generation of Multi-view Portrait DrawingsCode1
Improving Contrastive Learning of Sentence Embeddings from AI FeedbackCode1
The Training Process of Many Deep Networks Explores the Same Low-Dimensional ManifoldCode1
Part Aware Contrastive Learning for Self-Supervised Action RecognitionCode1
Generating images of rare concepts using pre-trained diffusion modelsCode1
The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification TasksCode1
Learning to Predict Navigational Patterns from Partial ObservationsCode1
MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision TransformerCode1
Meta-optimized Contrastive Learning for Sequential RecommendationCode1
PARFormer: Transformer-based Multi-Task Network for Pedestrian Attribute RecognitionCode1
Fracture Detection in Pediatric Wrist Trauma X-ray Images Using YOLOv8 AlgorithmCode1
Isolated Sign Language Recognition based on Tree Structure Skeleton ImagesCode1
Hierarchical Disentanglement-Alignment Network for Robust SAR Vehicle RecognitionCode1
HyperTab: Hypernetwork Approach for Deep Learning on Small Tabular DatasetsCode1
A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluationCode1
DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic SegmentationCode1
FakET: Simulating Cryo-Electron Tomograms with Neural Style TransferCode1
Cross-modulated Few-shot Image Generation for Colorectal Tissue ClassificationCode1
Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object DetectionCode1
Astroformer: More Data Might not be all you need for ClassificationCode1
One-shot Unsupervised Domain Adaptation with Personalized Diffusion ModelsCode1
Mixed Autoencoder for Self-supervised Visual Representation LearningCode1
WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion ModelsCode1
Improving the Transferability of Adversarial Samples by Path-Augmented MethodCode1
Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low Resource LanguagesCode1
Learning the Unlearnable: Adversarial Augmentations Suppress Unlearnable Example AttacksCode1
EEGMatch: Learning with Incomplete Labels for Semi-Supervised EEG-based Cross-Subject Emotion RecognitionCode1
VisDA 2022 Challenge: Domain Adaptation for Industrial Waste SortingCode1
Towards Diverse and Coherent Augmentation for Time-Series ForecastingCode1
Debiased Contrastive Learning for Sequential RecommendationCode1
MSTFormer: Motion Inspired Spatial-temporal Transformer with Dynamic-aware Attention for long-term Vessel Trajectory PredictionCode1
A Survey on Causal Inference for RecommendationCode1
Motion Matters: Neural Motion Transfer for Better Camera Physiological MeasurementCode1
MixCycle: Mixup Assisted Semi-Supervised 3D Single Object Tracking with Cycle ConsistencyCode1
MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous DrivingCode1
Show:102550
← PrevPage 12 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified