SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 82018250 of 8378 papers

TitleStatusHype
Character-level HyperNetworks for Hate Speech DetectionCode0
ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-TranslationCode0
Efficient Classification with Counterfactual Reasoning and Active LearningCode0
Parallel Grid Pooling for Data AugmentationCode0
A Systematic Evaluation of Recent Deep Learning Architectures for Fine-Grained Vehicle ClassificationCode0
Adversarial Pulmonary Pathology Translation for Pairwise Chest X-ray Data AugmentationCode0
Teaching with CommentariesCode0
Channel Augmented Joint Learning for Visible-Infrared RecognitionCode0
Efficient Augmentation for Imbalanced Deep LearningCode0
Weakly Supervised Deep Detection NetworksCode0
Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter DatasetCode0
Paraphrase Augmented Task-Oriented Dialog GenerationCode0
Chameleon: Foundation Models for Fairness-aware Multi-modal Data Augmentation to Enhance Coverage of MinoritiesCode0
ToW: Thoughts of Words Improve Reasoning in Large Language ModelsCode0
Fine-tuning Partition-aware Item Similarities for Efficient and Scalable RecommendationCode0
Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentationCode0
Effective Rotation-invariant Point CNN with Spherical Harmonics kernelsCode0
Effective Image Tampering Localization with Multi-Scale ConvNeXt Feature FusionCode0
Effective Dual-Region Augmentation for Reduced Reliance on Large Amounts of Labeled DataCode0
A Bayesian Data Augmentation Approach for Learning Deep ModelsCode0
Certified Robustness to Adversarial Word SubstitutionsCode0
An Active Learning Approach for Reducing Annotation Cost in Skin Lesion AnalysisCode0
Universum-inspired Supervised Contrastive LearningCode0
AMPLIFY:Attention-based Mixup for Performance Improvement and Label Smoothing in TransformerCode0
Cell Segmentation by Combining Marker-Controlled Watershed and Deep LearningCode0
UniVision: A Unified Framework for Vision-Centric 3D PerceptionCode0
CEB Improves Model RobustnessCode0
PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker VerificationCode0
Team QUST at SemEval-2024 Task 8: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting AI-generated TextCode0
Learning Representations by Maximizing Mutual Information Across ViewsCode0
PatchAugment: Local Neighborhood Augmentation in Point Cloud ClassificationCode0
EDDA: A Encoder-Decoder Data Augmentation Framework for Zero-Shot Stance DetectionCode0
CAVA: A Visual Analytics System for Exploratory Columnar Data Augmentation Using Knowledge GraphsCode0
Causal Optimal Transport of AbstractionsCode0
SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor VariationsCode0
Technical Report: Combining knowledge from Transfer Learning during training and Wide ResnetsCode0
Tracking Passengers and Baggage Items using Multi-camera Systems at Security CheckpointsCode0
Tracking Passengers and Baggage Items using Multiple Overhead Cameras at Security CheckpointsCode0
Adversarial Momentum-Contrastive Pre-TrainingCode0
PathoWAve: A Deep Learning-based Weight Averaging Method for Improving Domain Generalization in Histopathology ImagesCode0
Word Embedding Perturbation for Sentence ClassificationCode0
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous SpaceCode0
Robust deep learning for eye fundus images: Bridging real and synthetic data for enhancing generalizationCode0
Sentence-Level Resampling for Named Entity RecognitionCode0
SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic KnowledgeCode0
PCC: Paraphrasing with Bottom-k Sampling and Cyclic Learning for Curriculum Data AugmentationCode0
tempoGAN: A Temporally Coherent, Volumetric GAN for Super-resolution Fluid FlowCode0
Sentiment Analysis on Financial News Headlines using Training Dataset AugmentationCode0
PDE-based Group Equivariant Convolutional Neural NetworksCode0
EDADepth: Enhanced Data Augmentation for Monocular Depth EstimationCode0
Show:102550
← PrevPage 165 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified