SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 13011350 of 8378 papers

TitleStatusHype
Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text GenerationCode1
Bootstrap Your Object Detector via Mixed TrainingCode1
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language ModelsCode1
AutoDC: Automated data-centric processingCode1
Improved Regularization of Convolutional Neural Networks with CutoutCode1
Improving Aspect Sentiment Quad Prediction via Template-Order Data AugmentationCode1
Contrastive Learning for Sequential RecommendationCode1
Improving Compositional Generalization with Latent Structure and Data AugmentationCode1
Improving Contrastive Learning of Sentence Embeddings from AI FeedbackCode1
Improving Contrastive Learning with Model AugmentationCode1
Unleashing the Power of Graph Data Augmentation on Covariate Distribution ShiftCode1
Circumventing Outliers of AutoAugment with Knowledge DistillationCode1
Improving Equivariance in State-of-the-Art Supervised Depth and Normal PredictorsCode1
Brain Tumor Segmentation and Radiomics Survival Prediction: Contribution to the BRATS 2017 ChallengeCode1
3rd Place Solution to "Google Landmark Retrieval 2020"Code1
Improving Generalizability of Graph Anomaly Detection Models via Data AugmentationCode1
Improving Generalization in Reinforcement Learning with Mixture RegularizationCode1
Improving Model Generalization by Agreement of Learned Representations from Data AugmentationCode1
Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence ModelingCode1
Break-It-Fix-It: Unsupervised Learning for Program RepairCode1
CipherDAug: Ciphertext based Data Augmentation for Neural Machine TranslationCode1
No Reason for No Supervision: Improved Generalization in Supervised ModelsCode1
Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data AugmentationCode1
Incorporating External Knowledge through Pre-training for Natural Language to Code GenerationCode1
Incremental Generative Occlusion Adversarial Suppression Network for Person ReIDCode1
Indiscriminate Poisoning Attacks on Unsupervised Contrastive LearningCode1
Controllable 3D Face Generation with Conditional Style Code DiffusionCode1
Contrastive Learning with Large Memory Bank and Negative Embedding Subtraction for Accurate Copy DetectionCode1
Inside Out Visual Place RecognitionCode1
Instance Credibility Inference for Few-Shot LearningCode1
Integrating Large Circular Kernels into CNNs through Neural Architecture SearchCode1
Intent-aware Diffusion with Contrastive Learning for Sequential RecommendationCode1
Contrastive Learning of User Behavior Sequence for Context-Aware Document RankingCode1
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs CollaborationCode1
CL4CTR: A Contrastive Learning Framework for CTR PredictionCode1
Invariance Analysis of Saliency Models versus Human Gaze During Scene Free ViewingCode1
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-InstructCode1
Investigating Personalization Methods in Text to Music GenerationCode1
AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer SummarizationCode1
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code GeneratorsCode1
BSUV-Net: A Fully-Convolutional Neural Network forBackground Subtraction of Unseen VideosCode1
Is Artificial Intelligence Generated Image Detection a Solved Problem?Code1
7T MRI Synthesization from 3T AcquisitionsCode1
It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story BooksCode1
It Takes Two to Tango: Mixup for Deep Metric LearningCode1
Joint Appearance and Motion Learning for Efficient Rolling Shutter CorrectionCode1
Jointly Learnable Data Augmentations for Self-Supervised GNNsCode1
Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical ReasoningCode1
JUICER: Data-Efficient Imitation Learning for Robotic AssemblyCode1
AutoCLINT: The Winning Method in AutoCV Challenge 2019Code1
Show:102550
← PrevPage 27 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified