SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 151200 of 8378 papers

TitleStatusHype
Addressing Concept Shift in Online Time Series Forecasting: Detect-then-AdaptCode2
BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data TrainingCode2
GAN-Supervised Dense Visual AlignmentCode2
BOP Challenge 2020 on 6D Object LocalizationCode2
CodeS: Towards Building Open-source Language Models for Text-to-SQLCode2
GraphGPT: Graph Instruction Tuning for Large Language ModelsCode2
Identity Decoupling for Multi-Subject Personalization of Text-to-Image ModelsCode2
Decoupling Representation Learning from Reinforcement LearningCode2
Intriguing Properties of Contrastive LossesCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical ReasoningCode2
LibFewShot: A Comprehensive Library for Few-shot LearningCode2
LLM2LLM: Boosting LLMs with Novel Iterative Data EnhancementCode2
Many-MobileNet: Multi-Model Augmentation for Robust Retinal Disease ClassificationCode2
Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event DetectionCode2
MolNexTR: A Generalized Deep Learning Model for Molecular Image RecognitionCode2
Multi-Modal Self-Supervised Learning for RecommendationCode2
SECOND: Sparsely Embedded Convolutional DetectionCode2
Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content DetectorsCode2
BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature AnalysisCode1
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement LearningCode1
Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment AnalysisCode1
BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion ModelCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
Exploring Discontinuity for Video Frame InterpolationCode1
Bi-level Learning of Task-Specific Decoders for Joint Registration and One-Shot Medical Image SegmentationCode1
Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuningCode1
Better plain ViT baselines for ImageNet-1kCode1
Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust FinetuningCode1
Behavior Injection: Preparing Language Models for Reinforcement LearningCode1
3D Common Corruptions and Data AugmentationCode1
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification ContextCode1
Bag of Tricks for Long-Tailed Visual Recognition with Deep Convolutional Neural NetworksCode1
BAGAN: Data Augmentation with Balancing GANCode1
Bayesian Adversarial Human Motion SynthesisCode1
Background-Mixed Augmentation for Weakly Supervised Change DetectionCode1
A Winning Hand: Compressing Deep Networks Can Improve Out-Of-Distribution RobustnessCode1
Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense EncodersCode1
Bayesian inference for logistic models using Polya-Gamma latent variablesCode1
AutoML Segmentation for 3D Medical Image Data: Contribution to the MSD Challenge 2018Code1
AutoMix: Unveiling the Power of Mixup for Stronger ClassifiersCode1
PIGNet2: A Versatile Deep Learning-based Protein-Ligand Interaction Prediction Model for Binding Affinity Scoring and Virtual ScreeningCode1
Automatic Data Augmentation for 3D Medical Image SegmentationCode1
Automatic Data Augmentation for Generalization in Reinforcement LearningCode1
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentationCode1
BDANet: Multiscale Convolutional Neural Network with Cross-directional Attention for Building Damage Assessment from Satellite ImagesCode1
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language ModelsCode1
AutoDC: Automated data-centric processingCode1
Automated Chest CT Image Segmentation of COVID-19 Lung Infection based on 3D U-NetCode1
A Cartesian Encoding Graph Neural Network for Crystal Structures Property Prediction: Application to Thermal Ellipsoid EstimationCode1
Show:102550
← PrevPage 4 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified