SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 901950 of 8378 papers

TitleStatusHype
Learning Better Contrastive View from Radiologist's GazeCode1
Learning Data Augmentation Strategies for Object DetectionCode1
Learning domain-agnostic visual representation for computational pathology using medically-irrelevant style transfer augmentationCode1
Learning Fair Node Representations with Graph Counterfactual FairnessCode1
Generative Latent Implicit Conditional Optimization when Learning from Small SampleCode1
Learning High-Quality and General-Purpose Phrase RepresentationsCode1
ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and Multispectral Data FusionCode1
Learning Multimodal Data Augmentation in Feature SpaceCode1
Learning Performance-Improving Code EditsCode1
Learning Representational Invariances for Data-Efficient Action RecognitionCode1
Confident Sinkhorn Allocation for Pseudo-LabelingCode1
Learning the Unlearnable: Adversarial Augmentations Suppress Unlearnable Example AttacksCode1
Learning to Perturb Word Embeddings for Out-of-distribution QACode1
Attribute Descent: Simulating Object-Centric Datasets on the Content Level and BeyondCode1
Learning Two-Stream CNN for Multi-Modal Age-related Macular Degeneration CategorizationCode1
Learn to Augment: Joint Data Augmentation and Network Optimization for Text RecognitionCode1
A Two-Stage Approach to Device-Robust Acoustic Scene ClassificationCode1
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data AugmentationCode1
Conformal Prediction with Missing ValuesCode1
Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect DetectionCode1
AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breathCode1
A Light Recipe to Train Robust Vision TransformersCode1
Context Decoupling Augmentation for Weakly Supervised Semantic SegmentationCode1
Concatenated Masked Autoencoders as Spatial-Temporal LearnerCode1
Uncovering the Limits of Machine Learning for Automatic Vulnerability DetectionCode1
LiteDepth: Digging into Fast and Accurate Depth Estimation on Mobile DevicesCode1
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-TuningCode1
Local Additivity Based Data Augmentation for Semi-supervised NERCode1
Logic-Guided Data Augmentation and Regularization for Consistent Question AnsweringCode1
Long-tail Augmented Graph Contrastive Learning for RecommendationCode1
Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper CalibrationCode1
Low-Resource Neural Machine Translation: A Case Study of CantoneseCode1
AugCSE: Contrastive Sentence Embedding with Diverse AugmentationsCode1
Lung Segmentation from Chest X-rays using Variational Data ImputationCode1
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support ConversationCode1
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual PerceptionCode1
MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without RetrainingCode1
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data AugmentationCode1
AugLiChem: Data Augmentation Library of Chemical Structures for Machine LearningCode1
AlignMixup: Improving Representations By Interpolating Aligned FeaturesCode1
CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate SpeechCode1
Anatomical Data Augmentation via Fluid-based Image RegistrationCode1
BDANet: Multiscale Convolutional Neural Network with Cross-directional Attention for Building Damage Assessment from Satellite ImagesCode1
Behavior Injection: Preparing Language Models for Reinforcement LearningCode1
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image SegmentationCode1
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement LearningCode1
Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and RobustnessCode1
Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic SegmentationCode1
Composing Good Shots by Exploiting Mutual RelationsCode1
Compositional Generalization for Multi-label Text Classification: A Data-Augmentation ApproachCode1
Show:102550
← PrevPage 19 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified