SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 12511300 of 8378 papers

TitleStatusHype
BSUV-Net: A Fully-Convolutional Neural Network forBackground Subtraction of Unseen VideosCode1
LiteNeXt: A Novel Lightweight ConvMixer-based Model with Self-embedding Representation Parallel for Medical Image SegmentationCode1
Generative Data Augmentation for Aspect Sentiment Quad PredictionCode1
Data augmentation for learning predictive models on EEG: a systematic comparisonCode1
A Competitive Method for Dog Nose-print Re-identificationCode1
Data Augmentation for Meta-LearningCode1
Data Augmentation for Supervised Graph Outlier Detection via Latent Diffusion ModelsCode1
Local Additivity Based Data Augmentation for Semi-supervised NERCode1
Data Augmentation for Scene Text RecognitionCode1
Data Augmentation for Spoken Language Understanding via Pretrained Language ModelsCode1
Automated Chest CT Image Segmentation of COVID-19 Lung Infection based on 3D U-NetCode1
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RLCode1
An Accurate Car Counting in Aerial Images Based on Convolutional Neural NetworksCode1
MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement LearningCode1
Boosted Neural Decoders: Achieving Extreme Reliability of LDPC Codes for 6G NetworksCode1
Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational AutoencoderCode1
Data Augmentation of Wearable Sensor Data for Parkinson's Disease Monitoring using Convolutional Neural NetworksCode1
MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without RetrainingCode1
Data Augmentation on Graphs: A Technical SurveyCode1
MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye trackingCode1
Data Augmentation using Pre-trained Transformer ModelsCode1
MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive LearningCode1
Masked Autoencoders are Robust Data AugmentorsCode1
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language ModelsCode1
An Investigation of End-to-End Models for Robust Speech RecognitionCode1
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge DistillationCode1
Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and RobustnessCode1
MaxStyle: Adversarial Style Composition for Robust Medical Image SegmentationCode1
CADTransformer: Panoptic Symbol Spotting Transformer for CAD DrawingsCode1
Data Augmentation with norm-VAE for Unsupervised Domain AdaptationCode1
AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat ReportsCode1
Data augmentation with Mobius transformationsCode1
Meta-optimized Contrastive Learning for Sequential RecommendationCode1
MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual RecognitionCode1
Metric Based Few-Shot Graph ClassificationCode1
Data Augmenting Contrastive Learning of Speech Representations in the Time DomainCode1
Genie: A Generator of Natural Language Semantic Parsers for Virtual Assistant CommandsCode1
Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNsCode1
Minority-Focused Text-to-Image Generation via Prompt OptimizationCode1
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question AnsweringCode1
Data-Efficient Instance Generation from Instance DiscriminationCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data AugmentationCode1
BOOTPLACE: Bootstrapped Object Placement with Detection TransformersCode1
Generation of Realistic Synthetic Raw Radar Data for Automated Driving Applications using Generative Adversarial NetworksCode1
DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch InferenceCode1
Data Optimization in Deep Learning: A SurveyCode1
Bootstrapping Relation Extractors using Syntactic Search by ExamplesCode1
Data set creation and empirical analysis for detecting signs of depression from social media postingsCode1
Generation of microbial colonies dataset with deep learning style transferCode1
Show:102550
← PrevPage 26 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified