SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 26-50 of 822 papers

Title | Status | Hype
End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music | Code | 2
A Synthetic Dataset for Personal Attribute Inference | Code | 2
Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs | Code | 2
TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series | Code | 2
Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting | Code | 2
Mellow: a small audio language model for reasoning | Code | 2
Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data | Code | 2
Improved Multi-Task Brain Tumour Segmentation with Synthetic Data Augmentation | Code | 2
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval | Code | 2
BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion | Code | 2
REaLTabFormer: Generating Realistic Relational and Tabular Data using Transformers | Code | 2
UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization | Code | 2
Pedagogical Alignment of Large Language Models | Code | 2
EC-GAN: Low-Sample Classification using Semi-Supervised Algorithms and GANs | Code | 1
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data | Code | 1
EEG Synthetic Data Generation Using Probabilistic Diffusion Models | Code | 1
dpmm: Differentially Private Marginal Models, a Library for Synthetic Tabular Data Generation | Code | 1
Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction | Code | 1
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails | Code | 1
Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in Challenging Domains | Code | 1
Diffusion-based Conditional ECG Generation with Structured State Space Models | Code | 1
DP-MERF: Differentially Private Mean Embeddings with Random Features for Practical Privacy-Preserving Data Generation | Code | 1
BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages | Code | 1
Differentially Private Synthetic Medical Data Generation using Convolutional GANs | Code | 1
dpart: Differentially Private Autoregressive Tabular, a General Framework for Synthetic Data Generation | Code | 1
Page 2 of 33

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | corGAN | AUROC | 0.92 | | Unverified
2 | GAN | AUROC | 0.87 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | kiNETGAN | EMD | 0.07 | | Unverified
2 | CTGAN | EMD | 0.07 | | Unverified