SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 26-50 of 822 papers

| Title | Status | Hype |
|---|---|---|
| A Synthetic Dataset for Personal Attribute Inference | Code | 2 |
| End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music | Code | 2 |
| Pedagogical Alignment of Large Language Models | Code | 2 |
| UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization | Code | 2 |
| Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting | Code | 2 |
| Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data | Code | 2 |
| InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval | Code | 2 |
| BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion | Code | 2 |
| TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series | Code | 2 |
| Towards Realistic Generative 3D Face Models | Code | 2 |
| REaLTabFormer: Generating Realistic Relational and Tabular Data using Transformers | Code | 2 |
| DigiFace-1M: 1 Million Digital Face Images for Face Recognition | Code | 2 |
| Synthetic QA Corpora Generation with Roundtrip Consistency | Code | 2 |
| Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability | Code | 1 |
| dpmm: Differentially Private Marginal Models, a Library for Synthetic Tabular Data Generation | Code | 1 |
| Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection | Code | 1 |
| ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval | Code | 1 |
| V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation | Code | 1 |
| BLEUBERI: BLEU is a surprisingly effective reward for instruction following | Code | 1 |
| RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization | Code | 1 |
| Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer | Code | 1 |
| MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS | Code | 1 |
| A Comprehensive Survey of Synthetic Tabular Data Generation | Code | 1 |
| GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition | Code | 1 |
| Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving | Code | 1 |
Page 2 of 33

Benchmark Results

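| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | corGAN | AUROC | 0.92 | | Unverified |
| 2 | GAN | AUROC | 0.87 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | kiNETGAN | EMD | 0.07 | | Unverified |
| 2 | CTGAN | EMD | 0.07 | | Unverified |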