SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 51100 of 822 papers

TitleStatusHype
AnthroNet: Conditional Generation of Humans via AnthropometricsCode1
Learning Compact Metrics for MTCode1
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMsCode1
Learning from synthetic data generated with GRADECode1
RetailSynth: Synthetic Data Generation for Retail AI Systems EvaluationCode1
Partially Synthetic Data for Recommender Systems: Prediction Performance and Preference HidingCode1
An evaluation framework for synthetic data generation modelsCode1
POV-Surgery: A Dataset for Egocentric Hand and Tool Pose Estimation During Surgical ActivitiesCode1
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic DataCode1
Privacy-Preserving Synthetic Data Generation for Recommendation SystemsCode1
GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity RecognitionCode1
Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic DataCode1
Generating Traffic Scenarios via In-Context Learning to Learn Better Motion PlannerCode1
Generalizing electrocardiogram delineation -- Training convolutional neural networks with synthetic data augmentationCode1
Generating Multidimensional Clusters With Support LinesCode1
Generative Wind Power Curve Modeling Via Machine Vision: A Self-learning Deep Convolutional Network Based MethodCode1
FinDiff: Diffusion Models for Financial Tabular Data GenerationCode1
Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information ExtractionCode1
Exploring Transformer Text Generation for Medical Dataset AugmentationCode1
EC-GAN: Low-Sample Classification using Semi-Supervised Algorithms and GANsCode1
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM GuardrailsCode1
Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapesCode1
GECTurk: Grammatical Error Correction and Detection Dataset for TurkishCode1
Improved Training of Wasserstein GANsCode1
Diffusion-based Conditional ECG Generation with Structured State Space ModelsCode1
Differentially Private Synthetic Medical Data Generation using Convolutional GANsCode1
Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in Challenging DomainsCode1
DFNet: Enhance Absolute Pose Regression with Direct Feature MatchingCode1
A Multifaceted Benchmarking of Synthetic Electronic Health Record Generation ModelsCode1
AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizingCode1
Delving into High-Quality Synthetic Face Occlusion Segmentation DatasetsCode1
EEG Synthetic Data Generation Using Probabilistic Diffusion ModelsCode1
DP-MERF: Differentially Private Mean Embeddings with Random Features for Practical Privacy-Preserving Data GenerationCode1
EPIC: Effective Prompting for Imbalanced-Class Data Synthesis in Tabular Data Classification via Large Language ModelsCode1
Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency DetectionCode1
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop DataCode1
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and AccountabilityCode1
GenerateCT: Text-Conditional Generation of 3D Chest CT VolumesCode1
Generating Synthetic Handwritten Historical Documents With OCR Constrained GANsCode1
Generating tabular datasets under differential privacyCode1
CAD2Render: A Modular Toolkit for GPU-accelerated Photorealistic Synthetic Data Generation for the Manufacturing IndustryCode1
GeoPointGAN: Synthetic Spatial Data with Local Label Differential PrivacyCode1
Data-Free Knowledge Distillation via Feature Exchange and Activation Region ConstraintCode1
DeepNAG: Deep Non-Adversarial Gesture GenerationCode1
BLEUBERI: BLEU is a surprisingly effective reward for instruction followingCode1
DeltaPy: A Framework for Tabular Data Augmentation in PythonCode1
dpart: Differentially Private Autoregressive Tabular, a General Framework for Synthetic Data GenerationCode1
Black-Box Attacks on Sequential Recommenders via Data-Free Model ExtractionCode1
Leveraging Generative AI Models for Synthetic Data Generation in Healthcare: Balancing Research and PrivacyCode1
BhasaAnuvaad: A Speech Translation Dataset for 13 Indian LanguagesCode1
Show:102550
← PrevPage 2 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1corGANAUROC0.92Unverified
2GANAUROC0.87Unverified
#ModelMetricClaimedVerifiedStatus
1kiNETGANEMD0.07Unverified
2CTGANEMD0.07Unverified