SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 2650 of 822 papers

TitleStatusHype
IP-Dialog: Evaluating Implicit Personalization in Dialogue Systems with Synthetic Data0
Corrigibility as a Singular Target: A Vision for Inherently Reliable Foundation Models0
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and AccountabilityCode1
SMOTE-DP: Improving Privacy-Utility Tradeoff with Synthetic Data0
dpmm: Differentially Private Marginal Models, a Library for Synthetic Tabular Data GenerationCode1
VietMix: A Naturally Occurring Vietnamese-English Code-Mixed Corpus with Iterative Augmentation for Machine Translation0
Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison0
CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis0
StressTest: Can YOUR Speech LM Handle the Stress?0
Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency DetectionCode1
ConText-CIR: Learning from Concepts in Text for Composed Image RetrievalCode1
Phir Hera Fairy: An English Fairytaler is a Strong Faker of Fluent Speech in Low-Resource Indian Languages0
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data GenerationCode4
Improving Heart Rejection Detection in XPCI Images Using Synthetic Data Augmentation0
Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations0
From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data0
A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking0
SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback0
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed EnvironmentsCode0
PIGPVAE: Physics-Informed Gaussian Process Variational Autoencoders0
The Prompt is Mightier than the Example0
Large language model as user daily behavior data generator: balancing population diversity and individual personality0
Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review0
V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel SimulationCode1
Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation0
Show:102550
← PrevPage 2 of 33Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1corGANAUROC0.92Unverified
2GANAUROC0.87Unverified
#ModelMetricClaimedVerifiedStatus
1kiNETGANEMD0.07Unverified
2CTGANEMD0.07Unverified