SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 351–400 of 822 papers

| Title | Status | Hype |
| --- | --- | --- |
| GUIDE-VAE: Advancing Data Generation with User Information and Pattern Dictionaries | Code | 0 |
| DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | | 0 |
| Enhancing Table Representations with LLM-powered Synthetic Data Generation | | 0 |
| Retrieval-enriched zero-shot image classification in low-resource domains | | 0 |
| Scalable AI Framework for Defect Detection in Metal Additive Manufacturing | | 0 |
| Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities | | 0 |
| Clinical Evaluation of Medical Image Synthesis: A Case Study in Wireless Capsule Endoscopy | | 0 |
| Neural spell-checker: Beyond words with synthetic data generation | Code | 0 |
| Synthetic Data Generation with Large Language Models for Personalized Community Question Answering | Code | 0 |
| Unpicking Data at the Seams: VAEs, Disentanglement and Independent Components | | 0 |
| Evaluating utility in synthetic banking microdata applications | | 0 |
| SoccerGuard: Investigating Injury Risk Factors for Professional Soccer Players with Machine Learning | | 0 |
| zGAN: An Outlier-focused Generative Adversarial Network For Realistic Synthetic Data Generation | | 0 |
| Synthetica: Large Scale Synthetic Data for Robot Perception | | 0 |
| Large Language Model Benchmarks in Medical Tasks | | 0 |
| Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation | | 0 |
| Little Giants: Synthesizing High-Quality Embedding Data at Scale | Code | 0 |
| Privacy-hardened and hallucination-resistant synthetic data generation with logic-solvers | | 0 |
| LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs -- Evaluation through Synthetic Data Generation | | 0 |
| Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Composite Spatial Reasoning | | 0 |
| No more hard prompts: SoftSRV prompting for synthetic data generation | | 0 |
| Synthetic Data Generation for Residential Load Patterns via Recurrent GAN and Ensemble Method | | 0 |
| ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope Questions | Code | 0 |
| CCUP: A Controllable Synthetic Data Generation Pipeline for Pretraining Cloth-Changing Person Re-Identification Models | Code | 0 |
| A Little Human Data Goes A Long Way | Code | 0 |
| Controlled Automatic Task-Specific Synthetic Data Generation for Hallucination Detection | | 0 |
| CONSULT: Contrastive Self-Supervised Learning for Few-shot Tumor Detection | | 0 |
| TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs | Code | 0 |
| LLM-based Code-Switched Text Generation for Grammatical Error Correction | | 0 |
| DFIMat: Decoupled Flexible Interactive Matting in Multi-Person Scenarios | Code | 0 |
| Driving Privacy Forward: Mitigating Information Leakage within Smart Vehicles through Synthetic Data Generation | Code | 0 |
| JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles | | 0 |
| SimpleStrat: Diversifying Language Model Generation with Stratification | | 0 |
| Unsupervised Data Validation Methods for Efficient Model Training | | 0 |
| Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains | | 0 |
| Fill In The Gaps: Model Calibration and Generalization with Synthetic Data | | 0 |
| Privacy Vulnerabilities in Marginals-based Synthetic Data | | 0 |
| Dessie: Disentanglement for Articulated 3D Horse Shape and Pose Estimation from Images | | 0 |
| Auto-GDA: Automatic Domain Adaptation for Efficient Grounding Verification in Retrieval Augmented Generation | | 0 |
| Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective | Code | 0 |
| Targeted synthetic data generation for tabular data via hardness characterization | Code | 0 |
| Improved Generation of Synthetic Imaging Data Using Feature-Aligned Diffusion | Code | 0 |
| Restoring Super-High Resolution GPS Mobility Data | | 0 |
| DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation | Code | 0 |
| DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining | | 0 |
| Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs | | 0 |
| Differentially Private Non Parametric Copulas: Generating synthetic data with non parametric copulas under privacy guarantees | | 0 |
| Artificial Data Point Generation in Clustered Latent Space for Small Medical Datasets | | 0 |
| Preserving logical and functional dependencies in synthetic tabular data | Code | 0 |
| KIPPS: Knowledge infusion in Privacy Preserving Synthetic Data Generation | | 0 |
Page 8 of 17

Benchmark Results

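| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | corGAN | AUROC | 0.92 | | Unverified |
| 2 | GAN | AUROC | 0.87 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | kiNETGAN | EMD | 0.07 | | Unverified |
| 2 | CTGAN | EMD | 0.07 | | Unverified |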