SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 251300 of 822 papers

TitleStatusHype
Synthetica: Large Scale Synthetic Data for Robot Perception0
zGAN: An Outlier-focused Generative Adversarial Network For Realistic Synthetic Data Generation0
Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation0
Little Giants: Synthesizing High-Quality Embedding Data at ScaleCode0
Privacy-hardened and hallucination-resistant synthetic data generation with logic-solvers0
No more hard prompts: SoftSRV prompting for synthetic data generation0
LLM4GRN: Discovering Causal Gene Regulatory Networks with LLMs -- Evaluation through Synthetic Data Generation0
Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Composite Spatial Reasoning0
Synthetic Data Generation for Residential Load Patterns via Recurrent GAN and Ensemble Method0
ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope QuestionsCode0
CCUP: A Controllable Synthetic Data Generation Pipeline for Pretraining Cloth-Changing Person Re-Identification ModelsCode0
A Little Human Data Goes A Long WayCode0
Controlled Automatic Task-Specific Synthetic Data Generation for Hallucination Detection0
CONSULT: Contrastive Self-Supervised Learning for Few-shot Tumor Detection0
LLM-based Code-Switched Text Generation for Grammatical Error Correction0
TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMsCode0
DFIMat: Decoupled Flexible Interactive Matting in Multi-Person ScenariosCode0
Driving Privacy Forward: Mitigating Information Leakage within Smart Vehicles through Synthetic Data GenerationCode0
SimpleStrat: Diversifying Language Model Generation with Stratification0
JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles0
Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains0
Unsupervised Data Validation Methods for Efficient Model Training0
Fill In The Gaps: Model Calibration and Generalization with Synthetic Data0
Privacy Vulnerabilities in Marginals-based Synthetic Data0
SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image ClassificationCode1
Auto-GDA: Automatic Domain Adaptation for Efficient Grounding Verification in Retrieval Augmented Generation0
Dessie: Disentanglement for Articulated 3D Horse Shape and Pose Estimation from Images0
Training Language Models on Synthetic Edit Sequences Improves Code SynthesisCode1
Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck PerspectiveCode0
Restoring Super-High Resolution GPS Mobility Data0
Targeted synthetic data generation for tabular data via hardness characterizationCode0
Improved Generation of Synthetic Imaging Data Using Feature-Aligned DiffusionCode0
DreamStruct: Understanding Slides and User Interfaces via Synthetic Data GenerationCode0
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining0
Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs0
Differentially Private Non Parametric Copulas: Generating synthetic data with non parametric copulas under privacy guarantees0
Preserving logical and functional dependencies in synthetic tabular dataCode0
Artificial Data Point Generation in Clustered Latent Space for Small Medical Datasets0
KIPPS: Knowledge infusion in Privacy Preserving Synthetic Data Generation0
Towards Synthetic Data Generation for Improved Pain Recognition in Videos under Patient ConstraintsCode0
MANTA -- Model Adapter Native generations that's Affordable0
CraftRTL: High-quality Synthetic Data Generation for Verilog Code Models with Correct-by-Construction Non-Textual Representations and Targeted Code RepairCode0
Making Large Language Models into World Models with Precondition and Effect Knowledge0
Harnessing LLMs for API Interactions: A Framework for Classification and Synthetic Data Generation0
Qwen2.5-Coder Technical ReportCode11
Synthetic data augmentation for robotic mobility aids to support blind and low vision people0
Enhanced segmentation of femoral bone metastasis in CT scans of patients using synthetic data generation with 3D diffusion models0
SynSUM -- Synthetic Benchmark with Structured and Unstructured Medical RecordsCode0
Generated Data with Fake Privacy: Hidden Dangers of Fine-tuning Large Language Models on Generated Data0
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources0
Show:102550
← PrevPage 6 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1corGANAUROC0.92Unverified
2GANAUROC0.87Unverified
#ModelMetricClaimedVerifiedStatus
1kiNETGANEMD0.07Unverified
2CTGANEMD0.07Unverified