SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 76–100 of 822 papers

Title | Status | Hype
Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing | - | 0
MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS | Code | 1
TarDiff: Target-Oriented Diffusion Guidance for Synthetic Electronic Health Record Time Series Generation | - | 0
Optimizing the Privacy-Utility Balance using Synthetic Data and Configurable Perturbation Pipelines | - | 0
A Comprehensive Survey of Synthetic Tabular Data Generation | Code | 1
ClarifyCoder: Clarification-Aware Fine-Tuning for Programmatic Problem Solving | - | 0
Case Study: Fine-tuning Small Language Models for Accurate and Private CWE Detection in Python Code | - | 0
A Statistical Approach for Synthetic EEG Data Generation | Code | 0
Learning from Reasoning Failures via Synthetic Data Generation | - | 0
Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation | - | 0
MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation | - | 0
Synthetic Data for Blood Vessel Network Extraction | - | 0
Evaluating the Diversity and Quality of LLM Generated Content | - | 0
Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task | - | 0
Leveraging Vertical Public-Private Split for Improved Synthetic Data Generation | - | 0
Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Code | 0
SynthFM: Training Modality-agnostic Foundation Models for Medical Image Segmentation without Real Medical Data | - | 0
Enhancing Metabolic Syndrome Prediction with Hybrid Data Balancing and Counterfactuals | Code | 0
MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection | - | 0
SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog | Code | 0
A Self-Supervised Framework for Space Object Behaviour Characterisation | - | 0
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use | - | 0
CORTEX-AVD: A Framework for CORner Case Testing and EXploration in Autonomous Vehicle Development | - | 0
Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data | - | 0
Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation | - | 0
Page 4 of 33

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | corGAN | AUROC | 0.92 | - | Unverified
2 | GAN | AUROC | 0.87 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | kiNETGAN | EMD | 0.07 | - | Unverified
2 | CTGAN | EMD | 0.07 | - | Unverified