SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 201–250 of 822 papers

| Title | Status | Hype |
| --- | --- | --- |
| RoundTripOCR: A Data Generation Technique for Enhancing Post-OCR Error Correction in Low-Resource Devanagari Languages | Code | 0 |
| Generative Zoo | | 0 |
| Bayesian Data Augmentation and Training for Perception DNN in Autonomous Aerial Vehicles | Code | 0 |
| Data Augmentation with Variational Autoencoder for Imbalanced Dataset | Code | 0 |
| Improving text-conditioned latent diffusion for cancer pathology | Code | 0 |
| CALICO: Conversational Agent Localization via Synthetic Data Generation | | 0 |
| A text-to-tabular approach to generate synthetic patient data using LLMs | Code | 0 |
| Give me Some Hard Questions: Synthetic Data Generation for Clinical QA | Code | 0 |
| ALMA: Alignment with Minimal Annotation | | 0 |
| End to End Collaborative Synthetic Data Generation | | 0 |
| Domain-Agnostic Stroke Lesion Segmentation Using Physics-Constrained Synthetic Data | Code | 0 |
| DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining | | 0 |
| Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models | | 0 |
| SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models | Code | 1 |
| Energy-Based Modelling for Discrete and Mixed Data via Heat Equations on Structured Spaces | | 0 |
| MALT: Improving Reasoning with Multi-Agent LLM Training | | 0 |
| Enhancing Amyloid PET Quantification: MRI-Guided Super-Resolution Using Latent Diffusion Models | Code | 0 |
| Needle: A Generative AI-Powered Multi-modal Database for Answering Complex Natural Language Queries | | 0 |
| Well log data generation and imputation using sequence-based generative adversarial networks | | 0 |
| LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes | | 0 |
| MAG-V: A Multi-Agent Framework for Synthetic Data Generation and Verification | | 0 |
| Enhancing Document AI Data Generation Through Graph-Based Synthetic Layouts | | 0 |
| Synthetic Data Generation with LLM for Improved Depression Prediction | | 0 |
| High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR | | 0 |
| AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks | | 0 |
| Beyond Data Scarcity: A Frequency-Driven Framework for Zero-Shot Forecasting | | 0 |
| Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai | Code | 1 |
| LLM for Barcodes: Generating Diverse Synthetic Data for Identity Documents | | 0 |
| Towards a framework on tabular synthetic data generation: a minimalist approach: theory, use cases, and limitations | | 0 |
| Watermarking Generative Categorical Data | | 0 |
| Generation of synthetic gait data: application to multiple sclerosis patients' gait patterns | | 0 |
| Hierarchical Conditional Tabular GAN for Multi-Tabular Synthetic Data Generation | | 0 |
| DRIFTS: Optimizing Domain Randomization with Synthetic Data and Weight Interpolation for Fetal Brain Tissue Segmentation | | 0 |
| Differential Privacy Under Class Imbalance: Methods and Empirical Insights | | 0 |
| Improved Multi-Task Brain Tumour Segmentation with Synthetic Data Augmentation | Code | 2 |
| BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages | Code | 1 |
| Debiasing Synthetic Data Generated by Deep Generative Models | Code | 0 |
| GUIDE-VAE: Advancing Data Generation with User Information and Pattern Dictionaries | Code | 0 |
| DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | | 0 |
| Enhancing Table Representations with LLM-powered Synthetic Data Generation | | 0 |
| Retrieval-enriched zero-shot image classification in low-resource domains | | 0 |
| Scalable AI Framework for Defect Detection in Metal Additive Manufacturing | | 0 |
| Clinical Evaluation of Medical Image Synthesis: A Case Study in Wireless Capsule Endoscopy | | 0 |
| Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities | | 0 |
| Neural spell-checker: Beyond words with synthetic data generation | Code | 0 |
| SoccerGuard: Investigating Injury Risk Factors for Professional Soccer Players with Machine Learning | | 0 |
| Synthetic Data Generation with Large Language Models for Personalized Community Question Answering | Code | 0 |
| Unpicking Data at the Seams: VAEs, Disentanglement and Independent Components | | 0 |
| Evaluating utility in synthetic banking microdata applications | | 0 |
| Large Language Model Benchmarks in Medical Tasks | | 0 |
Page 5 of 17

Benchmark Results

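| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | corGAN | AUROC | 0.92 | | Unverified |
| 2 | GAN | AUROC | 0.87 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | kiNETGAN | EMD | 0.07 | | Unverified |
| 2 | CTGAN | EMD | 0.07 | | Unverified |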