SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 351400 of 822 papers

TitleStatusHype
Nemotron-4 340B Technical ReportCode4
MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data0
GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR0
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics0
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey0
Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming0
SimGen: Simulator-conditioned Driving Scene Generation0
A Synthetic Dataset for Personal Attribute InferenceCode2
Curating Grounded Synthetic Data with Global Perspectives for Equitable AI0
SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection0
DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection0
Enhancing human action recognition with GAN-based data augmentationCode0
CTSyn: A Foundational Model for Cross Tabular Data Generation0
Enhancing Indoor Temperature Forecasting through Synthetic Data in Low-Data Environments0
Synthetic Oversampling: Theory and A Practical Approach Using LLMs to Address Data ImbalanceCode0
Tiny models from tiny data: Textual and null-text inversion for few-shot distillationCode0
ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction0
Synthetic Data Outliers: Navigating Identity Disclosure0
Enhancing Clinical Documentation with Synthetic Data: Leveraging Generative Models for Improved Accuracy0
Synthetic Data Generation for 3D Myocardium Deformation AnalysisCode0
GenPalm: Contactless Palmprint Generation with Diffusion Models0
MegActor: Harness the Power of Raw Video for Vivid Portrait AnimationCode4
Masked Language Modeling Becomes Conditional Density Estimation for Tabular Data Synthesis0
Leveraging Open-Source Large Language Models for encoding Social Determinants of Health using an Intelligent Router0
Differentially Private Synthetic Data Generation for Relational DatabasesCode0
Interpretable classification of wiki-review streams0
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark0
Conditioning on Time is All You Need for Synthetic Survival Data GenerationCode0
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models0
KiNETGAN: Enabling Distributed Network Intrusion Detection through Knowledge-Infused Synthetic Data Generation0
Synthetic Data Generation for Intersectional Fairness by Leveraging Hierarchical Group Structure0
Diverse and Effective Synthetic Data Generation for Adaptable Zero-Shot Dialogue State Tracking0
End-to-End Full-Page Optical Music Recognition for Pianoform Sheet MusicCode2
Advancing fNIRS Neuroimaging through Synthetic Data Generation and Machine Learning Applications0
SynthesizRR: Generating Diverse Datasets with Retrieval AugmentationCode1
Prompting-based Synthetic Data Generation for Few-Shot Question AnsweringCode0
Permissioned Blockchain-based Framework for Ranking Synthetic Data Generators0
Inference With Combining Rules From Multiple Differentially Private Synthetic Datasets0
Clustering of Disease Trajectories with Explainable Machine Learning: A Case Study on Postoperative Delirium Phenotypes0
Comparative study of models trained on synthetic data for Ukrainian grammatical error correctionCode0
Domain-Transferred Synthetic Data Generation for Improving Monocular Depth Estimation0
Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure0
Online Data Augmentation for Forecasting with Deep LearningCode0
Privacy-Preserving Statistical Data Generation: Application to Sepsis Detection0
Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language ModelsCode1
UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues0
Better Synthetic Data by Retrieving and Transforming Existing DatasetsCode7
Bt-GAN: Generating Fair Synthetic Healthdata via Bias-transforming Generative Adversarial Networks0
A Multi-Faceted Evaluation Framework for Assessing Synthetic Data Generated by Large Language ModelsCode0
Aligning Actions and Walking to LLM-Generated Textual DescriptionsCode0
Show:102550
← PrevPage 8 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1corGANAUROC0.92Unverified
2GANAUROC0.87Unverified
#ModelMetricClaimedVerifiedStatus
1kiNETGANEMD0.07Unverified
2CTGANEMD0.07Unverified