SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 101-150 of 822 papers

Title | Status | Hype
DeltaPy: A Framework for Tabular Data Augmentation in Python | Code | 1
SynTable: A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop Scenes | Code | 1
Privacy-preserving data sharing via probabilistic modelling | Code | 1
AnthroNet: Conditional Generation of Humans via Anthropometrics | Code | 1
Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction | Code | 1
Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction | Code | 1
Partially Synthetic Data for Recommender Systems: Prediction Performance and Preference Hiding | Code | 1
Exploring Transformer Text Generation for Medical Dataset Augmentation | Code | 1
Boosting Synthetic Data Generation with Effective Nonlinear Causal Discovery | Code | 1
Synthetic Data Generation for Grammatical Error Correction with Tagged Corruption Models | Code | 1
Overcoming Barriers to Data Sharing with Medical Image Generation: A Comprehensive Evaluation | Code | 1
TimeVAE: A Variational Auto-Encoder for Multivariate Time Series Generation | Code | 1
POV-Surgery: A Dataset for Egocentric Hand and Tool Pose Estimation During Surgical Activities | Code | 1
Noise-Aware Statistical Inference with Differentially Private Synthetic Data | Code | 1
BLEUBERI: BLEU is a surprisingly effective reward for instruction following | Code | 1
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish | Code | 1
NViSII: A Scriptable Tool for Photorealistic Image Generation | Code | 1
TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments | Code | 1
CAD2Render: A Modular Toolkit for GPU-accelerated Photorealistic Synthetic Data Generation for the Manufacturing Industry | Code | 1
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data | Code | 1
DFNet: Enhance Absolute Pose Regression with Direct Feature Matching | Code | 1
GeoPointGAN: Synthetic Spatial Data with Local Label Differential Privacy | Code | 1
CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data | Code | 1
Generating tabular datasets under differential privacy | Code | 1
Using matrix-product states for time-series machine learning | Code | 1
Generative Wind Power Curve Modeling Via Machine Vision: A Self-learning Deep Convolutional Network Based Method | Code | 1
LEyes: A Lightweight Framework for Deep Learning-Based Eye Tracking using Synthetic Eye Images | Code | 1
MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS | Code | 1
D3A-TS: Denoising-Driven Data Augmentation in Time Series | Code | 1
RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization | Code | 1
Leveraging Generative AI Models for Synthetic Data Generation in Healthcare: Balancing Research and Privacy | Code | 1
MTSS-GAN: Multivariate Time Series Simulation Generative Adversarial Networks | Code | 1
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval | Code | 1
Controllable 3D Generative Adversarial Face Model via Disentangling Shape and Appearance | Code | 1
Improved Training of Wasserstein GANs | Code | 1
Copula-based synthetic data augmentation for machine-learning emulators | Code | 1
CLIPPER: Compression enables long-context synthetic data generation | Code | 1
CorGAN: Correlation-Capturing Convolutional Generative Adversarial Networks for Generating Synthetic Healthcare Records | Code | 1
Learning Compact Metrics for MT | Code | 1
MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures | Code | 1
Characterization and Greedy Learning of Gaussian Structural Causal Models under Unknown Interventions | Code | 1
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Code | 1
Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data | Code | 1
DP-MERF: Differentially Private Mean Embeddings with Random Features for Practical Privacy-Preserving Data Generation | Code | 1
A Comprehensive Survey of Synthetic Tabular Data Generation | Code | 1
Learning from synthetic data generated with GRADE | Code | 1
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability | Code | 1
Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint | Code | 1
SocialDial: A Benchmark for Socially-Aware Dialogue Systems | Code | 1
Will we run out of data? Limits of LLM scaling based on human-generated data | Code | 1
Page 3 of 17

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | corGAN | AUROC | 0.92 | - | Unverified
2 | GAN | AUROC | 0.87 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | kiNETGAN | EMD | 0.07 | - | Unverified
2 | CTGAN | EMD | 0.07 | - | Unverified