SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 101150 of 822 papers

TitleStatusHype
SynTable: A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop ScenesCode1
Evaluating the Clinical Realism of Synthetic Chest X-Rays Generated Using Progressively Growing GANsCode1
BhasaAnuvaad: A Speech Translation Dataset for 13 Indian LanguagesCode1
AnthroNet: Conditional Generation of Humans via AnthropometricsCode1
Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information ExtractionCode1
Black-Box Attacks on Sequential Recommenders via Data-Free Model ExtractionCode1
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and AccountabilityCode1
Exploring Transformer Text Generation for Medical Dataset AugmentationCode1
Partially Synthetic Data for Recommender Systems: Prediction Performance and Preference HidingCode1
Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention PredictionCode1
BLEUBERI: BLEU is a surprisingly effective reward for instruction followingCode1
GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity RecognitionCode1
DFNet: Enhance Absolute Pose Regression with Direct Feature MatchingCode1
Overcoming Barriers to Data Sharing with Medical Image Generation: A Comprehensive EvaluationCode1
Privacy-preserving data sharing via probabilistic modellingCode1
GECTurk: Grammatical Error Correction and Detection Dataset for TurkishCode1
CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic DataCode1
TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual EnvironmentsCode1
CAD2Render: A Modular Toolkit for GPU-accelerated Photorealistic Synthetic Data Generation for the Manufacturing IndustryCode1
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic DataCode1
MarkushGrapher: Joint Visual and Textual Recognition of Markush StructuresCode1
Generating tabular datasets under differential privacyCode1
MTSS-GAN: Multivariate Time Series Simulation Generative Adversarial NetworksCode1
Generating Traffic Scenarios via In-Context Learning to Learn Better Motion PlannerCode1
UnrealROX+: An Improved Tool for Acquiring Synthetic Data from Virtual 3D EnvironmentsCode1
Using matrix-product states for time-series machine learningCode1
D3A-TS: Denoising-Driven Data Augmentation in Time SeriesCode1
Copula-based synthetic data augmentation for machine-learning emulatorsCode1
RAGSynth: Synthetic Data for Robust and Faithful RAG Component OptimizationCode1
LEyes: A Lightweight Framework for Deep Learning-Based Eye Tracking using Synthetic Eye ImagesCode1
Learning Compact Metrics for MTCode1
ConText-CIR: Learning from Concepts in Text for Composed Image RetrievalCode1
Controllable 3D Generative Adversarial Face Model via Disentangling Shape and AppearanceCode1
Learning from synthetic data generated with GRADECode1
Characterization and Greedy Learning of Gaussian Structural Causal Models under Unknown InterventionsCode1
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and RefinementCode1
CLIPPER: Compression enables long-context synthetic data generationCode1
CorGAN: Correlation-Capturing Convolutional Generative Adversarial Networks for Generating Synthetic Healthcare RecordsCode1
DP-MERF: Differentially Private Mean Embeddings with Random Features for Practical Privacy-Preserving Data GenerationCode1
MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONSCode1
A Comprehensive Survey of Synthetic Tabular Data GenerationCode1
Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic DataCode1
Improved Training of Wasserstein GANsCode1
Leveraging Generative AI Models for Synthetic Data Generation in Healthcare: Balancing Research and PrivacyCode1
Noise-Aware Statistical Inference with Differentially Private Synthetic DataCode1
NViSII: A Scriptable Tool for Photorealistic Image GenerationCode1
Natural Language-Based Synthetic Data Generation for Cluster AnalysisCode1
Data-Free Knowledge Distillation via Feature Exchange and Activation Region ConstraintCode1
SoftAdapt: Techniques for Adaptive Loss Weighting of Neural Networks with Multi-Part Loss FunctionsCode1
Will we run out of data? Limits of LLM scaling based on human-generated dataCode1
Show:102550
← PrevPage 3 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1corGANAUROC0.92Unverified
2GANAUROC0.87Unverified
#ModelMetricClaimedVerifiedStatus
1kiNETGANEMD0.07Unverified
2CTGANEMD0.07Unverified