| Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees | Sep 18, 2023 | GPUImputation | CodeCode Available | 4 |
| TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data | Jan 21, 2025 | FairnessImputation | CodeCode Available | 4 |
| Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation | Nov 24, 2022 | FairnessFraud Detection | CodeCode Available | 2 |
| Language Models are Realistic Tabular Data Generators | Oct 12, 2022 | Tabular Data Generation | CodeCode Available | 2 |
| TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation | Oct 27, 2024 | ImputationTabular Data Generation | CodeCode Available | 2 |
| FinDiff: Diffusion Models for Financial Tabular Data Generation | Sep 4, 2023 | Fraud DetectionSynthetic Data Generation | CodeCode Available | 1 |
| Modeling Tabular data using Conditional GAN | Jul 1, 2019 | Generative Adversarial NetworkTabular Data Generation | CodeCode Available | 1 |
| Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space | Oct 14, 2023 | Tabular Data Generation | CodeCode Available | 1 |
| A Comprehensive Survey of Synthetic Tabular Data Generation | Apr 23, 2025 | Privacy PreservingSurvey | CodeCode Available | 1 |
| Generative Table Pre-training Empowers Models for Tabular Prediction | May 16, 2023 | imbalanced classificationImputation | CodeCode Available | 1 |
| Unmasking Trees for Tabular Data | Jul 8, 2024 | Density EstimationImputation | CodeCode Available | 1 |
| Continuous Diffusion for Mixed-Type Tabular Data | Dec 16, 2023 | Tabular Data Generation | CodeCode Available | 1 |
| DATGAN: Integrating expert knowledge into deep learning for synthetic tabular data | Mar 7, 2022 | Tabular Data Generation | CodeCode Available | 1 |
| TabuLa: Harnessing Language Models for Tabular Data Synthesis | Oct 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TabPFGen -- Tabular Data Generation with TabPFN | Jun 7, 2024 | Data AugmentationImputation | CodeCode Available | 1 |
| Scaling Up Diffusion and Flow-based XGBoost Models | Aug 28, 2024 | Tabular Data Generation | CodeCode Available | 1 |
| Diffusion Transformers for Tabular Data Time Series Generation | Apr 10, 2025 | Tabular Data GenerationTime Series | CodeCode Available | 1 |
| dpmm: Differentially Private Marginal Models, a Library for Synthetic Tabular Data Generation | May 31, 2025 | Synthetic Data GenerationTabular Data Generation | CodeCode Available | 1 |
| TabFairGAN: Fair Tabular Data Generation with Generative Adversarial Networks | Sep 2, 2021 | Decision MakingFairness | CodeCode Available | 1 |
| EPIC: Effective Prompting for Imbalanced-Class Data Synthesis in Tabular Data Classification via Large Language Models | Apr 15, 2024 | In-Context LearningSynthetic Data Generation | CodeCode Available | 1 |
| FedTabDiff: Federated Learning of Diffusion Probabilistic Models for Synthetic Mixed-Type Tabular Data Generation | Jan 11, 2024 | AttributeDenoising | CodeCode Available | 1 |
| PiShield: A PyTorch Package for Learning with Requirements | Feb 28, 2024 | Autonomous DrivingDeep Learning | —Unverified | 0 |
| A self-attention-based differentially private tabular GAN with high data utility | Dec 20, 2023 | Generative Adversarial NetworkImage Generation | —Unverified | 0 |
| Assessing Generative Models for Structured Data | Mar 26, 2025 | Synthetic Data GenerationTabular Data Generation | —Unverified | 0 |
| A Survey on Tabular Data Generation: Utility, Alignment, Fidelity, Privacy, and Beyond | Mar 7, 2025 | NavigatePrivacy Preserving | —Unverified | 0 |
| Causal-TGAN: Causally-Aware Synthetic Tabular Data Generative Adversarial Network | Sep 29, 2021 | Generative Adversarial NetworkImage Generation | —Unverified | 0 |
| Comparing Synthetic Tabular Data Generation Between a Probabilistic Model and a Deep Learning Model for Education Use Cases | Oct 16, 2022 | Deep LearningGenerative Adversarial Network | —Unverified | 0 |
| Composable Generative Models | Feb 18, 2021 | ImputationPrivacy Preserving | —Unverified | 0 |
| CTSyn: A Foundational Model for Cross Tabular Data Generation | Jun 7, 2024 | DiversitySynthetic Data Generation | —Unverified | 0 |
| Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation | Dec 20, 2024 | DenoisingFederated Learning | —Unverified | 0 |
| Differentially Private Tabular Data Synthesis using Large Language Models | Jun 3, 2024 | FairnessTabular Data Generation | —Unverified | 0 |
| DP-TBART: A Transformer-based Autoregressive Model for Differentially Private Tabular Data Generation | Jul 19, 2023 | Deep LearningTabular Data Generation | —Unverified | 0 |
| Generating Realistic Tabular Data with Large Language Models | Oct 29, 2024 | Tabular Data Generation | —Unverified | 0 |
| Generative Forests | Aug 7, 2023 | Density EstimationImputation | —Unverified | 0 |
| GReaTER: Generate Realistic Tabular data after data Enhancement and Reduction | Mar 19, 2025 | In-Context LearningTabular Data Generation | —Unverified | 0 |
| High-Quality Tabular Data Generation using Post-Selected VAE | Jul 17, 2024 | Tabular Data Generation | —Unverified | 0 |
| MargCTGAN: A "Marginally'' Better CTGAN for the Low Sample Regime | Jul 16, 2023 | Tabular Data Generation | —Unverified | 0 |
| On The Role of Prompt Construction In Enhancing Efficacy and Efficiency of LLM-Based Tabular Data Generation | Sep 6, 2024 | Tabular Data Generation | —Unverified | 0 |
| On the Usefulness of Synthetic Tabular Data Generation | Jun 27, 2023 | Data AugmentationData Summarization | —Unverified | 0 |
| HARMONIC: Harnessing LLMs for Tabular Data Synthesis and Privacy Protection | Aug 6, 2024 | Privacy PreservingSynthetic Data Generation | —Unverified | 0 |
| ResBit: Residual Bit Vector for Categorical Values | Sep 29, 2023 | Image ClassificationTabular Data Generation | —Unverified | 0 |
| TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer | Jan 2, 2025 | BenchmarkingQuantization | —Unverified | 0 |
| TAEGAN: Generating Synthetic Tabular Data For Data Augmentation | Oct 2, 2024 | Data AugmentationGenerative Adversarial Network | —Unverified | 0 |
| The Prompt is Mightier than the Example | May 24, 2025 | In-Context LearningSynthetic Data Generation | —Unverified | 0 |
| Towards a framework on tabular synthetic data generation: a minimalist approach: theory, use cases, and limitations | Nov 17, 2024 | DecoderSynthetic Data Generation | —Unverified | 0 |
| Understanding and Mitigating Memorization in Diffusion Models for Tabular Data | Dec 15, 2024 | Data AugmentationMemorization | —Unverified | 0 |
| Under the Hood of Tabular Data Generation Models: Benchmarks with Extensive Tuning | Jun 18, 2024 | GPUHyperparameter Optimization | —Unverified | 0 |
| CuTS: Customizable Tabular Synthetic Data Generation | Jul 7, 2023 | FairnessSynthetic Data Generation | CodeCode Available | 0 |
| Reimagining Synthetic Tabular Data Generation through Data-Centric AI: A Comprehensive Benchmark | Oct 25, 2023 | feature selectionModel Selection | CodeCode Available | 0 |
| A Note on Statistically Accurate Tabular Data Generation Using Large Language Models | May 5, 2025 | Tabular Data Generation | CodeCode Available | 0 |