| Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees | Sep 18, 2023 | GPUImputation | CodeCode Available | 4 | 5 |
| TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data | Jan 21, 2025 | FairnessImputation | CodeCode Available | 4 | 5 |
| Language Models are Realistic Tabular Data Generators | Oct 12, 2022 | Tabular Data Generation | CodeCode Available | 2 | 5 |
| TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation | Oct 27, 2024 | ImputationTabular Data Generation | CodeCode Available | 2 | 5 |
| Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation | Nov 24, 2022 | FairnessFraud Detection | CodeCode Available | 2 | 5 |
| Generative Table Pre-training Empowers Models for Tabular Prediction | May 16, 2023 | imbalanced classificationImputation | CodeCode Available | 1 | 5 |
| TabFairGAN: Fair Tabular Data Generation with Generative Adversarial Networks | Sep 2, 2021 | Decision MakingFairness | CodeCode Available | 1 | 5 |
| Continuous Diffusion for Mixed-Type Tabular Data | Dec 16, 2023 | Tabular Data Generation | CodeCode Available | 1 | 5 |
| dpmm: Differentially Private Marginal Models, a Library for Synthetic Tabular Data Generation | May 31, 2025 | Synthetic Data GenerationTabular Data Generation | CodeCode Available | 1 | 5 |
| Diffusion Transformers for Tabular Data Time Series Generation | Apr 10, 2025 | Tabular Data GenerationTime Series | CodeCode Available | 1 | 5 |
| Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space | Oct 14, 2023 | Tabular Data Generation | CodeCode Available | 1 | 5 |
| Modeling Tabular data using Conditional GAN | Jul 1, 2019 | Generative Adversarial NetworkTabular Data Generation | CodeCode Available | 1 | 5 |
| EPIC: Effective Prompting for Imbalanced-Class Data Synthesis in Tabular Data Classification via Large Language Models | Apr 15, 2024 | In-Context LearningSynthetic Data Generation | CodeCode Available | 1 | 5 |
| DATGAN: Integrating expert knowledge into deep learning for synthetic tabular data | Mar 7, 2022 | Tabular Data Generation | CodeCode Available | 1 | 5 |
| FedTabDiff: Federated Learning of Diffusion Probabilistic Models for Synthetic Mixed-Type Tabular Data Generation | Jan 11, 2024 | AttributeDenoising | CodeCode Available | 1 | 5 |
| FinDiff: Diffusion Models for Financial Tabular Data Generation | Sep 4, 2023 | Fraud DetectionSynthetic Data Generation | CodeCode Available | 1 | 5 |
| Unmasking Trees for Tabular Data | Jul 8, 2024 | Density EstimationImputation | CodeCode Available | 1 | 5 |
| TabuLa: Harnessing Language Models for Tabular Data Synthesis | Oct 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Scaling Up Diffusion and Flow-based XGBoost Models | Aug 28, 2024 | Tabular Data Generation | CodeCode Available | 1 | 5 |
| TabPFGen -- Tabular Data Generation with TabPFN | Jun 7, 2024 | Data AugmentationImputation | CodeCode Available | 1 | 5 |
| A Comprehensive Survey of Synthetic Tabular Data Generation | Apr 23, 2025 | Privacy PreservingSurvey | CodeCode Available | 1 | 5 |
| ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasets | Jun 20, 2022 | BenchmarkingFraud Detection | CodeCode Available | 0 | 5 |
| MMM and MMMSynth: Clustering of heterogeneous tabular data, and synthetic data generation | Oct 30, 2023 | ClusteringSynthetic Data Generation | CodeCode Available | 0 | 5 |
| Are LLMs Naturally Good at Synthetic Tabular Data Generation? | Jun 20, 2024 | Tabular Data Generation | CodeCode Available | 0 | 5 |
| Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models | Apr 12, 2024 | FairnessTabular Data Generation | CodeCode Available | 0 | 5 |
| CuTS: Customizable Tabular Synthetic Data Generation | Jul 7, 2023 | FairnessSynthetic Data Generation | CodeCode Available | 0 | 5 |
| Preserving logical and functional dependencies in synthetic tabular data | Sep 26, 2024 | AttributeSynthetic Data Generation | CodeCode Available | 0 | 5 |
| Reimagining Synthetic Tabular Data Generation through Data-Centric AI: A Comprehensive Benchmark | Oct 25, 2023 | feature selectionModel Selection | CodeCode Available | 0 | 5 |
| Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints | Feb 25, 2025 | Tabular Data Generation | CodeCode Available | 0 | 5 |
| Scaling While Privacy Preserving: A Comprehensive Synthetic Tabular Data Generation and Evaluation in Learning Analytics | Jan 12, 2024 | Privacy PreservingSynthetic Data Generation | CodeCode Available | 0 | 5 |
| Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class | Dec 20, 2024 | imbalanced classificationTabular Data Generation | CodeCode Available | 0 | 5 |
| Synthetic Tabular Data Generation for Class Imbalance and Fairness: A Comparative Study | Sep 8, 2024 | FairnessTabular Data Generation | CodeCode Available | 0 | 5 |
| TabGen-ICL: Residual-Aware In-Context Example Selection for Tabular Data Generation | Feb 23, 2025 | In-Context LearningTabular Data Generation | CodeCode Available | 0 | 5 |
| TabRep: a Simple and Effective Continuous Representation for Training Tabular Diffusion Models | Apr 7, 2025 | Tabular Data Generation | CodeCode Available | 0 | 5 |
| TabSynDex: A Universal Metric for Robust Evaluation of Synthetic Tabular Data | Jul 12, 2022 | Tabular Data Generation | CodeCode Available | 0 | 5 |
| Tabular Data Generation using Binary Diffusion | Sep 20, 2024 | Tabular Data Generation | CodeCode Available | 0 | 5 |
| Tabular data generation with tensor contraction layers and transformers | Dec 6, 2024 | Density EstimationTabular Data Generation | CodeCode Available | 0 | 5 |
| Tabular GANs for uneven distribution | Oct 1, 2020 | Image GenerationTabular Data Generation | CodeCode Available | 0 | 5 |
| DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators | Dec 3, 2024 | Tabular Data Generation | CodeCode Available | 0 | 5 |
| CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation | Jun 17, 2025 | Tabular Data Generation | CodeCode Available | 0 | 5 |
| FLAIM: AIM-based Synthetic Data Generation in the Federated Setting | Oct 5, 2023 | Synthetic Data GenerationTabular Data Generation | CodeCode Available | 0 | 5 |
| Artificial Inductive Bias for Synthetic Tabular Data Generation in Data-Scarce Scenarios | Jul 3, 2024 | Generative Adversarial NetworkInductive Bias | CodeCode Available | 0 | 5 |
| Generating Tabular Data Using Heterogeneous Sequential Feature Forest Flow Matching | Oct 20, 2024 | Tabular Data Generation | CodeCode Available | 0 | 5 |
| Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation | Feb 20, 2025 | Generative Adversarial NetworkLanguage Modeling | CodeCode Available | 0 | 5 |
| Graph Conditional Flow Matching for Relational Data Generation | May 21, 2025 | Graph Neural NetworkTabular Data Generation | CodeCode Available | 0 | 5 |
| A Note on Statistically Accurate Tabular Data Generation Using Large Language Models | May 5, 2025 | Tabular Data Generation | CodeCode Available | 0 | 5 |
| LLM-TabFlow: Synthetic Tabular Data Generation with Inter-column Logical Relationship Preservation | Mar 4, 2025 | Large Language ModelTabular Data Generation | CodeCode Available | 0 | 5 |
| PiShield: A PyTorch Package for Learning with Requirements | Feb 28, 2024 | Autonomous DrivingDeep Learning | —Unverified | 0 | 0 |
| A self-attention-based differentially private tabular GAN with high data utility | Dec 20, 2023 | Generative Adversarial NetworkImage Generation | —Unverified | 0 | 0 |
| Assessing Generative Models for Structured Data | Mar 26, 2025 | Synthetic Data GenerationTabular Data Generation | —Unverified | 0 | 0 |