| AnthroNet: Conditional Generation of Humans via Anthropometrics | Sep 7, 2023 | 3D human pose and shape estimation, 3D Human Reconstruction | Code Available | 1 |
| Learning Compact Metrics for MT | Oct 12, 2021 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 |
| API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs | Feb 23, 2024 | Benchmarking, slot-filling | Code Available | 1 |
| Learning from synthetic data generated with GRADE | May 7, 2023 | Pose Estimation, Synthetic Data Generation | Code Available | 1 |
| RetailSynth: Synthetic Data Generation for Retail AI Systems Evaluation | Dec 21, 2023 | Benchmarking, Product Recommendation | Code Available | 1 |
| Partially Synthetic Data for Recommender Systems: Prediction Performance and Preference Hiding | Aug 9, 2020 | Recommendation Systems, Synthetic Data Generation | Code Available | 1 |
| An evaluation framework for synthetic data generation models | Apr 13, 2024 | Data Augmentation, Synthetic Data Generation | Code Available | 1 |
| POV-Surgery: A Dataset for Egocentric Hand and Tool Pose Estimation During Surgical Activities | Jul 19, 2023 | 3D Hand Pose Estimation, hand-object pose | Code Available | 1 |
| AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data | Mar 7, 2025 | Diversity, Fairness | Code Available | 1 |
| Privacy-Preserving Synthetic Data Generation for Recommendation Systems | Sep 27, 2022 | Privacy Preserving, Recommendation Systems | Code Available | 1 |
| GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition | Apr 1, 2025 | Computational Efficiency, named-entity-recognition | Code Available | 1 |
| Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data | Aug 26, 2020 | Decoder, Music Genre Transfer | Code Available | 1 |
| Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner | Dec 24, 2024 | Autonomous Driving, Dataset Generation | Code Available | 1 |
| Generalizing electrocardiogram delineation -- Training convolutional neural networks with synthetic data augmentation | Nov 25, 2021 | Data Augmentation, Rhythm | Code Available | 1 |
| Generating Multidimensional Clusters With Support Lines | Jan 24, 2023 | Clustering, Synthetic Data Generation | Code Available | 1 |
| Generative Wind Power Curve Modeling Via Machine Vision: A Self-learning Deep Convolutional Network Based Method | Aug 19, 2021 | Benchmarking, Synthetic Data Generation | Code Available | 1 |
| FinDiff: Diffusion Models for Financial Tabular Data Generation | Sep 4, 2023 | Fraud Detection, Synthetic Data Generation | Code Available | 1 |
| Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction | Mar 7, 2023 | Synthetic Data Generation | Code Available | 1 |
| Exploring Transformer Text Generation for Medical Dataset Augmentation | May 1, 2020 | Synthetic Data Generation, Text Generation | Code Available | 1 |
| EC-GAN: Low-Sample Classification using Semi-Supervised Algorithms and GANs | Dec 26, 2020 | Classification, Data Augmentation | Code Available | 1 |
| DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails | Feb 7, 2025 | Reinforcement Learning (RL), Synthetic Data Generation | Code Available | 1 |
| Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes | Jan 29, 2024 | Data Augmentation, Sound Event Localization and Detection | Code Available | 1 |
| GECTurk: Grammatical Error Correction and Detection Dataset for Turkish | Sep 20, 2023 | Articles, Decoder | Code Available | 1 |
| Improved Training of Wasserstein GANs | Mar 31, 2017 | Conditional Image Generation, Image Generation | Code Available | 1 |
| Diffusion-based Conditional ECG Generation with Structured State Space Models | Jan 19, 2023 | State Space Models, Synthetic Data Generation | Code Available | 1 |
| Differentially Private Synthetic Medical Data Generation using Convolutional GANs | Dec 22, 2020 | Deep Learning, image-classification | Code Available | 1 |
| Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in Challenging Domains | Mar 16, 2023 | Human Mesh Recovery, Synthetic Data Generation | Code Available | 1 |
| DFNet: Enhance Absolute Pose Regression with Direct Feature Matching | Apr 1, 2022 | Camera Pose Estimation, Camera Relocalization | Code Available | 1 |
| A Multifaceted Benchmarking of Synthetic Electronic Health Record Generation Models | Aug 2, 2022 | Benchmarking, Synthetic Data Generation | Code Available | 1 |
| AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizing | Oct 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Delving into High-Quality Synthetic Face Occlusion Segmentation Datasets | May 12, 2022 | Segmentation, Synthetic Data Generation | Code Available | 1 |
| EEG Synthetic Data Generation Using Probabilistic Diffusion Models | Mar 6, 2023 | Brain Computer Interface, Data Augmentation | Code Available | 1 |
| DP-MERF: Differentially Private Mean Embeddings with Random Features for Practical Privacy-Preserving Data Generation | Feb 26, 2020 | Privacy Preserving, Sensitivity | Code Available | 1 |
| EPIC: Effective Prompting for Imbalanced-Class Data Synthesis in Tabular Data Classification via Large Language Models | Apr 15, 2024 | In-Context Learning, Synthetic Data Generation | Code Available | 1 |
| Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection | May 28, 2025 | Diversity, Synthetic Data Generation | Code Available | 1 |
| FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data | Jan 28, 2025 | Natural Language Inference, Synthetic Data Generation | Code Available | 1 |
| Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability | Jun 2, 2025 | Descriptive, Synthetic Data Generation | Code Available | 1 |
| GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes | May 25, 2023 | Computed Tomography (CT), Image Generation | Code Available | 1 |
| Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs | Mar 15, 2021 | Optical Character Recognition (OCR), Synthetic Data Generation | Code Available | 1 |
| Generating tabular datasets under differential privacy | Aug 28, 2023 | Synthetic Data Generation | Code Available | 1 |
| CAD2Render: A Modular Toolkit for GPU-accelerated Photorealistic Synthetic Data Generation for the Manufacturing Industry | Nov 25, 2022 | GPU, object-detection | Code Available | 1 |
| GeoPointGAN: Synthetic Spatial Data with Local Label Differential Privacy | May 18, 2022 | Management, Privacy Preserving | Code Available | 1 |
| Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint | Jan 1, 2023 | Data Augmentation, Data-free Knowledge Distillation | Code Available | 1 |
| DeepNAG: Deep Non-Adversarial Gesture Generation | Nov 18, 2020 | Data Augmentation, Dynamic Time Warping | Code Available | 1 |
| BLEUBERI: BLEU is a surprisingly effective reward for instruction following | May 16, 2025 | Instruction Following, Synthetic Data Generation | Code Available | 1 |
| DeltaPy: A Framework for Tabular Data Augmentation in Python | May 22, 2020 | BIG-bench Machine Learning, Data Augmentation | Code Available | 1 |
| dpart: Differentially Private Autoregressive Tabular, a General Framework for Synthetic Data Generation | Jul 12, 2022 | Synthetic Data Generation | Code Available | 1 |
| Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction | Sep 1, 2021 | Data Poisoning, Knowledge Distillation | Code Available | 1 |
| Leveraging Generative AI Models for Synthetic Data Generation in Healthcare: Balancing Research and Privacy | May 9, 2023 | Synthetic Data Generation | Code Available | 1 |
| BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages | Nov 7, 2024 | automatic-speech-translation, Synthetic Data Generation | Code Available | 1 |