| Targeted synthetic data generation for tabular data via hardness characterization | Oct 1, 2024 | Data AugmentationData Valuation | CodeCode Available | 0 |
| Generating Synthetic Data with Locally Estimated Distributions for Disclosure Control | Oct 3, 2022 | ClusteringHyperparameter Optimization | CodeCode Available | 0 |
| RoundTripOCR: A Data Generation Technique for Enhancing Post-OCR Error Correction in Low-Resource Devanagari Languages | Dec 14, 2024 | Machine TranslationOptical Character Recognition | CodeCode Available | 0 |
| Tensor feature hallucination for few-shot learning | Jun 9, 2021 | Data AugmentationFew-Shot Learning | CodeCode Available | 0 |
| A Kernelised Stein Statistic for Assessing Implicit Generative Models | May 31, 2022 | Data AugmentationSynthetic Data Generation | CodeCode Available | 0 |
| Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions | Sep 30, 2019 | Synthetic Data GenerationTime Series | CodeCode Available | 0 |
| SafeSea: Synthetic Data Generation for Adverse & Low Probability Maritime Conditions | Nov 24, 2023 | object-detectionObject Detection | CodeCode Available | 0 |
| Using U-Nets to Create High-Fidelity Virtual Observations of the Solar Corona | Nov 10, 2019 | DecoderImage-to-Image Translation | CodeCode Available | 0 |
| SaGess: Sampling Graph Denoising Diffusion Model for Scalable Graph Generation | Jun 29, 2023 | DenoisingGraph Generation | CodeCode Available | 0 |
| A Little Human Data Goes A Long Way | Oct 17, 2024 | Fact VerificationQuestion Answering | CodeCode Available | 0 |
| UniPoll: A Unified Social Media Poll Generation Framework via Multi-Objective Optimization | Jun 12, 2023 | Answer GenerationPoll Generation | CodeCode Available | 0 |
| Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition | Aug 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Utility Assessment of Synthetic Data Generation Methods | Nov 23, 2022 | ImputationSynthetic Data Generation | CodeCode Available | 0 |
| Beyond Classification: Financial Reasoning in State-of-the-Art Language Models | Apr 30, 2023 | Decision MakingSynthetic Data Generation | CodeCode Available | 0 |
| GEC-DePenD: Non-Autoregressive Grammatical Error Correction with Decoupled Permutation and Decoding | Nov 14, 2023 | DecoderDenoising | CodeCode Available | 0 |
| Synthetic Data Generation for 3D Myocardium Deformation Analysis | Jun 3, 2024 | Optical Flow EstimationSynthetic Data Generation | CodeCode Available | 0 |
| Synthetic data generation for a longitudinal cohort study -- Evaluation, method extension and reproduction of published data analysis results | May 12, 2023 | DescriptiveNutrition | CodeCode Available | 0 |
| Little Giants: Synthesizing High-Quality Embedding Data at Scale | Oct 24, 2024 | Synthetic Data Generation | CodeCode Available | 0 |
| Synthetic Data Generation for Anomaly Detection on Table Grapes | Dec 17, 2024 | Anomaly DetectionClassification | CodeCode Available | 0 |
| WinSyn: A High Resolution Testbed for Synthetic Data | Oct 9, 2023 | Semantic SegmentationSynthetic Data Generation | CodeCode Available | 0 |
| LLM-based Automated Theorem Proving Hinges on Scalable Synthetic Data Generation | May 17, 2025 | Automated Theorem ProvingSynthetic Data Generation | CodeCode Available | 0 |
| Scaling While Privacy Preserving: A Comprehensive Synthetic Tabular Data Generation and Evaluation in Learning Analytics | Jan 12, 2024 | Privacy PreservingSynthetic Data Generation | CodeCode Available | 0 |
| Correlation inference attacks against machine learning models | Dec 16, 2021 | AttributeBIG-bench Machine Learning | CodeCode Available | 0 |
| Data Generation for Neural Programming by Example | Nov 6, 2019 | BIG-bench Machine LearningSynthetic Data Generation | CodeCode Available | 0 |
| Data-driven modeling of time-domain induced polarization | Jul 30, 2021 | DenoisingGeophysics | CodeCode Available | 0 |
| Data Augmentation with Variational Autoencoder for Imbalanced Dataset | Dec 9, 2024 | Data Augmentationregression | CodeCode Available | 0 |
| Unnatural Language Processing: Bridging the Gap Between Synthetic and Natural Language Data | Apr 28, 2020 | SentenceSentence Embeddings | CodeCode Available | 0 |
| LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task | Aug 25, 2024 | Computational EfficiencyImage Augmentation | CodeCode Available | 0 |
| A Generic Deep Architecture for Single Image Reflection Removal and Image Smoothing | Aug 11, 2017 | image smoothingReflection Removal | CodeCode Available | 0 |
| CyclePose -- Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy | Mar 14, 2025 | Instance SegmentationSegmentation | CodeCode Available | 0 |
| Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Apr 11, 2025 | Depth EstimationInstance Segmentation | CodeCode Available | 0 |
| Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation | Jan 3, 2025 | Synthetic Data Generation | CodeCode Available | 0 |
| SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog | Apr 9, 2025 | Synthetic Data Generation | CodeCode Available | 0 |
| A Linear Reconstruction Approach for Attribute Inference Attacks against Synthetic Data | Jan 24, 2023 | AttributeInference Attack | CodeCode Available | 0 |
| A Systematic Evaluation of Generative Models on Tabular Transportation Data | Feb 13, 2025 | Synthetic Data Generation | CodeCode Available | 0 |
| FLAIM: AIM-based Synthetic Data Generation in the Federated Setting | Oct 5, 2023 | Synthetic Data GenerationTabular Data Generation | CodeCode Available | 0 |
| Beyond a Single Mode: GAN Ensembles for Diverse Medical Data Generation | Mar 31, 2025 | DiagnosticDiversity | CodeCode Available | 0 |
| Tiny models from tiny data: Textual and null-text inversion for few-shot distillation | Jun 5, 2024 | Few-Shot Image Classificationimage-classification | CodeCode Available | 0 |
| ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasets | Jun 20, 2022 | BenchmarkingFraud Detection | CodeCode Available | 0 |
| MC-GEN:Multi-level Clustering for Private Synthetic Data Generation | May 28, 2022 | BIG-bench Machine LearningClustering | CodeCode Available | 0 |
| Few-shot_LLM_Synthetic_Data_with_Distribution_Matching | Feb 9, 2025 | AttributeEfficient Exploration | CodeCode Available | 0 |
| Exploring the Limits of Synthetic Creation of Solar EUV Images via Image-to-Image Translation | Aug 19, 2022 | DecoderImage-to-Image Translation | CodeCode Available | 0 |
| Entity-Conditioned Question Generation for Robust Attention Distribution in Neural Information Retrieval | Apr 24, 2022 | Information RetrievalQuestion Generation | CodeCode Available | 0 |
| TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs | Oct 14, 2024 | Synthetic Data Generation | CodeCode Available | 0 |
| A Survey on Deep Learning for Skin Lesion Segmentation | Jun 1, 2022 | Deep LearningLesion Segmentation | CodeCode Available | 0 |
| Convex space learning for tabular synthetic data generation | Jul 13, 2024 | Deep Learningimbalanced classification | CodeCode Available | 0 |
| UnrealROX: An eXtremely Photorealistic Virtual Reality Environment for Robotics Simulations and Synthetic Data Generation | Oct 16, 2018 | Depth Estimationobject-detection | CodeCode Available | 0 |
| MMM and MMMSynth: Clustering of heterogeneous tabular data, and synthetic data generation | Oct 30, 2023 | ClusteringSynthetic Data Generation | CodeCode Available | 0 |
| Towards Algorithmic Fidelity: Mental Health Representation across Demographics in Synthetic vs. Human-generated Data | Mar 25, 2024 | Synthetic Data Generation | CodeCode Available | 0 |
| Synthetic data generation for system identification: leveraging knowledge transfer from similar systems | Mar 8, 2024 | Synthetic Data GenerationTransfer Learning | CodeCode Available | 0 |