| Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation Framework | Apr 2, 2025 | Benchmarking, Synthetic Data Generation | Code Available | 2 |
| TAMIS: Tailored Membership Inference Attacks on Synthetic Data | Apr 1, 2025 | Synthetic Data Generation | Unverified | 0 |
| GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition | Apr 1, 2025 | Computational Efficiency, named-entity-recognition | Code Available | 1 |
| Synthetic News Generation for Fake News Classification | Mar 31, 2025 | Articles, Classification | Unverified | 0 |
| Beyond a Single Mode: GAN Ensembles for Diverse Medical Data Generation | Mar 31, 2025 | Diagnostic, Diversity | Code Available | 0 |
| XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation | Mar 29, 2025 | 8k, Synthetic Data Generation | Unverified | 0 |
| Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications | Mar 26, 2025 | Articles, Data Augmentation | Unverified | 0 |
| Assessing Generative Models for Structured Data | Mar 26, 2025 | Synthetic Data Generation, Tabular Data Generation | Unverified | 0 |
| Scaling Laws of Synthetic Data for Language Models | Mar 25, 2025 | Synthetic Data Generation | Unverified | 0 |
| Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving | Mar 23, 2025 | 3DGS, Autonomous Driving | Code Available | 1 |
| Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation | Mar 21, 2025 | Dataset Generation, Graph Generation | Unverified | 0 |
| MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures | Mar 20, 2025 | Synthetic Data Generation | Code Available | 1 |
| Project Jenkins: Turning Monkey Neural Data into Robotic Arm Movement, and Back | Mar 19, 2025 | Synthetic Data Generation | Unverified | 0 |
| ELTEX: A Framework for Domain-Driven Synthetic Data Generation | Mar 19, 2025 | Synthetic Data Generation, Transfer Learning | Code Available | 0 |
| Synthetic Data Generation Using Large Language Models: Advances in Text and Code | Mar 18, 2025 | Code Translation, Prompt Engineering | Unverified | 0 |
| AugGen: Synthetic Augmentation Can Improve Discriminative Models | Mar 14, 2025 | Face Recognition, Synthetic Data Generation | Unverified | 0 |
| CyclePose -- Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy | Mar 14, 2025 | Instance Segmentation, Segmentation | Code Available | 0 |
| Local Look-Ahead Guidance via Verifier-in-the-Loop for Automated Theorem Proving | Mar 12, 2025 | Automated Theorem Proving, Reinforcement Learning (RL) | Unverified | 0 |
| Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks | Mar 12, 2025 | Synthetic Data Generation | Unverified | 0 |
| Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets | Mar 12, 2025 | Active Learning, Conditional Image Generation | Unverified | 0 |
| Synthetic Data Generation of Body Motion Data by Neural Gas Network for Emotion Recognition | Mar 11, 2025 | Diversity, Emotion Recognition | Code Available | 0 |
| Mellow: a small audio language model for reasoning | Mar 11, 2025 | Audio captioning, Language Modeling | Code Available | 2 |
| Synthetic Data Generation for Minimum-Exposure Navigation in a Time-Varying Environment using Generative AI Models | Mar 9, 2025 | Synthetic Data Generation | Unverified | 0 |
| Attention-Based Synthetic Data Generation for Calibration-Enhanced Survival Analysis: A Case Study for Chronic Kidney Disease Using Electronic Health Records | Mar 8, 2025 | Survival Analysis, Synthetic Data Generation | Unverified | 0 |
| AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data | Mar 7, 2025 | Diversity, Fairness | Code Available | 1 |
| HILGEN: Hierarchically-Informed Data Generation for Biomedical NER Using Knowledgebases and Large Language Models | Mar 6, 2025 | Data Augmentation, NER | Unverified | 0 |
| CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation | Mar 6, 2025 | Diversity, Synthetic Data Generation | Unverified | 0 |
| A Consensus Privacy Metrics Framework for Synthetic Data | Mar 6, 2025 | Attribute, Synthetic Data Generation | Unverified | 0 |
| Neural Descriptors: Self-Supervised Learning of Robust Local Surface Descriptors Using Polynomial Patches | Mar 5, 2025 | Self-Supervised Learning, Synthetic Data Generation | Code Available | 0 |
| Rethinking Synthetic Data definitions: A privacy driven approach | Mar 5, 2025 | Classification, Synthetic Data Generation | Unverified | 0 |
| SpinML: Customized Synthetic Data Generation for Private Training of Specialized ML Models | Mar 5, 2025 | Synthetic Data Generation | Unverified | 0 |
| Robust Learning of Diverse Code Edits | Mar 5, 2025 | Code Generation, Instruction Following | Unverified | 0 |
| OceanSim: A GPU-Accelerated Underwater Robot Perception Simulation Framework | Mar 3, 2025 | GPU, Sensor Modeling | Unverified | 0 |
| Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation | Feb 27, 2025 | Machine Translation, Synthetic Data Generation | Unverified | 0 |
| On Synthetic Data Strategies for Domain-Specific Generative Retrieval | Feb 25, 2025 | Document Ranking, Retrieval | Unverified | 0 |
| FIG: Forward-Inverse Generation for Low-Resource Domain-specific Event Detection | Feb 24, 2025 | Event Detection, Synthetic Data Generation | Unverified | 0 |
| Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation | Feb 24, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| RewardDS: Privacy-Preserving Fine-Tuning for Large Language Models via Reward Driven Data Synthesis | Feb 23, 2025 | Code Generation, Privacy Preserving | Unverified | 0 |
| MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition | Feb 21, 2025 | 3D Classification, Synthetic Data Generation | Unverified | 0 |
| Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning Models | Feb 21, 2025 | Concept Alignment, RAG | Code Available | 0 |
| Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios | Feb 20, 2025 | 3D Object Detection, Autonomous Driving | Code Available | 0 |
| CLIPPER: Compression enables long-context synthetic data generation | Feb 20, 2025 | Claim Verification, Synthetic Data Generation | Code Available | 1 |
| The Canary's Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text | Feb 19, 2025 | Synthetic Data Generation | Unverified | 0 |
| Theorem Prover as a Judge for Synthetic Data Generation | Feb 18, 2025 | Mathematical Proofs, Mathematical Reasoning | Unverified | 0 |
| From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations | Feb 17, 2025 | Simultaneous Localization and Mapping, Synthetic Data Generation | Unverified | 0 |
| A Systematic Evaluation of Generative Models on Tabular Transportation Data | Feb 13, 2025 | Synthetic Data Generation | Code Available | 0 |
| Typhoon T1: An Open Thai Reasoning Model | Feb 13, 2025 | model, Synthetic Data Generation | Unverified | 0 |
| ShapeLib: Designing a library of programmatic 3D shape abstractions with Large Language Models | Feb 13, 2025 | Synthetic Data Generation | Unverified | 0 |
| Generative Distribution Prediction: A Unified Approach to Multimodal Learning | Feb 10, 2025 | Domain Adaptation, Image Captioning | Unverified | 0 |
| Is API Access to LLMs Useful for Generating Private Synthetic Tabular Data? | Feb 10, 2025 | Synthetic Data Generation | Unverified | 0 |