| Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets | Mar 12, 2025 | Active Learning, Conditional Image Generation | Unverified | 0 |
| Synthetic Data Generation of Body Motion Data by Neural Gas Network for Emotion Recognition | Mar 11, 2025 | Diversity, Emotion Recognition | Code Available | 0 |
| Synthetic Data Generation for Minimum-Exposure Navigation in a Time-Varying Environment using Generative AI Models | Mar 9, 2025 | Synthetic Data Generation | Unverified | 0 |
| Attention-Based Synthetic Data Generation for Calibration-Enhanced Survival Analysis: A Case Study for Chronic Kidney Disease Using Electronic Health Records | Mar 8, 2025 | Survival Analysis, Synthetic Data Generation | Unverified | 0 |
| HILGEN: Hierarchically-Informed Data Generation for Biomedical NER Using Knowledgebases and Large Language Models | Mar 6, 2025 | Data Augmentation, NER | Unverified | 0 |
| CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation | Mar 6, 2025 | Diversity, Synthetic Data Generation | Unverified | 0 |
| A Consensus Privacy Metrics Framework for Synthetic Data | Mar 6, 2025 | Attribute, Synthetic Data Generation | Unverified | 0 |
| Robust Learning of Diverse Code Edits | Mar 5, 2025 | Code Generation, Instruction Following | Unverified | 0 |
| Rethinking Synthetic Data definitions: A privacy driven approach | Mar 5, 2025 | Classification, Synthetic Data Generation | Unverified | 0 |
| Neural Descriptors: Self-Supervised Learning of Robust Local Surface Descriptors Using Polynomial Patches | Mar 5, 2025 | Self-Supervised Learning, Synthetic Data Generation | Code Available | 0 |
| SpinML: Customized Synthetic Data Generation for Private Training of Specialized ML Models | Mar 5, 2025 | Synthetic Data Generation | Unverified | 0 |
| OceanSim: A GPU-Accelerated Underwater Robot Perception Simulation Framework | Mar 3, 2025 | GPU, Sensor Modeling | Unverified | 0 |
| Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation | Feb 27, 2025 | Machine Translation, Synthetic Data Generation | Unverified | 0 |
| On Synthetic Data Strategies for Domain-Specific Generative Retrieval | Feb 25, 2025 | Document Ranking, Retrieval | Unverified | 0 |
| FIG: Forward-Inverse Generation for Low-Resource Domain-specific Event Detection | Feb 24, 2025 | Event Detection, Synthetic Data Generation | Unverified | 0 |
| Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation | Feb 24, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| RewardDS: Privacy-Preserving Fine-Tuning for Large Language Models via Reward Driven Data Synthesis | Feb 23, 2025 | Code Generation, Privacy Preserving | Unverified | 0 |
| MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition | Feb 21, 2025 | 3D Classification, Synthetic Data Generation | Unverified | 0 |
| Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning Models | Feb 21, 2025 | Concept Alignment, RAG | Code Available | 0 |
| Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios | Feb 20, 2025 | 3D Object Detection, Autonomous Driving | Code Available | 0 |
| The Canary's Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text | Feb 19, 2025 | Synthetic Data Generation | Unverified | 0 |
| Theorem Prover as a Judge for Synthetic Data Generation | Feb 18, 2025 | Mathematical Proofs, Mathematical Reasoning | Unverified | 0 |
| From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations | Feb 17, 2025 | Simultaneous Localization and Mapping, Synthetic Data Generation | Unverified | 0 |
| ShapeLib: Designing a library of programmatic 3D shape abstractions with Large Language Models | Feb 13, 2025 | Synthetic Data Generation | Unverified | 0 |
| Typhoon T1: An Open Thai Reasoning Model | Feb 13, 2025 | model, Synthetic Data Generation | Unverified | 0 |
| A Systematic Evaluation of Generative Models on Tabular Transportation Data | Feb 13, 2025 | Synthetic Data Generation | Code Available | 0 |
| Is API Access to LLMs Useful for Generating Private Synthetic Tabular Data? | Feb 10, 2025 | Synthetic Data Generation | Unverified | 0 |
| Scalable and Ethical Insider Threat Detection through Data Synthesis and Analysis by LLMs | Feb 10, 2025 | Diversity, Synthetic Data Generation | Unverified | 0 |
| Generative Distribution Prediction: A Unified Approach to Multimodal Learning | Feb 10, 2025 | Domain Adaptation, Image Captioning | Unverified | 0 |
| Few-shot LLM Synthetic Data with Distribution Matching | Feb 9, 2025 | Attribute, Efficient Exploration | Code Available | 0 |
| Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis | Feb 6, 2025 | Synthetic Data Generation | Code Available | 0 |
| MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation | Feb 6, 2025 | Dataset Generation, Image to 3D | Unverified | 0 |
| Automatic Prompt Optimization Techniques: Exploring the Potential for Synthetic Data Generation | Feb 5, 2025 | Prompt Engineering, Synthetic Data Generation | Unverified | 0 |
| Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 | Feb 5, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| BARE: Leveraging Base Language Models for Few-Shot Synthetic Data Generation | Feb 3, 2025 | Diversity, GSM8K | Unverified | 0 |
| CoddLLM: Empowering Large Language Models for Data Analytics | Feb 1, 2025 | Multiple-choice, Synthetic Data Generation | Unverified | 0 |
| Synthetic Data Generation for Augmenting Small Samples | Jan 30, 2025 | Data Augmentation, Diversity | Unverified | 0 |
| Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction | Jan 27, 2025 | Code Generation, Inductive Bias | Unverified | 0 |
| Making Sense of Data in the Wild: Data Analysis Automation at Scale | Jan 27, 2025 | Benchmarking, Diversity | Unverified | 0 |
| Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement | Jan 23, 2025 | Data Augmentation, Speech Enhancement | Unverified | 0 |
| Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation | Jan 20, 2025 | Diversity, Synthetic Data Generation | Unverified | 0 |
| Data Enrichment Opportunities for Distribution Grid Cable Networks using Variational Autoencoders | Jan 19, 2025 | Feature Importance, Imputation | Unverified | 0 |
| Sequential PatchCore: Anomaly Detection for Surface Inspection using Synthetic Impurities | Jan 16, 2025 | Anomaly Detection, Synthetic Data Generation | Unverified | 0 |
| Generating Realistic Synthetic Head Rotation Data for Extended Reality using Deep Learning | Jan 15, 2025 | Generative Adversarial Network, Synthetic Data Generation | Unverified | 0 |
| Quantum Down Sampling Filter for Variational Auto-encoder | Jan 9, 2025 | Decoder, Image Reconstruction | Code Available | 0 |
| User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation | Jan 8, 2025 | Synthetic Data Generation, User Simulation | Unverified | 0 |
| Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought | Jan 8, 2025 | Synthetic Data Generation | Unverified | 0 |
| SMIR: Efficient Synthetic Data Pipeline To Improve Multi-Image Reasoning | Jan 7, 2025 | Descriptive, Synthetic Data Generation | Code Available | 0 |
| Reading with Intent -- Neutralizing Intent | Jan 7, 2025 | RAG, Retrieval-augmented Generation | Unverified | 0 |
| Advancing the Understanding of Fine-Grained 3D Forest Structures using Digital Cousins and Simulation-to-Reality: Methods and Datasets | Jan 7, 2025 | Data Augmentation, parameter estimation | Unverified | 0 |