| Scalable and Ethical Insider Threat Detection through Data Synthesis and Analysis by LLMs | Feb 10, 2025 | DiversitySynthetic Data Generation | —Unverified | 0 |
| Few-shot_LLM_Synthetic_Data_with_Distribution_Matching | Feb 9, 2025 | AttributeEfficient Exploration | CodeCode Available | 0 |
| DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails | Feb 7, 2025 | Reinforcement Learning (RL)Synthetic Data Generation | CodeCode Available | 1 |
| MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation | Feb 6, 2025 | Dataset GenerationImage to 3D | —Unverified | 0 |
| Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis | Feb 6, 2025 | Synthetic Data Generation | CodeCode Available | 0 |
| Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 | Feb 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automatic Prompt Optimization Techniques: Exploring the Potential for Synthetic Data Generation | Feb 5, 2025 | Prompt EngineeringSynthetic Data Generation | —Unverified | 0 |
| SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset | Feb 4, 2025 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| BARE: Leveraging Base Language Models for Few-Shot Synthetic Data Generation | Feb 3, 2025 | DiversityGSM8K | —Unverified | 0 |
| CoddLLM: Empowering Large Language Models for Data Analytics | Feb 1, 2025 | Multiple-choiceSynthetic Data Generation | —Unverified | 0 |
| XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses | Jan 31, 2025 | Action LocalizationAction Recognition | CodeCode Available | 1 |
| Synthetic Data Generation for Augmenting Small Samples | Jan 30, 2025 | Data AugmentationDiversity | —Unverified | 0 |
| FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data | Jan 28, 2025 | Natural Language InferenceSynthetic Data Generation | CodeCode Available | 1 |
| Making Sense of Data in the Wild: Data Analysis Automation at Scale | Jan 27, 2025 | BenchmarkingDiversity | —Unverified | 0 |
| Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction | Jan 27, 2025 | Code GenerationInductive Bias | —Unverified | 0 |
| Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement | Jan 23, 2025 | Data AugmentationSpeech Enhancement | —Unverified | 0 |
| TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data | Jan 21, 2025 | FairnessImputation | CodeCode Available | 4 |
| Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Jan 21, 2025 | Synthetic Data GenerationWorld Knowledge | CodeCode Available | 1 |
| Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation | Jan 20, 2025 | DiversitySynthetic Data Generation | —Unverified | 0 |
| Synthetic Data Generation by Supervised Neural Gas Network for Physiological Emotion Recognition Data | Jan 19, 2025 | EEGEmotion Recognition | CodeCode Available | 1 |
| Data Enrichment Opportunities for Distribution Grid Cable Networks using Variational Autoencoders | Jan 19, 2025 | Feature ImportanceImputation | —Unverified | 0 |
| Sequential PatchCore: Anomaly Detection for Surface Inspection using Synthetic Impurities | Jan 16, 2025 | Anomaly DetectionSynthetic Data Generation | —Unverified | 0 |
| Generating Realistic Synthetic Head Rotation Data for Extended Reality using Deep Learning | Jan 15, 2025 | Generative Adversarial NetworkSynthetic Data Generation | —Unverified | 0 |
| Quantum Down Sampling Filter for Variational Auto-encoder | Jan 9, 2025 | DecoderImage Reconstruction | CodeCode Available | 0 |
| Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though | Jan 8, 2025 | Synthetic Data Generation | —Unverified | 0 |
| User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation | Jan 8, 2025 | Synthetic Data GenerationUser Simulation | —Unverified | 0 |
| Reading with Intent -- Neutralizing Intent | Jan 7, 2025 | RAGRetrieval-augmented Generation | —Unverified | 0 |
| Advancing the Understanding of Fine-Grained 3D Forest Structures using Digital Cousins and Simulation-to-Reality: Methods and Datasets | Jan 7, 2025 | Data Augmentationparameter estimation | —Unverified | 0 |
| SMIR: Efficient Synthetic Data Pipeline To Improve Multi-Image Reasoning | Jan 7, 2025 | DescriptiveSynthetic Data Generation | CodeCode Available | 0 |
| License Plate Images Generation with Diffusion Models | Jan 6, 2025 | License Plate RecognitionSynthetic Data Generation | —Unverified | 0 |
| Can Synthetic Data be Fair and Private? A Comparative Study of Synthetic Data Generation and Fairness Algorithms | Jan 3, 2025 | FairnessSynthetic Data Generation | —Unverified | 0 |
| Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation | Jan 3, 2025 | Synthetic Data Generation | CodeCode Available | 0 |
| Time Series Language Model for Descriptive Caption Generation | Jan 3, 2025 | Caption GenerationDenoising | —Unverified | 0 |
| SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy | Dec 30, 2024 | Privacy PreservingSynthetic Data Generation | —Unverified | 0 |
| TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data | Dec 27, 2024 | In-Context LearningKnowledge Base Question Answering | —Unverified | 0 |
| OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis | Dec 27, 2024 | DiversitySynthetic Data Generation | CodeCode Available | 3 |
| Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation | Dec 26, 2024 | Dichotomous Image SegmentationImage Segmentation | —Unverified | 0 |
| HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge Distillation | Dec 24, 2024 | Computational EfficiencyHandwritten Text Recognition | CodeCode Available | 0 |
| Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner | Dec 24, 2024 | Autonomous DrivingDataset Generation | CodeCode Available | 1 |
| Autonomous Crack Detection using Deep Learning on Synthetic Thermogram Datasets | Dec 21, 2024 | Data AugmentationDeep Learning | —Unverified | 0 |
| Leveraging Contrastive Learning for Semantic Segmentation with Consistent Labels Across Varying Appearances | Dec 21, 2024 | Contrastive LearningDomain Adaptation | —Unverified | 0 |
| Stochastic Model of siRNA Endosomal Escape Mediated by Fusogenic Peptides in OVCAR-3 | Dec 20, 2024 | Bayesian InferenceImage Segmentation | CodeCode Available | 0 |
| Improving Equity in Health Modeling with GPT4-Turbo Generated Synthetic Data: A Comparative Study | Dec 20, 2024 | FairnessSynthetic Data Generation | —Unverified | 0 |
| Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation | Dec 20, 2024 | DenoisingFederated Learning | —Unverified | 0 |
| Using matrix-product states for time-series machine learning | Dec 20, 2024 | AstronomyImputation | CodeCode Available | 1 |
| ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis | Dec 19, 2024 | Data AugmentationSynthetic Data Generation | CodeCode Available | 1 |
| High-throughput digital twin framework for predicting neurite deterioration using MetaFormer attention | Dec 18, 2024 | Synthetic Data Generation | —Unverified | 0 |
| A Systematic Examination of Preference Learning through the Lens of Instruction-Following | Dec 18, 2024 | Instruction FollowingSynthetic Data Generation | —Unverified | 0 |
| Synthetic Data Generation for Anomaly Detection on Table Grapes | Dec 17, 2024 | Anomaly DetectionClassification | CodeCode Available | 0 |
| Auto-Cypher: Improving LLMs on Cypher generation via LLM-supervised generation-verification framework | Dec 17, 2024 | Knowledge GraphsSynthetic Data Generation | —Unverified | 0 |