| RoundTripOCR: A Data Generation Technique for Enhancing Post-OCR Error Correction in Low-Resource Devanagari Languages | Dec 14, 2024 | Machine Translation, Optical Character Recognition | Code Available | 0 |
| Generative Zoo | Dec 11, 2024 | Conditional Image Generation, Image Generation | Unverified | 0 |
| Bayesian Data Augmentation and Training for Perception DNN in Autonomous Aerial Vehicles | Dec 10, 2024 | Autonomous Vehicles, Bayesian Optimization | Code Available | 0 |
| Data Augmentation with Variational Autoencoder for Imbalanced Dataset | Dec 9, 2024 | Data Augmentation, regression | Code Available | 0 |
| Improving text-conditioned latent diffusion for cancer pathology | Dec 9, 2024 | GPU, Synthetic Data Generation | Code Available | 0 |
| CALICO: Conversational Agent Localization via Synthetic Data Generation | Dec 6, 2024 | Synthetic Data Generation, Translation | Unverified | 0 |
| A text-to-tabular approach to generate synthetic patient data using LLMs | Dec 6, 2024 | In-Context Learning, Synthetic Data Generation | Code Available | 0 |
| Give me Some Hard Questions: Synthetic Data Generation for Clinical QA | Dec 5, 2024 | Question Answering, Question Generation | Code Available | 0 |
| ALMA: Alignment with Minimal Annotation | Dec 5, 2024 | Few-Shot Learning, Language Modeling | Unverified | 0 |
| End to End Collaborative Synthetic Data Generation | Dec 4, 2024 | Privacy Preserving, Synthetic Data Generation | Unverified | 0 |
| Domain-Agnostic Stroke Lesion Segmentation Using Physics-Constrained Synthetic Data | Dec 4, 2024 | Lesion Segmentation, Quantitative MRI | Code Available | 0 |
| DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining | Dec 4, 2024 | Diagnostic, Specificity | Unverified | 0 |
| Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models | Dec 4, 2024 | Diversity, Out-of-Distribution Generalization | Unverified | 0 |
| SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models | Dec 3, 2024 | Dataset Generation, Image-to-Image Translation | Code Available | 1 |
| Energy-Based Modelling for Discrete and Mixed Data via Heat Equations on Structured Spaces | Dec 2, 2024 | Synthetic Data Generation | Unverified | 0 |
| MALT: Improving Reasoning with Multi-Agent LLM Training | Dec 2, 2024 | Common Sense Reasoning, GSM8K | Unverified | 0 |
| Enhancing Amyloid PET Quantification: MRI-Guided Super-Resolution Using Latent Diffusion Models | Dec 1, 2024 | MS-SSIM, SSIM | Code Available | 0 |
| Needle: A Generative AI-Powered Multi-modal Database for Answering Complex Natural Language Queries | Dec 1, 2024 | Contrastive Learning, Image Retrieval | Unverified | 0 |
| Well log data generation and imputation using sequence-based generative adversarial networks | Dec 1, 2024 | Imputation, Synthetic Data Generation | Unverified | 0 |
| LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes | Nov 30, 2024 | Autonomous Driving, counterfactual | Unverified | 0 |
| MAG-V: A Multi-Agent Framework for Synthetic Data Generation and Verification | Nov 28, 2024 | Synthetic Data Generation | Unverified | 0 |
| Enhancing Document AI Data Generation Through Graph-Based Synthetic Layouts | Nov 27, 2024 | Document AI, Document Classification | Unverified | 0 |
| Synthetic Data Generation with LLM for Improved Depression Prediction | Nov 26, 2024 | Depression Detection, Privacy Preserving | Unverified | 0 |
| High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR | Nov 24, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks | Nov 24, 2024 | Few-Shot Object Detection, Image Generation | Unverified | 0 |
| Beyond Data Scarcity: A Frequency-Driven Framework for Zero-Shot Forecasting | Nov 24, 2024 | Few-Shot Learning, Synthetic Data Generation | Unverified | 0 |
| Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai | Nov 23, 2024 | Diversity, Question Answering | Code Available | 1 |
| LLM for Barcodes: Generating Diverse Synthetic Data for Identity Documents | Nov 22, 2024 | Synthetic Data Generation | Unverified | 0 |
| Towards a framework on tabular synthetic data generation: a minimalist approach: theory, use cases, and limitations | Nov 17, 2024 | Decoder, Synthetic Data Generation | Unverified | 0 |
| Watermarking Generative Categorical Data | Nov 16, 2024 | Synthetic Data Generation | Unverified | 0 |
| Generation of synthetic gait data: application to multiple sclerosis patients' gait patterns | Nov 15, 2024 | Synthetic Data Generation | Unverified | 0 |
| Hierarchical Conditional Tabular GAN for Multi-Tabular Synthetic Data Generation | Nov 11, 2024 | Synthetic Data Generation | Unverified | 0 |
| DRIFTS: Optimizing Domain Randomization with Synthetic Data and Weight Interpolation for Fetal Brain Tissue Segmentation | Nov 11, 2024 | Domain Generalization, Image Segmentation | Unverified | 0 |
| Differential Privacy Under Class Imbalance: Methods and Empirical Insights | Nov 8, 2024 | Fraud Detection, Privacy Preserving | Unverified | 0 |
| Improved Multi-Task Brain Tumour Segmentation with Synthetic Data Augmentation | Nov 7, 2024 | Data Augmentation, Synthetic Data Generation | Code Available | 2 |
| BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages | Nov 7, 2024 | automatic-speech-translation, Synthetic Data Generation | Code Available | 1 |
| Debiasing Synthetic Data Generated by Deep Generative Models | Nov 6, 2024 | Synthetic Data Generation | Code Available | 0 |
| GUIDE-VAE: Advancing Data Generation with User Information and Pattern Dictionaries | Nov 6, 2024 | Imputation, Synthetic Data Generation | Code Available | 0 |
| DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | Nov 5, 2024 | Prompt Engineering, Synthetic Data Generation | Unverified | 0 |
| Enhancing Table Representations with LLM-powered Synthetic Data Generation | Nov 4, 2024 | Code Generation, Decision Making | Unverified | 0 |
| Retrieval-enriched zero-shot image classification in low-resource domains | Nov 1, 2024 | image-classification, Image Classification | Unverified | 0 |
| Scalable AI Framework for Defect Detection in Metal Additive Manufacturing | Nov 1, 2024 | Defect Detection, Denoising | Unverified | 0 |
| Clinical Evaluation of Medical Image Synthesis: A Case Study in Wireless Capsule Endoscopy | Oct 31, 2024 | Decision Making, Diagnostic | Unverified | 0 |
| Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities | Oct 31, 2024 | Face Recognition, Inference Attack | Unverified | 0 |
| Neural spell-checker: Beyond words with synthetic data generation | Oct 30, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| SoccerGuard: Investigating Injury Risk Factors for Professional Soccer Players with Machine Learning | Oct 29, 2024 | Synthetic Data Generation | Unverified | 0 |
| Synthetic Data Generation with Large Language Models for Personalized Community Question Answering | Oct 29, 2024 | Community Question Answering, Information Retrieval | Code Available | 0 |
| Unpicking Data at the Seams: VAEs, Disentanglement and Independent Components | Oct 29, 2024 | Disentanglement, Robust classification | Unverified | 0 |
| Evaluating utility in synthetic banking microdata applications | Oct 29, 2024 | Generative Adversarial Network, Synthetic Data Generation | Unverified | 0 |
| Large Language Model Benchmarks in Medical Tasks | Oct 28, 2024 | Image Captioning, Language Modeling | Unverified | 0 |