| Nemotron-4 340B Technical Report | Jun 17, 2024 | Synthetic Data Generation | Code Available | 4 |
| MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data | Jun 15, 2024 | Generative Adversarial Network, Language Modeling | Unverified | 0 |
| GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR | Jun 15, 2024 | Autonomous Driving, Depth Estimation | Unverified | 0 |
| RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics | Jun 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey | Jun 14, 2024 | Synthetic Data Generation | Unverified | 0 |
| Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming | Jun 14, 2024 | Benchmarking, General Knowledge | Unverified | 0 |
| SimGen: Simulator-conditioned Driving Scene Generation | Jun 13, 2024 | Autonomous Driving, Data Augmentation | Unverified | 0 |
| A Synthetic Dataset for Personal Attribute Inference | Jun 11, 2024 | Attribute, Author Profiling | Code Available | 2 |
| Curating Grounded Synthetic Data with Global Perspectives for Equitable AI | Jun 10, 2024 | Articles, Diversity | Unverified | 0 |
| SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection | Jun 10, 2024 | Synthetic Data Generation, text-classification | Unverified | 0 |
| DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection | Jun 10, 2024 | Synthetic Data Generation | Unverified | 0 |
| Enhancing human action recognition with GAN-based data augmentation | Jun 7, 2024 | Action Recognition, Data Augmentation | Code Available | 0 |
| CTSyn: A Foundational Model for Cross Tabular Data Generation | Jun 7, 2024 | Diversity, Synthetic Data Generation | Unverified | 0 |
| Enhancing Indoor Temperature Forecasting through Synthetic Data in Low-Data Environments | Jun 7, 2024 | Data Augmentation, Synthetic Data Generation | Unverified | 0 |
| Synthetic Oversampling: Theory and A Practical Approach Using LLMs to Address Data Imbalance | Jun 5, 2024 | Data Augmentation, imbalanced classification | Code Available | 0 |
| Tiny models from tiny data: Textual and null-text inversion for few-shot distillation | Jun 5, 2024 | Few-Shot Image Classification, image-classification | Code Available | 0 |
| ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction | Jun 5, 2024 | Grammatical Error Correction, Synthetic Data Generation | Unverified | 0 |
| Synthetic Data Outliers: Navigating Identity Disclosure | Jun 4, 2024 | Synthetic Data Generation | Unverified | 0 |
| Enhancing Clinical Documentation with Synthetic Data: Leveraging Generative Models for Improved Accuracy | Jun 3, 2024 | Synthetic Data Generation | Unverified | 0 |
| Synthetic Data Generation for 3D Myocardium Deformation Analysis | Jun 3, 2024 | Optical Flow Estimation, Synthetic Data Generation | Code Available | 0 |
| GenPalm: Contactless Palmprint Generation with Diffusion Models | Jun 1, 2024 | Synthetic Data Generation | Unverified | 0 |
| MegActor: Harness the Power of Raw Video for Vivid Portrait Animation | May 31, 2024 | Portrait Animation, Style Transfer | Code Available | 4 |
| Masked Language Modeling Becomes Conditional Density Estimation for Tabular Data Synthesis | May 31, 2024 | Density Estimation, Imputation | Unverified | 0 |
| Leveraging Open-Source Large Language Models for encoding Social Determinants of Health using an Intelligent Router | May 30, 2024 | Language Modelling, Synthetic Data Generation | Unverified | 0 |
| Differentially Private Synthetic Data Generation for Relational Databases | May 29, 2024 | Synthetic Data Generation | Code Available | 0 |
| Interpretable classification of wiki-review streams | May 28, 2024 | Articles, Classification | Unverified | 0 |
| Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark | May 27, 2024 | Diversity, MTEB Benchmark | Unverified | 0 |
| Conditioning on Time is All You Need for Synthetic Survival Data Generation | May 27, 2024 | All, Fairness | Code Available | 0 |
| NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models | May 27, 2024 | Information Retrieval, Language Modelling | Unverified | 0 |
| KiNETGAN: Enabling Distributed Network Intrusion Detection through Knowledge-Infused Synthetic Data Generation | May 26, 2024 | Anomaly Detection, Generative Adversarial Network | Unverified | 0 |
| Synthetic Data Generation for Intersectional Fairness by Leveraging Hierarchical Group Structure | May 23, 2024 | Data Augmentation, Fairness | Unverified | 0 |
| Diverse and Effective Synthetic Data Generation for Adaptable Zero-Shot Dialogue State Tracking | May 21, 2024 | Dialogue State Tracking, Diversity | Unverified | 0 |
| End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music | May 20, 2024 | Synthetic Data Generation | Code Available | 2 |
| Advancing fNIRS Neuroimaging through Synthetic Data Generation and Machine Learning Applications | May 18, 2024 | Synthetic Data Generation | Unverified | 0 |
| SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation | May 16, 2024 | Bias Detection, Diversity | Code Available | 1 |
| Prompting-based Synthetic Data Generation for Few-Shot Question Answering | May 15, 2024 | Question Answering, Synthetic Data Generation | Code Available | 0 |
| Permissioned Blockchain-based Framework for Ranking Synthetic Data Generators | May 12, 2024 | Synthetic Data Generation | Unverified | 0 |
| Inference With Combining Rules From Multiple Differentially Private Synthetic Datasets | May 8, 2024 | Synthetic Data Generation | Unverified | 0 |
| Clustering of Disease Trajectories with Explainable Machine Learning: A Case Study on Postoperative Delirium Phenotypes | May 6, 2024 | Clustering, Feature Importance | Unverified | 0 |
| Comparative study of models trained on synthetic data for Ukrainian grammatical error correction | May 5, 2024 | Grammatical Error Correction, Machine Translation | Code Available | 0 |
| Domain-Transferred Synthetic Data Generation for Improving Monocular Depth Estimation | May 2, 2024 | Depth Estimation, Monocular Depth Estimation | Unverified | 0 |
| Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure | May 1, 2024 | Denoising, Metric Learning | Unverified | 0 |
| Online Data Augmentation for Forecasting with Deep Learning | Apr 25, 2024 | Data Augmentation, Deep Learning | Code Available | 0 |
| Privacy-Preserving Statistical Data Generation: Application to Sepsis Detection | Apr 25, 2024 | Privacy Preserving, Synthetic Data Generation | Unverified | 0 |
| Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language Models | Apr 23, 2024 | Conversational Question Answering, Dialogue State Tracking | Code Available | 1 |
| UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues | Apr 23, 2024 | 3D Human Pose Estimation, Multi-view 3D Human Pose Estimation | Unverified | 0 |
| Better Synthetic Data by Retrieving and Transforming Existing Datasets | Apr 22, 2024 | Dataset Generation, Diversity | Code Available | 7 |
| Bt-GAN: Generating Fair Synthetic Healthdata via Bias-transforming Generative Adversarial Networks | Apr 21, 2024 | Fairness, Synthetic Data Generation | Unverified | 0 |
| A Multi-Faceted Evaluation Framework for Assessing Synthetic Data Generated by Large Language Models | Apr 20, 2024 | Synthetic Data Generation | Code Available | 0 |
| Aligning Actions and Walking to LLM-Generated Textual Descriptions | Apr 18, 2024 | Action Recognition, Data Augmentation | Code Available | 0 |