| RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation | Aug 15, 2024 | Diagnostic, RAG | Code Available | 5 |
| RaTEScore: A Metric for Radiology Report Generation | Jun 24, 2024 | Diagnostic, Entity Embeddings | Code Available | 4 |
| sbi reloaded: a toolkit for simulation-based inference workflows | Nov 26, 2024 | Bayesian Inference, Diagnostic | Code Available | 4 |
| Segment Anything in Medical Images | Apr 24, 2023 | Diagnostic, Image Segmentation | Code Available | 4 |
| Molecular-driven Foundation Model for Oncologic Pathology | Jan 28, 2025 | Benchmarking, Diagnostic | Code Available | 4 |
| A Demonstration of Adaptive Collaboration of Large Language Models for Medical Decision-Making | Oct 31, 2024 | Decision Making, Diagnostic | Code Available | 3 |
| Differentiable Voxel-based X-ray Rendering Improves Sparse-View 3D CBCT Reconstruction | Nov 28, 2024 | 3D Reconstruction, Diagnostic | Code Available | 3 |
| A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray Interpretation | Jan 22, 2024 | Benchmarking, Diagnostic | Code Available | 3 |
| Attention is not not Explanation | Aug 13, 2019 | Decision Making, Diagnostic | Code Available | 3 |
| MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models | Oct 16, 2024 | Diagnostic, Hallucination | Code Available | 3 |
| Robust and Efficient Medical Imaging with Self-Supervision | May 19, 2022 | Diagnostic, Representation Learning | Code Available | 3 |
| Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models | May 29, 2025 | Autonomous Driving, Diagnostic | Code Available | 3 |
| DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models | Feb 8, 2022 | Diagnostic, Image Captioning | Code Available | 3 |
| GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images | Mar 8, 2025 | cross-modal alignment, Diagnostic | Code Available | 3 |
| MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot | Feb 6, 2025 | Diagnostic, Large Language Model | Code Available | 3 |
| ECG-FM: An Open Electrocardiogram Foundation Model | Aug 9, 2024 | Contrastive Learning, Diagnostic | Code Available | 3 |
| A Practical Probabilistic Benchmark for AI Weather Models | Jan 27, 2024 | Diagnostic, Weather Forecasting | Code Available | 3 |
| A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Jun 3, 2025 | Decision Making, Diagnostic | Code Available | 3 |
| Ambiguous Medical Image Segmentation using Diffusion Models | Apr 10, 2023 | Diagnostic, Diversity | Code Available | 2 |
| CodeS: Towards Building Open-source Language Models for Text-to-SQL | Feb 26, 2024 | Data Augmentation, Diagnostic | Code Available | 2 |
| CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Jul 18, 2024 | Decision Making, Diagnostic | Code Available | 2 |
| ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model | Apr 13, 2025 | Diagnostic, Language Modeling | Code Available | 2 |
| AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator | Feb 15, 2024 | Benchmarking, Diagnostic | Code Available | 2 |
| Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Feb 25, 2025 | Decision Making, Diagnostic | Code Available | 2 |
| ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World | Jun 19, 2024 | Diagnostic, Multiple-choice | Code Available | 2 |