| ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease Diagnosis | Feb 16, 2025 | DiagnosticRhythm | CodeCode Available | 1 |
| Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models | Feb 12, 2025 | AttributeDiagnostic | CodeCode Available | 1 |
| SurGen: 1020 H&E-stained Whole Slide Images With Survival and Genetic Markers | Feb 7, 2025 | Diagnosticwhole slide images | CodeCode Available | 1 |
| Prostate-Specific Foundation Models for Enhanced Detection of Clinically Significant Cancer | Feb 1, 2025 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion | Jan 28, 2025 | DiagnosticImage Generation | CodeCode Available | 1 |
| MedFILIP: Medical Fine-grained Language-Image Pre-training | Jan 18, 2025 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| MechIR: A Mechanistic Interpretability Framework for Information Retrieval | Jan 17, 2025 | DiagnosticInformation Retrieval | CodeCode Available | 1 |
| O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning | Jan 11, 2025 | Decision MakingDiagnostic | CodeCode Available | 1 |
| DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information | Jan 10, 2025 | BenchmarkingData Augmentation | CodeCode Available | 1 |
| Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis | Jan 6, 2025 | AttributeDiagnostic | CodeCode Available | 1 |