| RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports | May 23, 2024 | DiagnosticMulti-Label Classification | CodeCode Available | 2 |
| EchoTracker: Advancing Myocardial Point Tracking in Echocardiography | May 14, 2024 | DiagnosticMotion Estimation | CodeCode Available | 2 |
| A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law | May 2, 2024 | DiagnosticEthics | CodeCode Available | 2 |
| Vim4Path: Self-Supervised Vision Mamba for Histopathology Images | Apr 20, 2024 | DiagnosticMamba | CodeCode Available | 2 |
| DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology | Apr 7, 2024 | DiagnosticMultiple Instance Learning | CodeCode Available | 2 |
| Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement | Mar 11, 2024 | Clinical KnowledgeDescriptive | CodeCode Available | 2 |
| HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction | Mar 8, 2024 | DiagnosticMedical Report Generation | CodeCode Available | 2 |
| CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification | Feb 27, 2024 | ClassificationDiagnostic | CodeCode Available | 2 |
| CodeS: Towards Building Open-source Language Models for Text-to-SQL | Feb 26, 2024 | Data AugmentationDiagnostic | CodeCode Available | 2 |
| AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator | Feb 15, 2024 | BenchmarkingDiagnostic | CodeCode Available | 2 |
| Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram | Feb 2, 2024 | DiagnosticECG Classification | CodeCode Available | 2 |
| MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | Nov 28, 2023 | 3D Question Answering (3D-QA)Diagnostic | CodeCode Available | 2 |
| Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review | Nov 3, 2023 | Diagnostic | CodeCode Available | 2 |
| HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Oct 23, 2023 | DiagnosticHallucination | CodeCode Available | 2 |
| BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models | Sep 12, 2023 | DiagnosticNatural Language Understanding | CodeCode Available | 2 |
| Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling | Jul 16, 2023 | DiagnosticLanguage Modelling | CodeCode Available | 2 |
| Evaluating AI systems under uncertain ground truth: a case study in dermatology | Jul 5, 2023 | DiagnosticMedical Diagnosis | CodeCode Available | 2 |
| A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics | Jun 1, 2023 | DiagnosticRepresentation Learning | CodeCode Available | 2 |
| Perception Test: A Diagnostic Benchmark for Multimodal Video Models | May 23, 2023 | DiagnosticGrounded Video Question Answering | CodeCode Available | 2 |
| DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical Images | May 18, 2023 | Active LearningDiagnostic | CodeCode Available | 2 |
| Ambiguous Medical Image Segmentation using Diffusion Models | Apr 10, 2023 | DiagnosticDiversity | CodeCode Available | 2 |
| JANA: Jointly Amortized Neural Approximation of Complex Bayesian Models | Feb 17, 2023 | DiagnosticTime Series | CodeCode Available | 2 |
| Improving CLIP Fine-tuning Performance | Jan 1, 2023 | Diagnosticobject-detection | CodeCode Available | 2 |
| Perception Test: A Diagnostic Benchmark for Multimodal Models | Oct 19, 2022 | DiagnosticMultiple-choice | CodeCode Available | 2 |
| Retrieval Augmented Visual Question Answering with Outside Knowledge | Oct 7, 2022 | Answer GenerationDiagnostic | CodeCode Available | 2 |