| RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports | May 23, 2024 | Diagnostic, Multi-Label Classification | Code Available | 2 |
| EchoTracker: Advancing Myocardial Point Tracking in Echocardiography | May 14, 2024 | Diagnostic, Motion Estimation | Code Available | 2 |
| A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law | May 2, 2024 | Diagnostic, Ethics | Code Available | 2 |
| Vim4Path: Self-Supervised Vision Mamba for Histopathology Images | Apr 20, 2024 | Diagnostic, Mamba | Code Available | 2 |
| DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology | Apr 7, 2024 | Diagnostic, Multiple Instance Learning | Code Available | 2 |
| Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement | Mar 11, 2024 | Clinical Knowledge, Descriptive | Code Available | 2 |
| HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction | Mar 8, 2024 | Diagnostic, Medical Report Generation | Code Available | 2 |
| CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification | Feb 27, 2024 | Classification, Diagnostic | Code Available | 2 |
| CodeS: Towards Building Open-source Language Models for Text-to-SQL | Feb 26, 2024 | Data Augmentation, Diagnostic | Code Available | 2 |
| AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator | Feb 15, 2024 | Benchmarking, Diagnostic | Code Available | 2 |
| Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram | Feb 2, 2024 | Diagnostic, ECG Classification | Code Available | 2 |
| MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | Nov 28, 2023 | 3D Question Answering (3D-QA), Diagnostic | Code Available | 2 |
| Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review | Nov 3, 2023 | Diagnostic | Code Available | 2 |
| HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Oct 23, 2023 | Diagnostic, Hallucination | Code Available | 2 |
| BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models | Sep 12, 2023 | Diagnostic, Natural Language Understanding | Code Available | 2 |
| Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling | Jul 16, 2023 | Diagnostic, Language Modelling | Code Available | 2 |
| Evaluating AI systems under uncertain ground truth: a case study in dermatology | Jul 5, 2023 | Diagnostic, Medical Diagnosis | Code Available | 2 |
| A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics | Jun 1, 2023 | Diagnostic, Representation Learning | Code Available | 2 |
| Perception Test: A Diagnostic Benchmark for Multimodal Video Models | May 23, 2023 | Diagnostic, Grounded Video Question Answering | Code Available | 2 |
| DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical Images | May 18, 2023 | Active Learning, Diagnostic | Code Available | 2 |
| Ambiguous Medical Image Segmentation using Diffusion Models | Apr 10, 2023 | Diagnostic, Diversity | Code Available | 2 |
| JANA: Jointly Amortized Neural Approximation of Complex Bayesian Models | Feb 17, 2023 | Diagnostic, Time Series | Code Available | 2 |
| Improving CLIP Fine-tuning Performance | Jan 1, 2023 | Diagnostic, object-detection | Code Available | 2 |
| Perception Test: A Diagnostic Benchmark for Multimodal Models | Oct 19, 2022 | Diagnostic, Multiple-choice | Code Available | 2 |
| Retrieval Augmented Visual Question Answering with Outside Knowledge | Oct 7, 2022 | Answer Generation, Diagnostic | Code Available | 2 |
| Healthsheet: Development of a Transparency Artifact for Health Datasets | Feb 26, 2022 | Diagnostic | Code Available | 2 |
| Automatic Depression Detection: An Emotional Audio-Textual Corpus and a GRU/BiLSTM-based Model | Feb 15, 2022 | Depression Detection, Diagnostic | Code Available | 2 |
| hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices | Mar 9, 2021 | BIG-bench Machine Learning, Diagnostic | Code Available | 2 |
| Generalized Word Shift Graphs: A Method for Visualizing and Explaining Pairwise Comparisons Between Texts | Aug 5, 2020 | Diagnostic | Code Available | 2 |
| On the limits of cross-domain generalization in automated X-ray prediction | Feb 6, 2020 | Diagnostic, Domain Generalization | Code Available | 2 |
| A System for Real-Time Interactive Analysis of Deep Learning Training | Jan 5, 2020 | 3D Action Recognition, Diagnostic | Code Available | 2 |
| Interpretable Counterfactual Explanations Guided by Prototypes | Jul 3, 2019 | counterfactual, Diagnostic | Code Available | 2 |
| LangMamba: A Language-driven Mamba Framework for Low-dose CT Denoising with Vision-language Models | Jul 8, 2025 | Denoising, Diagnostic | Code Available | 1 |
| MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration | Jun 24, 2025 | Diagnostic, Medical Diagnosis | Code Available | 1 |
| MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Jun 23, 2025 | Diagnostic, Large Language Model | Code Available | 1 |
| Probing the Robustness of Large Language Models Safety to Latent Perturbations | Jun 19, 2025 | Diagnostic, Safety Alignment | Code Available | 1 |
| Diffusion-based Counterfactual Augmentation: Towards Robust and Interpretable Knee Osteoarthritis Grading | Jun 18, 2025 | Clinical Knowledge, counterfactual | Code Available | 1 |
| Diffusion-Based Electrocardiography Noise Quantification via Anomaly Detection | Jun 13, 2025 | Anomaly Detection, Decision Making | Code Available | 1 |
| Towards Practical Alzheimer's Disease Diagnosis: A Lightweight and Interpretable Spiking Neural Model | Jun 11, 2025 | Diagnostic | Code Available | 1 |
| ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs | Jun 11, 2025 | Code Generation, Diagnostic | Code Available | 1 |
| MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models | Jun 9, 2025 | Diagnostic, Hallucination | Code Available | 1 |
| C3S3: Complementary Competition and Contrastive Selection for Semi-Supervised Medical Image Segmentation | Jun 9, 2025 | Contrastive Learning, Diagnostic | Code Available | 1 |
| Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models | Jun 5, 2025 | Diagnostic, Hallucination | Code Available | 1 |
| Adaptive Differential Denoising for Respiratory Sounds Classification | Jun 3, 2025 | Audio Classification, Classification | Code Available | 1 |
| DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis? | May 30, 2025 | Diagnostic, Medical Image Analysis | Code Available | 1 |
| How does Transformer Learn Implicit Reasoning? | May 29, 2025 | Clustering, Diagnostic | Code Available | 1 |
| Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning | May 29, 2025 | Diagnostic, Question Answering | Code Available | 1 |
| Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge | May 28, 2025 | Depression Detection, Diagnostic | Code Available | 1 |
| Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering | May 25, 2025 | Anatomy, Benchmarking | Code Available | 1 |
| Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs | May 22, 2025 | Diagnostic, Machine Unlearning | Code Available | 1 |