| Healthsheet: Development of a Transparency Artifact for Health Datasets | Feb 26, 2022 | Diagnostic | Code Available | 2 |
| Automatic Depression Detection: An Emotional Audio-Textual Corpus and a GRU/BiLSTM-based Model | Feb 15, 2022 | Depression Detection, Diagnostic | Code Available | 2 |
| hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices | Mar 9, 2021 | BIG-bench Machine Learning, Diagnostic | Code Available | 2 |
| Generalized Word Shift Graphs: A Method for Visualizing and Explaining Pairwise Comparisons Between Texts | Aug 5, 2020 | Diagnostic | Code Available | 2 |
| On the limits of cross-domain generalization in automated X-ray prediction | Feb 6, 2020 | Diagnostic, Domain Generalization | Code Available | 2 |
| A System for Real-Time Interactive Analysis of Deep Learning Training | Jan 5, 2020 | 3D Action Recognition, Diagnostic | Code Available | 2 |
| Interpretable Counterfactual Explanations Guided by Prototypes | Jul 3, 2019 | counterfactual, Diagnostic | Code Available | 2 |
| LangMamba: A Language-driven Mamba Framework for Low-dose CT Denoising with Vision-language Models | Jul 8, 2025 | Denoising, Diagnostic | Code Available | 1 |
| MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration | Jun 24, 2025 | Diagnostic, Medical Diagnosis | Code Available | 1 |
| MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Jun 23, 2025 | Diagnostic, Large Language Model | Code Available | 1 |
| Probing the Robustness of Large Language Models Safety to Latent Perturbations | Jun 19, 2025 | Diagnostic, Safety Alignment | Code Available | 1 |
| Diffusion-based Counterfactual Augmentation: Towards Robust and Interpretable Knee Osteoarthritis Grading | Jun 18, 2025 | Clinical Knowledge, counterfactual | Code Available | 1 |
| Diffusion-Based Electrocardiography Noise Quantification via Anomaly Detection | Jun 13, 2025 | Anomaly Detection, Decision Making | Code Available | 1 |
| Towards Practical Alzheimer's Disease Diagnosis: A Lightweight and Interpretable Spiking Neural Model | Jun 11, 2025 | Diagnostic | Code Available | 1 |
| ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs | Jun 11, 2025 | Code Generation, Diagnostic | Code Available | 1 |
| MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models | Jun 9, 2025 | Diagnostic, Hallucination | Code Available | 1 |
| C3S3: Complementary Competition and Contrastive Selection for Semi-Supervised Medical Image Segmentation | Jun 9, 2025 | Contrastive Learning, Diagnostic | Code Available | 1 |
| Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models | Jun 5, 2025 | Diagnostic, Hallucination | Code Available | 1 |
| Adaptive Differential Denoising for Respiratory Sounds Classification | Jun 3, 2025 | Audio Classification, Classification | Code Available | 1 |
| DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis? | May 30, 2025 | Diagnostic, Medical Image Analysis | Code Available | 1 |
| How does Transformer Learn Implicit Reasoning? | May 29, 2025 | Clustering, Diagnostic | Code Available | 1 |
| Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning | May 29, 2025 | Diagnostic, Question Answering | Code Available | 1 |
| Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge | May 28, 2025 | Depression Detection, Diagnostic | Code Available | 1 |
| Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering | May 25, 2025 | Anatomy, Benchmarking | Code Available | 1 |
| Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs | May 22, 2025 | Diagnostic, Machine Unlearning | Code Available | 1 |