| CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | May 23, 2025 | DiagnosticQuestion Answering | CodeCode Available | 0 |
| A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer | May 23, 2025 | DiagnosticMRI classification | —Unverified | 0 |
| Diffusion Classifiers Understand Compositionality, but Conditions Apply | May 23, 2025 | Diagnostic | CodeCode Available | 0 |
| Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment | May 23, 2025 | Anatomycounterfactual | —Unverified | 0 |
| Anatomy-Guided Multitask Learning for MRI-Based Classification of Placenta Accreta Spectrum and its Subtypes | May 23, 2025 | AnatomyBinary Classification | —Unverified | 0 |
| ConnectomeDiffuser: Generative AI Enables Brain Network Construction from Diffusion Tensor Imaging | May 23, 2025 | Diagnostic | —Unverified | 0 |
| DECT-based Space-Squeeze Method for Multi-Class Classification of Metastatic Lymph Nodes in Breast Cancer | May 23, 2025 | DiagnosticMulti-class Classification | CodeCode Available | 0 |
| More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models | May 23, 2025 | DiagnosticHallucination | —Unverified | 0 |
| Reverse-Speech-Finder: A Neural Network Backtracking Architecture for Generating Alzheimer's Disease Speech Samples and Improving Diagnosis Performance | May 23, 2025 | Diagnostic | —Unverified | 0 |
| Promptable cancer segmentation using minimal expert-curated data | May 23, 2025 | DiagnosticSegmentation | CodeCode Available | 0 |
| WiNGPT-3.0 Technical Report | May 23, 2025 | DiagnosticMedQA | CodeCode Available | 0 |
| Harry Potter is Still Here! Probing Knowledge Leakage in Targeted Unlearned Large Language Models via Automated Adversarial Prompting | May 22, 2025 | Diagnostic | —Unverified | 0 |
| Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models | May 22, 2025 | Diagnostic | CodeCode Available | 0 |
| MedFrameQA: A Multi-Image Medical VQA Benchmark for Clinical Reasoning | May 22, 2025 | DiagnosticVisual Question Answering (VQA) | —Unverified | 0 |
| A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP | May 22, 2025 | Continual PretrainingDiagnostic | CodeCode Available | 0 |
| KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models | May 22, 2025 | BenchmarkingDiagnostic | —Unverified | 0 |
| SMART: Self-Generating and Self-Validating Multi-Dimensional Assessment for LLMs' Mathematical Problem Solving | May 22, 2025 | DiagnosticMathematical Problem-Solving | —Unverified | 0 |
| Benchmarking Chest X-ray Diagnosis Models Across Multinational Datasets | May 21, 2025 | BenchmarkingDiagnostic | —Unverified | 0 |
| Comprehensive Lung Disease Detection Using Deep Learning Models and Hybrid Chest X-ray Data with Explainable AI | May 21, 2025 | DiagnosticTransfer Learning | —Unverified | 0 |
| Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs | May 21, 2025 | DiagnosticFairness | —Unverified | 0 |
| Neural Quantum Digital Twins for Optimizing Quantum Annealing | May 21, 2025 | Combinatorial OptimizationDiagnostic | —Unverified | 0 |
| Non-rigid Motion Correction for MRI Reconstruction via Coarse-To-Fine Diffusion Models | May 21, 2025 | DenoisingDiagnostic | —Unverified | 0 |
| Unified Cross-Modal Attention-Mixer Based Structural-Functional Connectomics Fusion for Neuropsychiatric Disorder Diagnosis | May 21, 2025 | DiagnosticMultimodal Deep Learning | —Unverified | 0 |
| A Linear Approach to Data Poisoning | May 21, 2025 | Data PoisoningDiagnostic | —Unverified | 0 |
| Better Safe Than Sorry? Overreaction Problem of Vision Language Models in Visual Emergency Recognition | May 21, 2025 | Diagnostic | CodeCode Available | 0 |