| MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models | Jun 9, 2025 | DiagnosticHallucination | CodeCode Available | 1 |
| C3S3: Complementary Competition and Contrastive Selection for Semi-Supervised Medical Image Segmentation | Jun 9, 2025 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models | Jun 5, 2025 | DiagnosticHallucination | CodeCode Available | 1 |
| Adaptive Differential Denoising for Respiratory Sounds Classification | Jun 3, 2025 | Audio ClassificationClassification | CodeCode Available | 1 |
| DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis? | May 30, 2025 | DiagnosticMedical Image Analysis | CodeCode Available | 1 |
| How does Transformer Learn Implicit Reasoning? | May 29, 2025 | ClusteringDiagnostic | CodeCode Available | 1 |
| Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning | May 29, 2025 | DiagnosticQuestion Answering | CodeCode Available | 1 |
| Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge | May 28, 2025 | Depression DetectionDiagnostic | CodeCode Available | 1 |
| Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering | May 25, 2025 | AnatomyBenchmarking | CodeCode Available | 1 |
| Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs | May 22, 2025 | DiagnosticMachine Unlearning | CodeCode Available | 1 |