| Correctness Coverage Evaluation for Medical Multiple-Choice Question Answering Based on the Enhanced Conformal Prediction Framework | Mar 7, 2025 | Conformal PredictionMedical Question Answering | —Unverified | 0 | 0 |
| Structured Outputs Enable General-Purpose LLMs to be Medical Experts | Mar 5, 2025 | Clinical KnowledgeMedical Question Answering | —Unverified | 0 | 0 |
| Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study | Jun 3, 2024 | ChatbotLanguage Modeling | —Unverified | 0 | 0 |
| Task Specific Pruning with LLM-Sieve: How Many Parameters Does Your Task Really Need? | May 23, 2025 | Medical Question AnsweringQuantization | —Unverified | 0 | 0 |
| TCMD: A Traditional Chinese Medicine QA Dataset for Evaluating Large Language Models | Jun 7, 2024 | Medical Question AnsweringQuestion Answering | —Unverified | 0 | 0 |
| Towards Generalist Biomedical AI | Jul 26, 2023 | Medical Question AnsweringQuestion Answering | —Unverified | 0 | 0 |
| Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models | Aug 25, 2024 | Decision MakingHallucination | —Unverified | 0 | 0 |
| Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study | May 29, 2024 | Answer GenerationHallucination | —Unverified | 0 | 0 |
| Uncertainty Estimation of Large Language Models in Medical Question Answering | Jul 11, 2024 | Medical Question AnsweringQuestion Answering | —Unverified | 0 | 0 |
| Unifying Corroborative and Contributive Attributions in Large Language Models | Nov 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |