| Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical Diagnosis | Oct 15, 2023 | AnatomyComputed Tomography (CT) | CodeCode Available | 1 |
| Adaptive Conformal Predictions for Time Series | Feb 15, 2022 | Conformal PredictionDecision Making | CodeCode Available | 1 |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision MakingOffline RL | CodeCode Available | 1 |
| From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models | Nov 21, 2023 | Decision Making | CodeCode Available | 1 |
| From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making | May 15, 2025 | Decision Making | CodeCode Available | 1 |
| AvalonBench: Evaluating LLMs Playing the Game of Avalon | Oct 8, 2023 | Decision Making | CodeCode Available | 1 |
| From Attribution Maps to Human-Understandable Explanations through Concept Relevance Propagation | Jun 7, 2022 | Decision MakingExplainable artificial intelligence | CodeCode Available | 1 |
| Frustum-PointPillars: A Multi-Stage Approach for 3D Object Detection using RGB Camera and LiDAR | Oct 11, 2021 | 2D Object Detection3D Object Detection | CodeCode Available | 1 |
| Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs | Jun 22, 2023 | Arithmetic ReasoningBenchmarking | CodeCode Available | 1 |
| CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Sep 23, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |