| Benchmarking Large Language Model Volatility | Nov 26, 2023 | BenchmarkingDecision Making | —Unverified | 0 |
| Bridging Medical Data Inference to Achilles Tendon Rupture Rehabilitation | Dec 7, 2016 | Collaborative FilteringDecision Making | —Unverified | 0 |
| Benchmarking PathCLIP for Pathology Image Analysis | Jan 5, 2024 | BenchmarkingDecision Making | —Unverified | 0 |
| Building Intelligent Autonomous Navigation Agents | Jun 25, 2021 | Autonomous NavigationDecision Making | —Unverified | 0 |
| Can large language models explore in-context? | Mar 22, 2024 | Decision Making | —Unverified | 0 |
| Benchmarking the Robustness of Panoptic Segmentation for Automated Driving | Feb 23, 2024 | BenchmarkingDecision Making | —Unverified | 0 |
| Benchmarking Twitter Sentiment Analysis Tools | May 1, 2014 | BenchmarkingDecision Making | —Unverified | 0 |
| Benchmarking Waitlist Mortality Prediction in Heart Transplantation Through Time-to-Event Modeling using New Longitudinal UNOS Dataset | Jul 9, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| A Survey on Explainable Deep Reinforcement Learning | Feb 8, 2025 | Adversarial RobustnessDecision Making | —Unverified | 0 |
| A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback | Nov 20, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |