| Emergent Risk Awareness in Rational Agents under Resource Constraints | May 29, 2025 | Sequential Decision Making | —Unverified | 0 |
| Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing | May 27, 2025 | Sequential Decision Making | —Unverified | 0 |
| Variational Deep Learning via Implicit Regularization | May 26, 2025 | Deep LearningInductive Bias | —Unverified | 0 |
| Large Language Models for Planning: A Comprehensive and Systematic Survey | May 26, 2025 | Logical ReasoningNavigate | CodeCode Available | 1 |
| DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation | May 24, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | May 23, 2025 | Motion PlanningSequential Decision Making | —Unverified | 0 |
| Reward Is Enough: LLMs Are In-Context Reinforcement Learners | May 21, 2025 | Large Language ModelReinforcement Learning (RL) | —Unverified | 0 |
| Web-Shepherd: Advancing PRMs for Reinforcing Web Agents | May 21, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 2 |
| Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation | May 20, 2025 | Computational Efficiencycontinuous-control | CodeCode Available | 0 |
| LLINBO: Trustworthy LLM-in-the-Loop Bayesian Optimization | May 20, 2025 | Bayesian OptimizationGaussian Processes | CodeCode Available | 1 |