| Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | May 20, 2024 | Atari GamesMamba | CodeCode Available | 0 |
| A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback | May 20, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System | May 19, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI | May 13, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments | May 13, 2024 | Decision MakingDiagnostic | —Unverified | 0 |
| Enhancing Q-Learning with Large Language Model Heuristics | May 6, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Learning Planning Abstractions from Language | May 6, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows | May 6, 2024 | Causal Inferencecounterfactual | —Unverified | 0 |
| MEXGEN: An Effective and Efficient Information Gain Approximation for Information Gathering Path Planning | May 4, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery | May 3, 2024 | Decision MakingInterpretable Machine Learning | —Unverified | 0 |