| Learning to Reach Goals via Diffusion | Oct 4, 2023 | Computational Efficiency, Decision Making | Code Available | 0 |
| Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | Oct 4, 2023 | Decision Making, Policy Gradient Methods | Unverified | 0 |
| Towards a Unified Framework for Sequential Decision Making | Oct 3, 2023 | Bayesian Inference, Decision Making | Unverified | 0 |
| Learning to Make Adherence-Aware Advice | Oct 1, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| TraCE: Trajectory Counterfactual Explanation Scores | Sep 27, 2023 | counterfactual, Counterfactual Explanation | Code Available | 0 |
| State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding | Sep 21, 2023 | Decision Making, Self-Learning | Unverified | 0 |
| Delays in Reinforcement Learning | Sep 20, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Safe POMDP Online Planning via Shielding | Sep 19, 2023 | Autonomous Driving, Decision Making | Unverified | 0 |
| Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback | Sep 16, 2023 | Active Learning, Decision Making | Code Available | 0 |
| Efficient quantum recurrent reinforcement learning via quantum reservoir computing | Sep 13, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |