| Contextual Bandits with Large Action Spaces: Made Practical | Jul 12, 2022 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning | Jan 27, 2020 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control | Apr 10, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation | Jul 30, 2019 | Decision MakingLearning-To-Rank | CodeCode Available | 0 |
| Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control | Sep 4, 2019 | Decision MakingOpen-Ended Question Answering | CodeCode Available | 0 |
| Towards Trustworthy GUI Agents: A Survey | Mar 30, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Imitation Learning from Purified Demonstrations | Oct 11, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Reward Machines for Deep RL in Noisy and Uncertain Environments | May 31, 2024 | counterfactualDecision Making | CodeCode Available | 0 |
| Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaR | Dec 3, 2019 | Decision MakingReinforcement Learning | CodeCode Available | 0 |
| Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal | May 23, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |