| Hindsight and Sequential Rationality of Correlated Play | Dec 10, 2020 | counterfactualDecision Making | CodeCode Available | 0 |
| Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Jul 21, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Model-Free Episodic Control | Jun 14, 2016 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Hindsight Learning for MDPs with Exogenous Inputs | Jul 13, 2022 | counterfactualDecision Making | CodeCode Available | 0 |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Jun 7, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making | May 27, 2023 | Adversarial AttackDecision Making | CodeCode Available | 0 |
| DeLF: Designing Learning Environments with Foundation Models | Jan 17, 2024 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action GenerationDecision Making | CodeCode Available | 0 |
| How Should We Represent History in Interpretable Models of Clinical Policies? | Dec 10, 2024 | Decision MakingRepresentation Learning | CodeCode Available | 0 |
| What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement Learning | Apr 16, 2024 | Attributecounterfactual | CodeCode Available | 0 |
| Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives | Dec 16, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Apr 10, 2024 | Decision MakingMeta Reinforcement Learning | CodeCode Available | 0 |
| Adaptive Sequence Submodularity | Feb 15, 2019 | Decision MakingLink Prediction | CodeCode Available | 0 |
| Policy Learning for Malaria Control | Oct 20, 2019 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance | Jul 16, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits | Oct 25, 2022 | Decision MakingExperimental Design | CodeCode Available | 0 |
| Sequential Monte Carlo Bandits | Aug 8, 2018 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Reward Design for Justifiable Sequential Decision-Making | Feb 24, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Deep Variational Reinforcement Learning for POMDPs | Jun 6, 2018 | Decision MakingInductive Bias | CodeCode Available | 0 |
| Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning | May 15, 2020 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Contextual Bandits with Large Action Spaces: Made Practical | Jul 12, 2022 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning | Jan 27, 2020 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control | Apr 10, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation | Jul 30, 2019 | Decision MakingLearning-To-Rank | CodeCode Available | 0 |
| Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control | Sep 4, 2019 | Decision MakingOpen-Ended Question Answering | CodeCode Available | 0 |