| Hindsight and Sequential Rationality of Correlated Play | Dec 10, 2020 | counterfactual, Decision Making | Code Available | 0 |
| Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Jul 21, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Model-Free Episodic Control | Jun 14, 2016 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Hindsight Learning for MDPs with Exogenous Inputs | Jul 13, 2022 | counterfactual, Decision Making | Code Available | 0 |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Jun 7, 2023 | Active Learning, Decision Making | Code Available | 0 |
| Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making | May 27, 2023 | Adversarial Attack, Decision Making | Code Available | 0 |
| DeLF: Designing Learning Environments with Foundation Models | Jan 17, 2024 | Decision Making, Reinforcement Learning (RL) | Code Available | 0 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action Generation, Decision Making | Code Available | 0 |
| How Should We Represent History in Interpretable Models of Clinical Policies? | Dec 10, 2024 | Decision Making, Representation Learning | Code Available | 0 |
| What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement Learning | Apr 16, 2024 | Attribute, counterfactual | Code Available | 0 |
| Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives | Dec 16, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Apr 10, 2024 | Decision Making, Meta Reinforcement Learning | Code Available | 0 |
| Adaptive Sequence Submodularity | Feb 15, 2019 | Decision Making, Link Prediction | Code Available | 0 |
| Policy Learning for Malaria Control | Oct 20, 2019 | Bayesian Optimization, Decision Making | Code Available | 0 |
| POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance | Jul 16, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits | Oct 25, 2022 | Decision Making, Experimental Design | Code Available | 0 |
| Sequential Monte Carlo Bandits | Aug 8, 2018 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Reward Design for Justifiable Sequential Decision-Making | Feb 24, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Deep Variational Reinforcement Learning for POMDPs | Jun 6, 2018 | Decision Making, Inductive Bias | Code Available | 0 |
| Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning | May 15, 2020 | Decision Making, Reinforcement Learning (RL) | Code Available | 0 |
| Contextual Bandits with Large Action Spaces: Made Practical | Jul 12, 2022 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning | Jan 27, 2020 | Computational Efficiency, Decision Making | Code Available | 0 |
| Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control | Apr 10, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation | Jul 30, 2019 | Decision Making, Learning-To-Rank | Code Available | 0 |
| Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control | Sep 4, 2019 | Decision Making, Open-Ended Question Answering | Code Available | 0 |
| Towards Trustworthy GUI Agents: A Survey | Mar 30, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Imitation Learning from Purified Demonstrations | Oct 11, 2023 | Decision Making, Imitation Learning | Code Available | 0 |
| Reward Machines for Deep RL in Noisy and Uncertain Environments | May 31, 2024 | counterfactual, Decision Making | Code Available | 0 |
| Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaR | Dec 3, 2019 | Decision Making, Reinforcement Learning | Code Available | 0 |
| Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal | May 23, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Causal Explanations for Sequential Decision-Making in Multi-Agent Systems | Feb 21, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 0 |
| Preserving the Privacy of Reward Functions in MDPs through Deception | Jul 13, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Risk-Aware Continuous Control with Neural Contextual Bandits | Dec 15, 2023 | continuous-control, Continuous Control | Code Available | 0 |
| Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks | Mar 9, 2023 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Improving Generalization in Reinforcement Learning Training Regimes for Social Robot Navigation | Aug 29, 2023 | Decision Making, Navigate | Code Available | 0 |
| Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients | Jun 21, 2024 | Decision Making, Management | Code Available | 0 |
| Bridging by Word: Image Grounded Vocabulary Construction for Visual Captioning | Jul 1, 2019 | Decision Making, Image Captioning | Code Available | 0 |
| Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Dec 29, 2017 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch | Jun 12, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Decomposition Methods with Deep Corrections for Reinforcement Learning | Feb 6, 2018 | Autonomous Driving, Decision Making | Code Available | 0 |
| WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies | Feb 26, 2025 | Decision Making, Management | Code Available | 0 |
| Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Dec 24, 2024 | Deep Reinforcement Learning, Imputation | Code Available | 0 |
| Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs | Mar 29, 2023 | Active Learning, Decision Making | Code Available | 0 |
| Automaton-Guided Curriculum Generation for Reinforcement Learning Agents | Apr 11, 2023 | Decision Making, Q-Learning | Code Available | 0 |
| Information-Theoretic Safe Exploration with Gaussian Processes | Dec 9, 2022 | Decision Making, Gaussian Processes | Code Available | 0 |
| PlayBest: Professional Basketball Player Behavior Synthesis via Planning with Diffusion | Jun 7, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Instance Temperature Knowledge Distillation | Jun 27, 2024 | Decision Making, Efficient Exploration | Code Available | 0 |
| Promises and Pitfalls of the Linearized Laplace in Bayesian Optimization | Apr 17, 2023 | Bayesian Optimization, Decision Making | Code Available | 0 |
| Agent-State Construction with Auxiliary Inputs | Nov 15, 2022 | Decision Making, reinforcement-learning | Code Available | 0 |
| Thompson Sampling via Local Uncertainty | Oct 30, 2019 | Decision Making, Multi-Armed Bandits | Code Available | 0 |