| Learning model-based planning from scratch | Jul 19, 2017 | Continuous Control | Code Available | 0 | 5 |
| End-to-End Goal-Driven Web Navigation | Feb 6, 2016 | Decision Making, Question Answering | Code Available | 0 | 5 |
| Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Apr 1, 2020 | Continuous Control | Code Available | 0 | 5 |
| Learning Structural Weight Uncertainty for Sequential Decision-Making | Dec 30, 2017 | Decision Making, Multi-Armed Bandits | Code Available | 0 | 5 |
| Learning to Follow Instructions in Text-Based Games | Nov 8, 2022 | Decision Making, Instruction Following | Code Available | 0 | 5 |
| Enforcing Almost-Sure Reachability in POMDPs | Jun 30, 2020 | Decision Making, reinforcement-learning | Code Available | 0 | 5 |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Nov 2, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Scalable Exploration via Ensemble++ | Jul 18, 2024 | Computational Efficiency, Decision Making | Code Available | 0 | 5 |
| Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients | Jun 21, 2024 | Decision Making, Management | Code Available | 0 | 5 |
| A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Mar 18, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| Enhancing the Accuracy and Fairness of Human Decision Making | May 25, 2018 | Decision Making, Fairness | Code Available | 0 | 5 |
| Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs | Dec 18, 2023 | Decision Making, Knowledge Graphs | Code Available | 0 | 5 |
| Causal Explanations for Sequential Decision-Making in Multi-Agent Systems | Feb 21, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 0 | 5 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections | May 24, 2022 | counterfactual, Decision Making | Code Available | 0 | 5 |
| Back to the Future -- Sequential Alignment of Text Representations | Sep 8, 2019 | Decision Making, Rumour Detection | Code Available | 0 | 5 |
| AVID: Adapting Video Diffusion Models to World Models | Oct 1, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| Classification with Costly Features as a Sequential Decision-Making Problem | Sep 5, 2019 | Classification, Classification with Costly Features | Code Available | 0 | 5 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | Classification, Classification with Costly Features | Code Available | 0 | 5 |
| Reinforcement Learning applied to Insurance Portfolio Pursuit | Aug 1, 2024 | Decision Making, reinforcement-learning | Code Available | 0 | 5 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games | Feb 13, 2021 | counterfactual, Decision Making | Code Available | 0 | 5 |
| Efficient Sequence Labeling with Actor-Critic Training | Sep 30, 2018 | Decision Making, NER | Code Available | 0 | 5 |
| Collaborative Comic Generation: Integrating Visual Narrative Theories with AI Models for Enhanced Creativity | Sep 25, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| Adversarial Environment Generation for Learning to Navigate the Web | Mar 2, 2021 | Benchmarking, Decision Making | Code Available | 0 | 5 |
| Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Feb 13, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying | Aug 21, 2023 | Decision Making, reinforcement-learning | Code Available | 0 | 5 |
| Common Benchmarks Undervalue the Generalization Power of Programmatic Policies | Jun 17, 2025 | Sequential Decision Making | Code Available | 0 | 5 |
| Adversarially Robust Decision Transformer | Jul 25, 2024 | Adversarial Robustness, Sequential Decision Making | Code Available | 0 | 5 |
| Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making | May 27, 2023 | Adversarial Attack, Decision Making | Code Available | 0 | 5 |
| Autoregressive Bandits | Dec 12, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari Games, Board Games | Code Available | 0 | 5 |
| Dynamical Linear Bandits | Nov 16, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| Dynamic Real-time Multimodal Routing with Hierarchical Hybrid Planning | Feb 5, 2019 | Decision Making, Decision Making Under Uncertainty | Code Available | 0 | 5 |
| Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Dec 24, 2024 | Deep Reinforcement Learning, Imputation | Code Available | 0 | 5 |
| Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning | Jan 27, 2020 | Computational Efficiency, Decision Making | Code Available | 0 | 5 |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Jun 7, 2023 | Active Learning, Decision Making | Code Available | 0 | 5 |
| Nonmyopic Global Optimisation via Approximate Dynamic Programming | Dec 6, 2024 | Bayesian Optimisation, Gaussian Processes | Code Available | 0 | 5 |
| Automaton-Guided Curriculum Generation for Reinforcement Learning Agents | Apr 11, 2023 | Decision Making, Q-Learning | Code Available | 0 | 5 |
| Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand | Apr 14, 2025 | Sequential Decision Making, Survival Analysis | Code Available | 0 | 5 |
| Achieving Long-Term Fairness in Sequential Decision Making | Apr 4, 2022 | Decision Making, Fairness | Code Available | 0 | 5 |
| Doubly Robust Off-policy Value Evaluation for Reinforcement Learning | Nov 11, 2015 | Decision Making, reinforcement-learning | Code Available | 0 | 5 |
| Contextual Bandits with Large Action Spaces: Made Practical | Jul 12, 2022 | Decision Making, Multi-Armed Bandits | Code Available | 0 | 5 |
| Doubly Robust Policy Evaluation and Optimization | Mar 10, 2015 | Decision Making, Multi-Armed Bandits | Code Available | 0 | 5 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari Games, Decision Making | Code Available | 0 | 5 |
| Online Decision Making with History-Average Dependent Costs (Extended) | Dec 11, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| Operator World Models for Reinforcement Learning | Jun 28, 2024 | Decision Making, reinforcement-learning | Code Available | 0 | 5 |
| Continuous Monte Carlo Graph Search | Oct 4, 2022 | Continuous Control | Code Available | 0 | 5 |
| Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical Systems | Feb 20, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| Parameterized Projected Bellman Operator | Dec 20, 2023 | Decision Making, Reinforcement Learning (RL) | Code Available | 0 | 5 |
| Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot Navigation | Mar 26, 2025 | Decision Making, Model-based Reinforcement Learning | Code Available | 0 | 5 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |