| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari Games, Board Games | Code Available | 0 | 5 |
| Dynamical Linear Bandits | Nov 16, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Jun 7, 2023 | Active Learning, Decision Making | Code Available | 0 | 5 |
| Automaton-Guided Curriculum Generation for Reinforcement Learning Agents | Apr 11, 2023 | Decision Making, Q-Learning | Code Available | 0 | 5 |
| Combining Experimental and Historical Data for Policy Evaluation | Jun 1, 2024 | Data Integration, Decision Making | Code Available | 0 | 5 |
| Common Benchmarks Undervalue the Generalization Power of Programmatic Policies | Jun 17, 2025 | Sequential Decision Making | Code Available | 0 | 5 |
| Achieving Long-Term Fairness in Sequential Decision Making | Apr 4, 2022 | Decision Making, Fairness | Code Available | 0 | 5 |
| Differential Privacy in Cooperative Multiagent Planning | Jan 20, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |
| Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks | Mar 9, 2023 | Decision Making, Multi-Armed Bandits | Code Available | 0 | 5 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 | 5 |