| Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy Approach | Nov 28, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Robust Anytime Learning of Markov Decision Processes | May 31, 2022 | Bayesian InferenceDecision Making | CodeCode Available | 0 |
| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari GamesBoard Games | CodeCode Available | 0 |
| Common Benchmarks Undervalue the Generalization Power of Programmatic Policies | Jun 17, 2025 | Sequential Decision Making | CodeCode Available | 0 |
| Accelerate Model Parallel Training by Using Efficient Graph Traversal Order in Device Placement | Jan 21, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying | Aug 21, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Combining Experimental and Historical Data for Policy Evaluation | Jun 1, 2024 | Data IntegrationDecision Making | CodeCode Available | 0 |
| Quantization-Free Autoregressive Action Transformer | Mar 18, 2025 | Imitation LearningQuantization | CodeCode Available | 0 |
| Deep Reinforcement Learning Algorithms for Option Hedging | Apr 7, 2025 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Deep Q-Network for Angry Birds | Oct 4, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |