| Learning Multi-Level Hierarchies with Hindsight | Dec 4, 2017 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 1 | 5 |
| CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Sep 23, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 | 5 |
| RELIEF: Reinforcement Learning Empowered Graph Feature Prompt Tuning | Aug 6, 2024 | Combinatorial OptimizationGraph Neural Network | CodeCode Available | 1 | 5 |
| Counterfactual Explanations in Sequential Decision Making Under Uncertainty | Jul 6, 2021 | counterfactualCounterfactual Explanation | CodeCode Available | 1 | 5 |
| Masked Trajectory Models for Prediction, Representation, and Control | May 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| An Alternative Softmax Operator for Reinforcement Learning | Dec 16, 2016 | Decision Makingreinforcement-learning | CodeCode Available | 1 | 5 |
| Dynamic Causal Bayesian Optimization | Oct 26, 2021 | Bayesian OptimizationCausal Inference | CodeCode Available | 1 | 5 |
| Extracting Reward Functions from Diffusion Models | Jun 1, 2023 | Decision MakingImage Generation | CodeCode Available | 1 | 5 |
| Multi-task Causal Learning with Gaussian Processes | Sep 27, 2020 | Active LearningBayesian Optimization | CodeCode Available | 1 | 5 |
| Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis | Jan 30, 2024 | Decision MakingEfficient Exploration | CodeCode Available | 1 | 5 |