| Extracting Reward Functions from Diffusion Models | Jun 1, 2023 | Decision Making, Image Generation | Code Available | 1 |
| AdaPlanner: Adaptive Planning from Feedback with Language Models | May 26, 2023 | Decision Making, Hallucination | Code Available | 1 |
| Masked Trajectory Models for Prediction, Representation, and Control | May 4, 2023 | continuous-control, Continuous Control | Code Available | 1 |
| X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation | Apr 28, 2023 | Decision Making, Graph Neural Network | Code Available | 1 |
| TempoRL: laser pulse temporal shape optimization with Deep Reinforcement Learning | Apr 20, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Variational Information Pursuit for Interpretable Predictions | Feb 6, 2023 | Decision Making, Medical Diagnosis | Code Available | 1 |
| Risk-Sensitive Policy with Distributional Reinforcement Learning | Dec 30, 2022 | Decision Making, Distributional Reinforcement Learning | Code Available | 1 |
| Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems | Dec 15, 2022 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems | Dec 14, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| UniMASK: Unified Inference in Sequential Decision Problems | Nov 20, 2022 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Markup-to-Image Diffusion Models with Scheduled Sampling | Oct 11, 2022 | Decision Making, Denoising | Code Available | 1 |
| Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization | Oct 3, 2022 | Decision Making, Policy Gradient Methods | Code Available | 1 |
| Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling | Jul 9, 2022 | Bayesian Optimization, Decision Making | Code Available | 1 |
| Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply Chains | Apr 20, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| The Sandbox Environment for Generalizable Agent Research (SEGAR) | Mar 19, 2022 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Curriculum-based Reinforcement Learning for Distribution System Critical Load Restoration | Mar 8, 2022 | Decision Making, reinforcement-learning | Code Available | 1 |
| Deep Reinforcement Learning for Entity Alignment | Mar 7, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification | Dec 1, 2021 | Decision Making, Diagnostic | Code Available | 1 |
| RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning | Nov 4, 2021 | Decision Making, Imitation Learning | Code Available | 1 |
| Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning | Oct 27, 2021 | Decision Making, Imitation Learning | Code Available | 1 |
| Dynamic Causal Bayesian Optimization | Oct 26, 2021 | Bayesian Optimization, Causal Inference | Code Available | 1 |
| Medical Dead-ends and Learning to Identify High-risk States and Treatments | Oct 8, 2021 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Counterfactual Explanations in Sequential Decision Making Under Uncertainty | Jul 6, 2021 | counterfactual, Counterfactual Explanation | Code Available | 1 |
| IQ-Learn: Inverse soft-Q Learning for Imitation | Jun 23, 2021 | Atari Games, Continuous Control | Code Available | 1 |
| The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation | Jun 8, 2021 | Benchmarking, Decision Making | Code Available | 1 |