| Extracting Reward Functions from Diffusion Models | Jun 1, 2023 | Decision MakingImage Generation | CodeCode Available | 1 |
| AdaPlanner: Adaptive Planning from Feedback with Language Models | May 26, 2023 | Decision MakingHallucination | CodeCode Available | 1 |
| Masked Trajectory Models for Prediction, Representation, and Control | May 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation | Apr 28, 2023 | Decision MakingGraph Neural Network | CodeCode Available | 1 |
| TempoRL: laser pulse temporal shape optimization with Deep Reinforcement Learning | Apr 20, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Variational Information Pursuit for Interpretable Predictions | Feb 6, 2023 | Decision MakingMedical Diagnosis | CodeCode Available | 1 |
| Risk-Sensitive Policy with Distributional Reinforcement Learning | Dec 30, 2022 | Decision MakingDistributional Reinforcement Learning | CodeCode Available | 1 |
| Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems | Dec 15, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems | Dec 14, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| UniMASK: Unified Inference in Sequential Decision Problems | Nov 20, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| Markup-to-Image Diffusion Models with Scheduled Sampling | Oct 11, 2022 | Decision MakingDenoising | CodeCode Available | 1 |
| Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization | Oct 3, 2022 | Decision MakingPolicy Gradient Methods | CodeCode Available | 1 |
| Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling | Jul 9, 2022 | Bayesian OptimizationDecision Making | CodeCode Available | 1 |
| Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply Chains | Apr 20, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| The Sandbox Environment for Generalizable Agent Research (SEGAR) | Mar 19, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| Curriculum-based Reinforcement Learning for Distribution System Critical Load Restoration | Mar 8, 2022 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Deep Reinforcement Learning for Entity Alignment | Mar 7, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification | Dec 1, 2021 | Decision MakingDiagnostic | CodeCode Available | 1 |
| RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning | Nov 4, 2021 | Decision MakingImitation Learning | CodeCode Available | 1 |
| Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning | Oct 27, 2021 | Decision MakingImitation Learning | CodeCode Available | 1 |
| Dynamic Causal Bayesian Optimization | Oct 26, 2021 | Bayesian OptimizationCausal Inference | CodeCode Available | 1 |
| Medical Dead-ends and Learning to Identify High-risk States and Treatments | Oct 8, 2021 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| Counterfactual Explanations in Sequential Decision Making Under Uncertainty | Jul 6, 2021 | counterfactualCounterfactual Explanation | CodeCode Available | 1 |
| IQ-Learn: Inverse soft-Q Learning for Imitation | Jun 23, 2021 | Atari GamesContinuous Control | CodeCode Available | 1 |
| The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation | Jun 8, 2021 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem | Apr 22, 2021 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model | Feb 23, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 1 |
| An empirical evaluation of active inference in multi-armed bandits | Jan 21, 2021 | BIG-bench Machine LearningDecision Making | CodeCode Available | 1 |
| TimeSHAP: Explaining Recurrent Models through Sequence Perturbations | Nov 30, 2020 | Decision MakingFeature Importance | CodeCode Available | 1 |
| Adaptive Stress Testing of Trajectory Predictions in Flight Management Systems | Nov 4, 2020 | Decision MakingManagement | CodeCode Available | 1 |
| Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines | Oct 8, 2020 | Common Sense ReasoningCommonsense Reasoning for RL | CodeCode Available | 1 |
| Multi-task Causal Learning with Gaussian Processes | Sep 27, 2020 | Active LearningBayesian Optimization | CodeCode Available | 1 |
| CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Sep 23, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Occupancy Anticipation for Efficient Exploration and Navigation | Aug 21, 2020 | Decision MakingEfficient Exploration | CodeCode Available | 1 |
| Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step Trees | Jun 29, 2020 | Bayesian OptimizationDecision Making | CodeCode Available | 1 |
| Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal Constraints | May 27, 2020 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL | May 10, 2020 | Decision MakingLifelong learning | CodeCode Available | 1 |
| Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Mar 3, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Dynamic Belief Graphs to Generalize on Text-Based Games | Feb 21, 2020 | Decision MakingKnowledge Graphs | CodeCode Available | 1 |
| PDDLGym: Gym Environments from PDDL Problems | Feb 15, 2020 | Decision MakingOpenAI Gym | CodeCode Available | 1 |
| Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription | Feb 13, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making | Feb 5, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions | Oct 15, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees | Sep 11, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling | Oct 29, 2018 | Collaborative FilteringDecision Making | CodeCode Available | 1 |
| Learning Multi-Level Hierarchies with Hindsight | Dec 4, 2017 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 1 |
| SkipNet: Learning Dynamic Routing in Convolutional Networks | Nov 26, 2017 | Decision MakingReinforcement Learning | CodeCode Available | 1 |
| Thinking Fast and Slow with Deep Learning and Tree Search | May 23, 2017 | Decision MakingDeep Learning | CodeCode Available | 1 |
| An Alternative Softmax Operator for Reinforcement Learning | Dec 16, 2016 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air | Jul 15, 2025 | DenoisingSequential Decision Making | —Unverified | 0 |