| Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem | Apr 22, 2021 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model | Feb 23, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 1 |
| An empirical evaluation of active inference in multi-armed bandits | Jan 21, 2021 | BIG-bench Machine LearningDecision Making | CodeCode Available | 1 |
| TimeSHAP: Explaining Recurrent Models through Sequence Perturbations | Nov 30, 2020 | Decision MakingFeature Importance | CodeCode Available | 1 |
| Adaptive Stress Testing of Trajectory Predictions in Flight Management Systems | Nov 4, 2020 | Decision MakingManagement | CodeCode Available | 1 |
| Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines | Oct 8, 2020 | Common Sense ReasoningCommonsense Reasoning for RL | CodeCode Available | 1 |
| Multi-task Causal Learning with Gaussian Processes | Sep 27, 2020 | Active LearningBayesian Optimization | CodeCode Available | 1 |
| CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Sep 23, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Occupancy Anticipation for Efficient Exploration and Navigation | Aug 21, 2020 | Decision MakingEfficient Exploration | CodeCode Available | 1 |
| Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step Trees | Jun 29, 2020 | Bayesian OptimizationDecision Making | CodeCode Available | 1 |
| Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal Constraints | May 27, 2020 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL | May 10, 2020 | Decision MakingLifelong learning | CodeCode Available | 1 |
| Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Mar 3, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Dynamic Belief Graphs to Generalize on Text-Based Games | Feb 21, 2020 | Decision MakingKnowledge Graphs | CodeCode Available | 1 |
| PDDLGym: Gym Environments from PDDL Problems | Feb 15, 2020 | Decision MakingOpenAI Gym | CodeCode Available | 1 |
| Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription | Feb 13, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making | Feb 5, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions | Oct 15, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees | Sep 11, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling | Oct 29, 2018 | Collaborative FilteringDecision Making | CodeCode Available | 1 |
| Learning Multi-Level Hierarchies with Hindsight | Dec 4, 2017 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 1 |
| SkipNet: Learning Dynamic Routing in Convolutional Networks | Nov 26, 2017 | Decision MakingReinforcement Learning | CodeCode Available | 1 |
| Thinking Fast and Slow with Deep Learning and Tree Search | May 23, 2017 | Decision MakingDeep Learning | CodeCode Available | 1 |
| An Alternative Softmax Operator for Reinforcement Learning | Dec 16, 2016 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air | Jul 15, 2025 | DenoisingSequential Decision Making | —Unverified | 0 |