| Learning to Represent Action Values as a Hypergraph on the Action Vertices | Oct 28, 2020 | Atari Games, Continuous Control | Code Available | 0 |
| Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency Maps | May 18, 2020 | Atari Games, Decision Making | Code Available | 0 |
| Doubly-Robust Estimation for Correcting Position-Bias in Click Feedback for Unbiased Learning to Rank | Mar 31, 2022 | counterfactual, General Reinforcement Learning | Code Available | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement Learning, General Reinforcement Learning | Code Available | 0 |
| Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning | Jun 3, 2019 | General Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Interactive Learning from Activity Description | Feb 13, 2021 | General Reinforcement Learning, Grounded language learning | Code Available | 0 |
| The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning | Jul 7, 2020 | General Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 0 |
| A Monte Carlo AIXI Approximation | Sep 4, 2009 | General Reinforcement Learning, Open-Ended Question Answering | Code Available | 0 |
| Hypercube Policy Regularization Framework for Offline Reinforcement Learning | Nov 7, 2024 | D4RL, General Reinforcement Learning | Code Available | 0 |
| Gibson Env: Real-World Perception for Embodied Agents | Aug 31, 2018 | Domain Adaptation, General Reinforcement Learning | Code Available | 0 |