The Atari Grand Challenge Dataset May 31, 2017 Imitation Learning Reinforcement Learning
Code Code Available 0Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning May 25, 2022 continuous-control Continuous Control
Code Code Available 0Reward-Machine-Guided, Self-Paced Reinforcement Learning May 25, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Variance Networks: When Expectation Does Not Meet Your Expectations Mar 10, 2018 Efficient Exploration Reinforcement Learning
Code Code Available 0Reinforcement Learning to Disentangle Multiqubit Quantum States from Partial Observations Jun 12, 2024 Benchmarking Deep Reinforcement Learning
Code Code Available 0Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU Nov 18, 2016 CPU GPU
Code Code Available 0The Benefits of Model-Based Generalization in Reinforcement Learning Nov 4, 2022 Model-based Reinforcement Learning reinforcement-learning
Code Code Available 0MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning Sep 17, 2019 MuJoCo OpenAI Gym
Code Code Available 0Meta-Reinforcement Learning for Reliable Communication in THz/VLC Wireless VR Networks Jan 29, 2021 Meta-Learning Meta Reinforcement Learning
Code Code Available 0Modular Multitask Reinforcement Learning with Policy Sketches Nov 6, 2016 continuous-control Continuous Control
Code Code Available 0Learning to Play Text-based Adventure Games with Maximum Entropy Reinforcement Learning Feb 21, 2023 Q-Learning reinforcement-learning
Code Code Available 0Post-processing Networks: Method for Optimizing Pipeline Task-oriented Dialogue Systems using Reinforcement Learning Jul 25, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI Teammates Nov 18, 2021 Decision Making reinforcement-learning
Code Code Available 0MDPGT: Momentum-based Decentralized Policy Gradient Tracking Dec 6, 2021 Multi-agent Reinforcement Learning Policy Gradient Methods
Code Code Available 0Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse May 29, 2023 continuous-control Continuous Control
Code Code Available 0Unifying Count-Based Exploration and Intrinsic Motivation Jun 6, 2016 Atari Games Montezuma's Revenge
Code Code Available 0SliceIt! -- A Dual Simulator Framework for Learning Robot Food Slicing Apr 3, 2024 Reinforcement Learning (RL)
Code Code Available 0Posterior Sampling for Reinforcement Learning Without Episodes Aug 9, 2016 reinforcement-learning Reinforcement Learning
Code Code Available 0Variance Reduction based Experience Replay for Policy Optimization Aug 25, 2022 Reinforcement Learning (RL)
Code Code Available 0SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning Dec 28, 2019 Atari Games Deep Reinforcement Learning
Code Code Available 0The Chef's Hat Simulation Environment for Reinforcement-Learning-Based Agents Mar 12, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 0Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes Feb 27, 2020 Decision Making reinforcement-learning
Code Code Available 0Posterior-regularized REINFORCE for Instance Selection in Distant Supervision Apr 17, 2019 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Modular Multi-Objective Deep Reinforcement Learning with Decision Values Apr 21, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation May 1, 2025 Hallucination Navigate
Code Code Available 0Unifying Interpretability and Explainability for Alzheimer's Disease Progression Prediction Jun 11, 2024 Reinforcement Learning (RL)
Code Code Available 0Reinforcement Learning of Musculoskeletal Control from Functional Simulations Jul 13, 2020 Anatomy Deep Reinforcement Learning
Code Code Available 0Reward-Weighted Regression Converges to a Global Optimum Jul 19, 2021 regression Reinforcement Learning (RL)
Code Code Available 0Reinforcement Learning of Active Vision for Manipulating Objects under Occlusions Nov 20, 2018 Object reinforcement-learning
Code Code Available 0Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning Mar 22, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning Jan 21, 2023 Lifelong learning reinforcement-learning
Code Code Available 0Reinforcement Learning Neural Turing Machines - Revised May 4, 2015 reinforcement-learning Reinforcement Learning
Code Code Available 0MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations Mar 30, 2023 Decision Making Imitation Learning
Code Code Available 0Smart Imitator: Learning from Imperfect Clinical Decisions Jan 10, 2025 Imitation Learning Reinforcement Learning (RL)
Code Code Available 0Reinforcement Learning In Two Player Zero Sum Simultaneous Action Games Oct 10, 2021 Imitation Learning Meta-Learning
Code Code Available 0MDP environments for the OpenAI Gym Sep 26, 2017 OpenAI Gym reinforcement-learning
Code Code Available 0Smart Magnetic Microrobots Learn to Swim with Deep Reinforcement Learning Jan 14, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Towards Understanding the Link Between Modularity and Performance in Neural Networks for Reinforcement Learning May 13, 2022 Diversity reinforcement-learning
Code Code Available 0Toybox: A Suite of Environments for Experimental Evaluation of Deep Reinforcement Learning May 7, 2019 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0ToyBox: Better Atari Environments for Testing Reinforcement Learning Agents Dec 6, 2018 Atari Games reinforcement-learning
Code Code Available 0Post Reinforcement Learning Inference Feb 17, 2023 counterfactual Off-policy evaluation
Code Code Available 0Reinforcement Learning Increases Wind Farm Power Production by Enabling Closed-Loop Collaborative Control Jun 25, 2025 Bayesian Optimization Reinforcement Learning (RL)
Code Code Available 0Off-policy Evaluation in Doubly Inhomogeneous Environments Jun 14, 2023 Offline RL Off-policy evaluation
Code Code Available 0The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards Jan 11, 2024 continuous-control Continuous Control
Code Code Available 0SME-Net: Sparse Motion Estimation for Parametric Video Prediction Through Reinforcement Learning Oct 1, 2019 Motion Compensation Motion Estimation
Code Code Available 0SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies Dec 1, 2019 continuous-control Continuous Control
Code Code Available 0Risk-Aware Active Inverse Reinforcement Learning Jan 8, 2019 Active Learning reinforcement-learning
Code Code Available 0SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments Dec 11, 2019 Navigate reinforcement-learning
Code Code Available 0SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning Nov 11, 2019 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving Jun 5, 2023 Autonomous Driving Motion Planning
Code Code Available 0