Noisy Spiking Actor Network for Exploration Mar 7, 2024 continuous-control Continuous Control
— Unverified 0Non-Markovian Control with Gated End-to-End Memory Policy Networks May 31, 2017 OpenAI Gym Reinforcement Learning
— Unverified 0Offline Inverse Reinforcement Learning Jun 9, 2021 Data Augmentation Imitation Learning
— Unverified 0Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline May 4, 2024 Computational Efficiency MuJoCo
— Unverified 0Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error Dec 26, 2022 Deep Reinforcement Learning OpenAI Gym
— Unverified 0On Combining Expert Demonstrations in Imitation Learning via Optimal Transport Jul 20, 2023 Imitation Learning OpenAI Gym
— Unverified 0Online Robust Policy Learning in the Presence of Unknown Adversaries Jul 16, 2018 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Asymptotic Analysis of Sample-averaged Q-learning Oct 14, 2024 OpenAI Gym Q-Learning
— Unverified 0Optimism is All You Need: Model-Based Imitation Learning From Observation Alone Mar 9, 2021 All Imitation Learning
— Unverified 0Optimizing 2D+1 Packing in Constrained Environments Using Deep Reinforcement Learning Mar 21, 2025 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Optimizing Sensor Redundancy in Sequential Decision-Making Problems Dec 10, 2024 Decision Making OpenAI Gym
— Unverified 0Photonic Quantum Policy Learning in OpenAI Gym Aug 29, 2021 BIG-bench Machine Learning continuous-control
— Unverified 0Policy Gradient using Weak Derivatives for Reinforcement Learning Apr 9, 2020 OpenAI Gym reinforcement-learning
— Unverified 0Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning Jun 15, 2021 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation Feb 28, 2022 continuous-control Continuous Control
— Unverified 0Proximal Policy Gradient: PPO with Policy Gradient Oct 20, 2020 OpenAI Gym
— Unverified 0Proximal Policy Optimization with Continuous Bounded Action Space via the Beta Distribution Nov 3, 2021 continuous-control Continuous Control
— Unverified 0Decision-Making in Reinforcement Learning Jun 1, 2019 Decision Making Deep Reinforcement Learning
— Unverified 0Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network Jun 14, 2018 OpenAI Gym reinforcement-learning
— Unverified 0Quality Diversity Evolutionary Learning of Decision Trees Aug 17, 2022 Diversity OpenAI Gym
— Unverified 0Reward Prediction Error as an Exploration Objective in Deep RL Jun 19, 2019 Atari Games Continuous Control
— Unverified 0RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning May 8, 2021 Imitation Learning OpenAI Gym
— Unverified 0RangL: A Reinforcement Learning Competition Platform Jul 28, 2022 OpenAI Gym reinforcement-learning
— Unverified 0The Smart Buildings Control Suite: A Diverse Open Source Benchmark to Evaluate and Scale HVAC Control Policies for Sustainability Oct 2, 2024 Model Predictive Control Offline RL
— Unverified 0Recommendation System-based Upper Confidence Bound for Online Advertising Sep 9, 2019 OpenAI Gym Product Recommendation
— Unverified 0A Learning Approach to Robot-Agnostic Force-Guided High Precision Assembly Oct 15, 2020 OpenAI Gym Vocal Bursts Intensity Prediction
— Unverified 0WD3: Taming the Estimation Bias in Deep Reinforcement Learning Jun 18, 2020 continuous-control Continuous Control
— Unverified 0Refined Continuous Control of DDPG Actors via Parametrised Activation Jun 4, 2020 continuous-control Continuous Control
— Unverified 0REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using Reinforcement Learning Agents Oct 11, 2021 Deep Reinforcement Learning Meta-Learning
— Unverified 0Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems Oct 7, 2022 Combinatorial Optimization Decision Making
— Unverified 0Reinforcement Learning for Robotics and Control with Active Uncertainty Reduction May 15, 2019 Management OpenAI Gym
— Unverified 0Reinforcement Learning using Guided Observability Apr 22, 2021 Decision Making MuJoCo
— Unverified 0Relative Importance Sampling for off-Policy Actor-Critic in Deep Reinforcement Learning Oct 30, 2018 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Remember and Forget Experience Replay for Multi-Agent Reinforcement Learning Mar 24, 2022 continuous-control Continuous Control
— Unverified 0Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations Nov 21, 2023 OpenAI Gym Reinforcement Learning (RL)
— Unverified 0Rethinking Population-assisted Off-policy Reinforcement Learning May 4, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Robustness Evaluation of Offline Reinforcement Learning for Robot Control Against Action Perturbations Dec 25, 2024 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Sample-based Distributional Policy Gradient Jan 8, 2020 Distributional Reinforcement Learning OpenAI Gym
— Unverified 0Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing Jul 11, 2023 Lifelong learning OpenAI Gym
— Unverified 0Scilab-RL: A software framework for efficient reinforcement learning and cognitive modeling research Jan 25, 2024 Data Visualization Hyperparameter Optimization
— Unverified 0Sepsis World Model: A MIMIC-based OpenAI Gym "World Model" Simulator for Sepsis Treatment Dec 15, 2019 model OpenAI Gym
— Unverified 0Sequential Learning of Movement Prediction in Dynamic Environments using LSTM Autoencoder Oct 12, 2018 Decoder Navigate
— Unverified 0Session-Level Dynamic Ad Load Optimization using Offline Robust Reinforcement Learning Jan 9, 2025 OpenAI Gym
— Unverified 0SIMILE: Introducing Sequential Information towards More Effective Imitation Learning May 1, 2019 Imitation Learning OpenAI Gym
— Unverified 0Soft Actor-Critic with Inhibitory Networks for Faster Retraining Feb 7, 2022 Deep Reinforcement Learning OpenAI Gym
— Unverified 0State Distribution-aware Sampling for Deep Q-learning Apr 23, 2018 Atari Games OpenAI Gym
— Unverified 0Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning Aug 28, 2023 D4RL Off-policy evaluation
— Unverified 0Stealing That Free Lunch: Exposing the Limits of Dyna-Style Reinforcement Learning Dec 18, 2024 Model-based Reinforcement Learning OpenAI Gym
— Unverified 0STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation May 27, 2025 D4RL Denoising
— Unverified 0Structured Evolution with Compact Architectures for Scalable Policy Optimization Apr 6, 2018 OpenAI Gym Text-to-Image Generation
— Unverified 0