Proximal Policy Optimization with Continuous Bounded Action Space via the Beta Distribution Nov 3, 2021 continuous-control Continuous Control
— Unverified 0 Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement Learning Approach Nov 2, 2021 continuous-control Continuous Control
— Unverified 0 Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay Nov 2, 2021 Computational Efficiency continuous-control
— Unverified 0 Adjacency constraint for efficient hierarchical reinforcement learning Oct 30, 2021 continuous-control Continuous Control
— Unverified 0 Context Meta-Reinforcement Learning via Neuromodulation Oct 30, 2021 continuous-control Continuous Control
Code Available 0 Dream to Explore: Adaptive Simulations for Autonomous Systems Oct 27, 2021 continuous-control Continuous Control
— Unverified 0 Automating Control of Overestimation Bias for Reinforcement Learning Oct 26, 2021 Continuous Control Q-Learning
— Unverified 0 Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks Oct 25, 2021 Benchmarking continuous-control
Code Available 0 Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning Oct 23, 2021 continuous-control Continuous Control
— Unverified 0 Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction Oct 22, 2021 continuous-control Continuous Control
— Unverified 0 Is High Variance Unavoidable in RL? A Case Study in Continuous Control Oct 21, 2021 continuous-control Continuous Control
— Unverified 0 Off-Dynamics Inverse Reinforcement Learning from Hetero-Domain Oct 21, 2021 continuous-control Continuous Control
— Unverified 0 Balancing Value Underestimation and Overestimation with Realistic Actor-Critic Oct 19, 2021 continuous-control Continuous Control
Code Available 0 Continuous Control with Action Quantization from Demonstrations Oct 19, 2021 continuous-control Continuous Control
— Unverified 0 Offline Reinforcement Learning with Soft Behavior Regularization Oct 14, 2021 continuous-control Continuous Control
— Unverified 0 Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization Oct 10, 2021 continuous-control Continuous Control
— Unverified 0 Evaluating model-based planning and planner amortization for continuous control Oct 7, 2021 continuous-control Continuous Control
— Unverified 0 Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning Oct 7, 2021 Continuous Control Deep Reinforcement Learning
— Unverified 0 Decentralized Cross-Entropy Method for Model-Based Reinforcement Learning Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Distributional Decision Transformer for Hindsight Information Matching Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Gradient Information Matters in Policy Optimization by Back-propagating through Model Sep 29, 2021 continuous-control Continuous Control
Code Available 0 State-Only Imitation Learning by Trajectory Distribution Matching Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Imitation Learning from Pixel Observations for Continuous Control Sep 29, 2021 Benchmarking continuous-control
— Unverified 0 SPLID: Self-Imitation Policy Learning through Iterative Distillation Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Effects of Conservatism on Offline Learning Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Graph-Enhanced Exploration for Goal-oriented Reinforcement Learning Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Joint Self-Supervised Learning for Vision-based Reinforcement Learning Sep 29, 2021 Autonomous Driving continuous-control
— Unverified 0 Multi-batch Reinforcement Learning via Sample Transfer and Imitation Learning Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 An Experimental Design Perspective on Exploration in Reinforcement Learning Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Generalizing Successor Features to continuous domains for Multi-task Learning Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Evaluating Robustness of Cooperative MARL Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Evolutionary Diversity Optimization with Clustering-based Selection for Reinforcement Learning Sep 29, 2021 Clustering continuous-control
— Unverified 0 Reward Shifting for Optimistic Exploration and Conservative Exploitation Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Transformers are Meta-Reinforcement Learners Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Improving Safety in Deep Reinforcement Learning using Unsupervised Action Planning Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Faster Reinforcement Learning with Value Target Lower Bounding Sep 29, 2021 Atari Games continuous-control
— Unverified 0 Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts Sep 29, 2021 Autonomous Driving continuous-control
— Unverified 0 Meta Attention For Off-Policy Actor-Critic Sep 29, 2021 continuous-control Continuous Control
— Unverified 0 Exploring More When It Needs in Deep Reinforcement Learning Sep 28, 2021 continuous-control Continuous Control
— Unverified 0 Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience Sep 24, 2021 continuous-control Continuous Control
— Unverified 0 Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients Sep 24, 2021 continuous-control Continuous Control
Code Available 0 Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods Sep 22, 2021 continuous-control Continuous Control
Code Available 0 Federated Ensemble Model-based Reinforcement Learning in Edge Computing Sep 12, 2021 Autonomous Driving continuous-control
— Unverified 0 Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning Sep 8, 2021 Adversarial Attack continuous-control
— Unverified 0 ADER: Adapting between Exploration and Robustness for Actor-Critic Methods Sep 8, 2021 continuous-control Continuous Control
— Unverified 0 Error Controlled Actor-Critic Sep 6, 2021 continuous-control Continuous Control
Code Available 0 Photonic Quantum Policy Learning in OpenAI Gym Aug 29, 2021 BIG-bench Machine Learning continuous-control
— Unverified 0 HAC Explore: Accelerating Exploration with Hierarchical Reinforcement Learning Aug 12, 2021 continuous-control Continuous Control
— Unverified 0 Imitation Learning by Reinforcement Learning Aug 10, 2021 continuous-control Continuous Control
Code Available 0