Time-Efficient Reinforcement Learning with Stochastic Stateful Policies Nov 7, 2023 continuous-control Continuous Control
— Unverified 00 On Proximal Policy Optimization's Heavy-tailed Gradients Feb 20, 2021 continuous-control Continuous Control
— Unverified 00 On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies Sep 21, 2022 continuous-control Continuous Control
— Unverified 00 On The Fragility of Learned Reward Functions Jan 9, 2023 continuous-control Continuous Control
— Unverified 00 Bridging the gap between Markowitz planning and deep reinforcement learning Sep 30, 2020 Asset Management Autonomous Driving
— Unverified 00 Advantage Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning Sep 25, 2019 continuous-control Continuous Control
— Unverified 00 On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control Jun 15, 2021 continuous-control Continuous Control
— Unverified 00 On the Sensitivity of Reward Inference to Misspecified Human Models Dec 9, 2022 continuous-control Continuous Control
— Unverified 00 On the Stability and Convergence of Robust Adversarial Reinforcement Learning: A Case Study on Linear Quadratic Systems Dec 1, 2020 continuous-control Continuous Control
— Unverified 00 On the stability of Lipschitz continuous control problems and its application to reinforcement learning Apr 20, 2024 continuous-control Continuous Control
— Unverified 00 OPAC: Opportunistic Actor-Critic Dec 11, 2020 continuous-control Continuous Control
— Unverified 00 OPEB: Open Physical Environment Benchmark for Artificial Intelligence Jul 4, 2017 continuous-control Continuous Control
— Unverified 00 Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization Oct 10, 2021 continuous-control Continuous Control
— Unverified 00 Optimizing Energy-Efficient Braking Trajectories with Anticipatory Road Data for Automated Vehicles Jun 25, 2024 continuous-control Continuous Control
— Unverified 00 Boosting MCTS with Free Energy Minimization Jan 22, 2025 continuous-control Continuous Control
— Unverified 00 To the Noise and Back: Diffusion for Shared Autonomy Feb 23, 2023 continuous-control Continuous Control
— Unverified 00 Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning Aug 6, 2024 Continuous Control Density Estimation
— Unverified 00 OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments Dec 19, 2023 continuous-control Continuous Control
— Unverified 00 Overcoming Exploration: Deep Reinforcement Learning for Continuous Control in Cluttered Environments from Temporal Logic Specifications Jan 28, 2022 continuous-control Continuous Control
— Unverified 00 Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control May 1, 2020 continuous-control Continuous Control
— Unverified 00 Overcoming Model Bias for Robust Offline Deep Reinforcement Learning Aug 12, 2020 continuous-control Continuous Control
— Unverified 00 Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies Oct 10, 2022 continuous-control Continuous Control
— Unverified 00 Overcoming the Spectral Bias of Neural Value Approximation Jun 9, 2022 continuous-control Continuous Control
— Unverified 00 PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm Jun 11, 2023 Continuous Control Distributional Reinforcement Learning
— Unverified 00 Adjacency constraint for efficient hierarchical reinforcement learning Oct 30, 2021 continuous-control Continuous Control
— Unverified 00 ADER:Adapting between Exploration and Robustness for Actor-Critic Methods Sep 8, 2021 continuous-control Continuous Control
— Unverified 00 Towards Characterizing Divergence in Deep Q-Learning Mar 21, 2019 continuous-control Continuous Control
— Unverified 00 Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States Oct 1, 2022 continuous-control Continuous Control
— Unverified 00 Path Integral Networks: End-to-End Differentiable Optimal Control Jun 29, 2017 continuous-control Continuous Control
— Unverified 00 PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning Apr 24, 2020 continuous-control Continuous Control
— Unverified 00 Bi-Level Policy Optimization with Nyström Hypergradients May 16, 2025 Bilevel Optimization continuous-control
— Unverified 00 Photonic Quantum Policy Learning in OpenAI Gym Aug 29, 2021 BIG-bench Machine Learning continuous-control
— Unverified 00 Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning Sep 25, 2017 continuous-control Continuous Control
— Unverified 00 Biased Estimates of Advantages over Path Ensembles Sep 15, 2019 Atari Games continuous-control
— Unverified 00 Better Exploration with Optimistic Actor-Critic Oct 28, 2019 continuous-control Continuous Control
— Unverified 00 PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference Mar 1, 2020 Bayesian Inference continuous-control
— Unverified 00 Planning and Control of Uncertain Cooperative Mobile Manipulator-Endowed Systems under Temporal-Logic Tasks Mar 2, 2023 continuous-control Continuous Control
— Unverified 00 Benchmarking Smoothness and Reducing High-Frequency Oscillations in Continuous Control Policies Oct 22, 2024 Benchmarking continuous-control
— Unverified 00 Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning Oct 9, 2023 continuous-control Continuous Control
— Unverified 00 Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning Oct 24, 2020 continuous-control Continuous Control
— Unverified 00 Behavior Regularized Offline Reinforcement Learning Nov 26, 2019 continuous-control Continuous Control
— Unverified 00 Zeroth-Order Supervised Policy Improvement Jun 11, 2020 continuous-control Continuous Control
— Unverified 00 Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning Feb 7, 2025 continuous-control Continuous Control
— Unverified 00 Adaptive Policy Learning for Offline-to-Online Reinforcement Learning Mar 14, 2023 continuous-control Continuous Control
— Unverified 00 Policy-labeled Preference Learning: Is Preference Enough for RLHF? May 6, 2025 continuous-control Continuous Control
— Unverified 00 Policy Learning and Evaluation with Randomized Quasi-Monte Carlo Feb 16, 2022 continuous-control Continuous Control
— Unverified 00 Behavior Priors for Efficient Reinforcement Learning Oct 27, 2020 continuous-control Continuous Control
— Unverified 00 Policy Manifold Search: Exploring the Manifold Hypothesis for Diversity-based Neuroevolution Apr 27, 2021 continuous-control Continuous Control
— Unverified 00 Policy Manifold Search for Improving Diversity-based Neuroevolution Dec 15, 2020 continuous-control Continuous Control
— Unverified 00 Policy Optimization as Online Learning with Mediator Feedback Dec 15, 2020 continuous-control Continuous Control