Policy Gradient Reinforcement Learning for Uncertain Polytopic LPV Systems based on MHE-MPC Jun 10, 2022 Model Predictive Control reinforcement-learning
— Unverified 0Policy Gradients for Probabilistic Constrained Reinforcement Learning Oct 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Policy Gradients Incorporating the Future Aug 4, 2021 Offline RL Reinforcement Learning (RL)
— Unverified 0Policy Gradients with Variance Related Risk Criteria Jun 27, 2012 Reinforcement Learning (RL)
— Unverified 0Policy Gradient using Weak Derivatives for Reinforcement Learning Apr 9, 2020 OpenAI Gym reinforcement-learning
— Unverified 0Policy Gradient with Expected Quadratic Utility Maximization: A New Mean-Variance Approach in Reinforcement Learning Sep 28, 2020 Decision Making Management
— Unverified 0Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment Oct 3, 2020 Decision Making Decision Making Under Uncertainty
— Unverified 0Policy Gradient With Serial Markov Chain Reasoning Oct 13, 2022 Decision Making MuJoCo
— Unverified 0Policy Learning and Evaluation with Randomized Quasi-Monte Carlo Feb 16, 2022 continuous-control Continuous Control
— Unverified 0Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence May 24, 2021 Reinforcement Learning (RL)
— Unverified 0Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes Jan 30, 2021 Reinforcement Learning (RL)
— Unverified 0Policy Networks with Two-Stage Training for Dialogue Systems Jun 10, 2016 Deep Reinforcement Learning Dialogue State Tracking
— Unverified 0Policy Optimization as Wasserstein Gradient Flows Aug 9, 2018 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Policy Optimization by Genetic Distillation Nov 3, 2017 Deep Reinforcement Learning Imitation Learning
— Unverified 0Policy Optimization by Local Improvement through Search Sep 25, 2019 Imitation Learning reinforcement-learning
— Unverified 0Policy Optimization finds Nash Equilibrium in Regularized General-Sum LQ Games Mar 25, 2024 Reinforcement Learning (RL)
— Unverified 0Policy Optimization for Continuous Reinforcement Learning May 30, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Policy Optimization for H_2 Linear Control with H_ Robustness Guarantee: Implicit Regularization and Global Convergence Jun 8, 2020 Reinforcement Learning (RL)
— Unverified 0Policy Optimization for H_2 Linear Control with H_ Robustness Guarantee: Implicit Regularization and Global Convergence Oct 21, 2019 Policy Gradient Methods Reinforcement Learning
— Unverified 0Policy Optimization for Stochastic Shortest Path Feb 7, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Policy Optimization over General State and Action Spaces Nov 30, 2022 Reinforcement Learning (RL)
— Unverified 0Policy Optimization with Demonstrations Jul 1, 2018 Policy Gradient Methods Reinforcement Learning
— Unverified 0Policy Optimization with Model-based Explorations Nov 18, 2018 Atari Games Decision Making
— Unverified 0Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation Jan 26, 2023 Adversarial Robustness MuJoCo
— Unverified 0Policy Optimization with Sparse Global Contrastive Explanations Jul 13, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Policy Optimization with Stochastic Mirror Descent Jun 25, 2019 Continuous Control Policy Gradient Methods
— Unverified 0Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space Sep 15, 2019 continuous-control Continuous Control
— Unverified 0Policy Resilience to Environment Poisoning Attacks on Reinforcement Learning Apr 24, 2023 Meta-Learning reinforcement-learning
— Unverified 0Policy Reuse for Communication Load Balancing in Unseen Traffic Scenarios Mar 22, 2023 Reinforcement Learning (RL)
— Unverified 0Policy Search by Target Distribution Learning for Continuous Control May 27, 2019 continuous-control Continuous Control
— Unverified 0Policy Search for Motor Primitives in Robotics Dec 1, 2008 Imitation Learning Policy Gradient Methods
— Unverified 0Policy Search in Continuous Action Domains: an Overview Mar 13, 2018 Bayesian Optimization Deep Reinforcement Learning
— Unverified 0Policy Search using Dynamic Mirror Descent MPC for Model Free Off Policy RL Oct 23, 2021 Model Predictive Control MuJoCo
— Unverified 0Policy Shaping: Integrating Human Feedback with Reinforcement Learning Dec 1, 2013 reinforcement-learning Reinforcement Learning
— Unverified 0Policy Smoothing for Provably Robust Reinforcement Learning Jun 21, 2021 Adversarial Robustness image-classification
— Unverified 0Policy Synthesis and Reinforcement Learning for Discounted LTL May 26, 2023 PAC learning reinforcement-learning
— Unverified 0Policy Teaching in Reinforcement Learning via Environment Poisoning Attacks Nov 21, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Policy Tree Network Sep 25, 2019 Model-based Reinforcement Learning MuJoCo
— Unverified 0POLTER: Policy Trajectory Ensemble Regularization for Unsupervised Reinforcement Learning May 23, 2022 Open-Ended Question Answering reinforcement-learning
— Unverified 0Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions Jul 12, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Polyphonic Music Composition: An Adversarial Inverse Reinforcement Learning Approach Sep 29, 2021 Q-Learning reinforcement-learning
— Unverified 0Polyphonic Music Composition with LSTM Neural Networks and Reinforcement Learning Feb 5, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0POMDP-lite for Robust Robot Planning under Uncertainty Feb 16, 2016 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0POMRL: No-Regret Learning-to-Plan with Increasing Horizons Dec 30, 2022 Meta Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0PooL: Pheromone-inspired Communication Framework forLarge Scale Multi-Agent Reinforcement Learning Feb 20, 2022 Multi-agent Reinforcement Learning Q-Learning
— Unverified 0Population-based Global Optimisation Methods for Learning Long-term Dependencies with RNNs May 23, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning Jun 15, 2021 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Portfolio Management with Reinforcement Learning Dec 14, 2020 Management reinforcement-learning
— Unverified 0Portfolio Optimization with 2D Relative-Attentional Gated Transformer Dec 27, 2020 Deep Reinforcement Learning Portfolio Optimization
— Unverified 0PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning Dec 12, 2016 6D Pose Estimation using RGB Pose Estimation
— Unverified 0