Provably Efficient Cooperative Multi-Agent Reinforcement Learning with Function Approximation Mar 8, 2021 Federated Learning Multi-agent Reinforcement Learning
Provably Efficient CVaR RL in Low-rank MDPs Nov 20, 2023 Reinforcement Learning (RL) Representation Learning
Provably Efficient Exploration in Policy Optimization Dec 12, 2019 Efficient Exploration Reinforcement Learning
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret Feb 21, 2023 Efficient Exploration reinforcement-learning
Provably Efficient Exploration in Reward Machines with Low Regret Dec 26, 2024 Efficient Exploration Reinforcement Learning (RL)
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback Jul 6, 2023 Decision Making LEMMA
Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation Jun 1, 2022 4k Lifelong learning
Provably Efficient Model-Free Algorithms for Non-stationary CMDPs Mar 10, 2023 Reinforcement Learning (RL)
Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication Oct 14, 2021 Multi-agent Reinforcement Learning Q-Learning
Provably Efficient Multi-Task Reinforcement Learning with Model Transfer Jul 19, 2021 reinforcement-learning Reinforcement Learning
Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus Jun 1, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward Jun 13, 2022 Offline RL reinforcement-learning
Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources Jun 14, 2023 Offline RL reinforcement-learning
Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints Jan 28, 2022 Reinforcement Learning (RL) Safe Exploration
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle Jun 14, 2019 Q-Learning reinforcement-learning
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle Dec 1, 2019 Q-Learning reinforcement-learning
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces Feb 7, 2020 reinforcement-learning Reinforcement Learning
Provably Efficient Reinforcement Learning with Aggregated States Dec 13, 2019 Q-Learning reinforcement-learning
Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension May 21, 2020 Reinforcement Learning (RL)
Provably Efficient Reinforcement Learning with Kernel and Neural Function Approximations Dec 1, 2020 reinforcement-learning Reinforcement Learning
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints Jan 6, 2021 reinforcement-learning Reinforcement Learning (RL)
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping Jun 23, 2020 reinforcement-learning Reinforcement Learning
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization Jun 29, 2022 Model-based Reinforcement Learning reinforcement-learning
Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games Oct 12, 2021 Multi-agent Reinforcement Learning Q-Learning
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems Jun 24, 2022 reinforcement-learning Reinforcement Learning
Provably Efficient Reinforcement Learning via Surprise Bound Feb 22, 2023 reinforcement-learning Reinforcement Learning
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL Jun 22, 2021 Deep Reinforcement Learning Offline RL
Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation Feb 14, 2025 Reinforcement Learning (RL)
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning Apr 18, 2023 Active Learning reinforcement-learning
Provably Filtering Exogenous Distractors using Multistep Inverse Dynamics Sep 29, 2021 Reinforcement Learning (RL) Representation Learning
Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration Dec 1, 2020 reinforcement-learning Reinforcement Learning
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments May 12, 2022 Deep Reinforcement Learning Motion Planning
Provably Safe Model-Based Meta Reinforcement Learning: An Abstraction-Based Approach Sep 3, 2021 Meta-Learning Meta Reinforcement Learning
Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking May 13, 2022 Benchmarking reinforcement-learning
Provably Safe Reinforcement Learning via Action Projection using Reachability Analysis and Polynomial Zonotopes Oct 19, 2022 reinforcement-learning Reinforcement Learning
Provably Sample-Efficient RL with Side Information about Latent Dynamics May 27, 2022 reinforcement-learning Reinforcement Learning (RL)
Proximal Bellman mappings for reinforcement learning and their application to robust adaptive filtering Sep 14, 2023 Reinforcement Learning (RL)
Proximal Deterministic Policy Gradient Aug 3, 2020 continuous-control Continuous Control
Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning May 23, 2023 Diversity reinforcement-learning
Proximal Policy Optimization and its Dynamic Version for Sequence Generation Aug 24, 2018 Chatbot Model Optimization
Proximal Policy Optimization-Based Reinforcement Learning Approach for DC-DC Boost Converter Control: A Comparative Evaluation Against Traditional Control Techniques Oct 4, 2023 reinforcement-learning Reinforcement Learning
Proximal Policy Optimization for Tracking Control Exploiting Future Reference Information Jul 20, 2021 Policy Gradient Methods reinforcement-learning
Proximal Policy Optimization via Enhanced Exploration Efficiency Nov 11, 2020 continuous-control Continuous Control
Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces May 26, 2014 Decision Making reinforcement-learning
Proximal Reliability Optimization for Reinforcement Learning Jun 3, 2019 reinforcement-learning Reinforcement Learning
Proxy Experience Replay: Federated Distillation for Distributed Reinforcement Learning May 13, 2020 Clustering Data Augmentation
Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy Mar 7, 2024 Language Modeling Language Modelling
Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control May 30, 2025 continuous-control Continuous Control
PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets Jan 14, 2023 Management Mixture-of-Experts
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care Jun 13, 2023 Offline RL Q-Learning