On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures Jan 26, 2023 Decision Making Policy Gradient Methods
On the impact of MDP design for Reinforcement Learning agents in Resource Management Sep 7, 2021 Management reinforcement-learning
On the Importance of Critical Period in Multi-stage Reinforcement Learning Aug 9, 2022 AI Agent reinforcement-learning
On the improvement of model-predictive controllers Aug 29, 2023 model Model Predictive Control
On the Limitations of Markovian Rewards to Express Multi-Objective, Risk-Sensitive, and Modal Tasks Jan 26, 2024 Reinforcement Learning (RL)
On the Linear convergence of Natural Policy Gradient Algorithm May 4, 2021 Policy Gradient Methods reinforcement-learning
On the Mechanism of Reasoning Pattern Selection in Reinforcement Learning for Language Models Jun 5, 2025 Instruction Following Reinforcement Learning (RL)
On the Modeling Capabilities of Large Language Models for Sequential Decision Making Oct 8, 2024 Decision Making Diversity
On the Near-Optimality of Local Policies in Large Cooperative Multi-Agent Reinforcement Learning Sep 7, 2022 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
Opportunities and Challenges from Using Animal Videos in Reinforcement Learning for Navigation Sep 25, 2022 reinforcement-learning Reinforcement Learning (RL)
Model-Based Reinforcement Learning with a Generative Model is Minimax Optimal Jun 10, 2019 model Model-based Reinforcement Learning
On the Power of Multitask Representation Learning in Linear MDP Jun 15, 2021 Reinforcement Learning (RL) Representation Learning
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness Oct 19, 2022 Reinforcement Learning (RL)
On the Practical Consistency of Meta-Reinforcement Learning Algorithms Dec 1, 2021 Meta-Learning Meta Reinforcement Learning
On the Properties of the Softmax Function with Application in Game Theory and Reinforcement Learning Apr 3, 2017 reinforcement-learning Reinforcement Learning
On the Reduction of Variance and Overestimation of Deep Q-Learning Oct 14, 2019 Q-Learning reinforcement-learning
On the Relationship Between Active Inference and Control as Inference Jun 23, 2020 Decision Making reinforcement-learning
On the Relationship Between the OpenAI Evolution Strategy and Stochastic Gradient Descent Dec 18, 2017 Reinforcement Learning Reinforcement Learning (RL)
On the Robustness of Controlled Deep Reinforcement Learning for Slice Placement Aug 5, 2021 Deep Reinforcement Learning Management
On the Robustness of Deep Reinforcement Learning in IRS-Aided Wireless Communications Systems Jul 17, 2021 Deep Reinforcement Learning reinforcement-learning
On the Role of Discount Factor in Offline Reinforcement Learning Jun 7, 2022 D4RL Offline RL
On the role of planning in model-based deep reinforcement learning Nov 8, 2020 Deep Reinforcement Learning Model-based Reinforcement Learning
On the Role of the Action Space in Robot Manipulation Learning and Sim-to-Real Transfer Dec 6, 2023 Reinforcement Learning (RL) Robot Manipulation
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation Oct 18, 2019 reinforcement-learning Reinforcement Learning
The Curse of Passive Data Collection in Batch Reinforcement Learning Jun 18, 2021 reinforcement-learning Reinforcement Learning
On the Sample Complexity of Reinforcement Learning with Policy Space Generalization Aug 17, 2020 reinforcement-learning Reinforcement Learning (RL)
On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples Mar 7, 2023 Offline RL Off-policy evaluation
On the Sample Efficiency of Abstractions and Potential-Based Reward Shaping in Reinforcement Learning Apr 11, 2024 Reinforcement Learning (RL)
On the Search for Feedback in Reinforcement Learning Feb 21, 2020 reinforcement-learning Reinforcement Learning
On the Stability and Convergence of Robust Adversarial Reinforcement Learning: A Case Study on Linear Quadratic Systems Dec 1, 2020 continuous-control Continuous Control
On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning Jan 30, 2021 reinforcement-learning Reinforcement Learning
On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures Jan 3, 2025 Offline RL Reinforcement Learning (RL)
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL Jun 21, 2022 Reinforcement Learning (RL)
On the Stochastic (Variance-Reduced) Proximal Gradient Method for Regularized Expected Reward Optimization Jan 23, 2024 Reinforcement Learning (RL)
On the Theory of Reinforcement Learning with Once-per-Episode Feedback May 29, 2021 reinforcement-learning Reinforcement Learning
On the use of Deep Autoencoders for Efficient Embedded Reinforcement Learning Mar 25, 2019 CPU GPU
On the use of feature-maps and parameter control for improved quality-diversity meta-evolution May 21, 2021 Diversity feature selection
On the Verge of Solving Rocket League using Deep Reinforcement Learning and Sim-to-sim Transfer May 10, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
On the Weaknesses of Reinforcement Learning for Neural Machine Translation Jul 3, 2019 Machine Translation reinforcement-learning
On Thompson Sampling for Smoother-than-Lipschitz Bandits Jan 8, 2020 reinforcement-learning Reinforcement Learning
Ontology-Enhanced Decision-Making for Autonomous Agents in Dynamic and Partially Observable Environments May 27, 2024 Decision Making Reinforcement Learning (RL)
On Trade-offs of Image Prediction in Visual Model-Based Reinforcement Learning Jan 1, 2021 Model-based Reinforcement Learning reinforcement-learning
On Training Flexible Robots using Deep Reinforcement Learning Jun 29, 2019 Deep Reinforcement Learning Industrial Robots
On Transforming Reinforcement Learning by Transformer: The Development Trajectory Dec 29, 2022 Autonomous Driving reinforcement-learning
On Value Functions and the Agent-Environment Boundary May 30, 2019 Imitation Learning reinforcement-learning
On Wasserstein Reinforcement Learning and the Fokker-Planck equation Dec 19, 2017 reinforcement-learning Reinforcement Learning
OPAC: Opportunistic Actor-Critic Dec 11, 2020 continuous-control Continuous Control
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning Oct 26, 2020 Few-Shot Imitation Learning Imitation Learning
OPEB: Open Physical Environment Benchmark for Artificial Intelligence Jul 4, 2017 continuous-control Continuous Control
Open-Ended Learning Strategies for Learning Complex Locomotion Skills Jun 14, 2022 Diversity Reinforcement Learning (RL)