Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error Dec 26, 2022 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Off-Policy Risk-Sensitive Reinforcement Learning Based Constrained Robust Optimal Control Jun 10, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Off-Policy Selection for Initiating Human-Centric Experimental Design Oct 26, 2024 Experimental Design Reinforcement Learning (RL)
— Unverified 0Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation Jun 21, 2020 Image Captioning Reinforcement Learning (RL)
— Unverified 0Off-Policy Shaping Ensembles in Reinforcement Learning May 21, 2014 Computational Efficiency reinforcement-learning
— Unverified 0OffRIPP: Offline RL-based Informative Path Planning Sep 25, 2024 Offline RL reinforcement-learning
— Unverified 0Off-road Autonomous Vehicles Traversability Analysis and Trajectory Planning Based on Deep Inverse Reinforcement Learning Sep 16, 2019 Autonomous Vehicles reinforcement-learning
— Unverified 0Offsetting Unequal Competition through RL-assisted Incentive Schemes Jan 5, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0OffWorld Gym: open-access physical robotics environment for real-world reinforcement learning benchmark and research Oct 18, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents May 18, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0OIL: Observational Imitation Learning Mar 3, 2018 Autonomous Driving Autonomous Navigation
— Unverified 0oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions Feb 20, 2020 continuous-control Continuous Control
— Unverified 0O-MAPL: Offline Multi-agent Preference Learning Jan 31, 2025 Reinforcement Learning (RL) SMAC
— Unverified 0Omega-Regular Objectives in Model-Free Reinforcement Learning Sep 26, 2018 model reinforcement-learning
— Unverified 0Omega-Regular Reward Machines Aug 14, 2023 Reinforcement Learning (RL)
— Unverified 0OMG-RL:Offline Model-based Guided Reward Learning for Heparin Treatment Sep 20, 2024 Reinforcement Learning (RL)
— Unverified 0OmniDRL: Robust Pedestrian Detection using Deep Reinforcement Learning on Omnidirectional Cameras Mar 2, 2019 Deep Reinforcement Learning Pedestrian Detection
— Unverified 0OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds Feb 5, 2025 Few-Shot Learning Imitation Learning
— Unverified 0On- and Off-Policy Monotonic Policy Improvement Oct 10, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0On Applications of Bootstrap in Continuous Space Reinforcement Learning Mar 14, 2019 Decision Making reinforcement-learning
— Unverified 0On Assessing The Safety of Reinforcement Learning algorithms Using Formal Methods Nov 8, 2021 Autonomous Vehicles Q-Learning
— Unverified 0On Bellman equations for continuous-time policy evaluation I: discretization and approximation Jul 8, 2024 Reinforcement Learning (RL)
— Unverified 0On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process Feb 25, 2023 Q-Learning reinforcement-learning
— Unverified 0On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection Jun 4, 2019 Deep Reinforcement Learning Q-Learning
— Unverified 0On Computation and Generalization of Generative Adversarial Imitation Learning Jan 9, 2020 Decision Making Imitation Learning
— Unverified 0On Connections between Constrained Optimization and Reinforcement Learning Oct 18, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes Aug 29, 2024 Q-Learning Reinforcement Learning (RL)
— Unverified 0On Convergence Rate of Adaptive Multiscale Value Function Approximation For Reinforcement Learning Aug 22, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0On Corruption-Robustness in Performative Reinforcement Learning May 8, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning Oct 13, 2021 Imitation Learning Recommendation Systems
— Unverified 0On Decentralizing Federated Reinforcement Learning in Multi-Robot Scenarios Jul 19, 2022 Federated Learning Q-Learning
— Unverified 0On Double Descent in Reinforcement Learning with LSTD and Random Features Oct 9, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes Apr 24, 2023 Reinforcement Learning (RL)
— Unverified 0On Efficiency in Hierarchical Reinforcement Learning Dec 1, 2020 Computational Efficiency Decision Making
— Unverified 0On Enhancing Network Throughput using Reinforcement Learning in Sliced Testbeds Dec 21, 2024 Combinatorial Optimization Reinforcement Learning (RL)
— Unverified 0One Policy but Many Worlds: A Scalable Unified Policy for Versatile Humanoid Locomotion May 24, 2025 Humanoid Control Motion Synthesis
— Unverified 0One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning May 31, 2022 Reinforcement Learning (RL)
— Unverified 0One RL to See Them All: Visual Triple Unified Reinforcement Learning May 23, 2025 All Math
— Unverified 0One-shot learning and behavioral eligibility traces in sequential decision making Nov 12, 2019 Decision Making Learning Theory
— Unverified 0One-Shot Learning of Manipulation Skills with Online Dynamics Adaptation and Neural Network Priors Sep 23, 2015 Model-based Reinforcement Learning Model Predictive Control
— Unverified 0One-shot, Offline and Production-Scalable PID Optimisation with Deep Reinforcement Learning Oct 25, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0One-Step Distributional Reinforcement Learning Apr 27, 2023 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks Mar 11, 2021 Offline RL reinforcement-learning
— Unverified 0On Gap-dependent Bounds for Offline Reinforcement Learning Jun 1, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration Jan 22, 2025 Reinforcement Learning (RL)
— Unverified 0On Hard Exploration for Reinforcement Learning: a Case Study in Pommerman Jul 26, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0On Improving Cross-dataset Generalization of Deepfake Detectors Apr 8, 2022 Binary Classification Classification
— Unverified 0On Improving Deep Reinforcement Learning for POMDPs Apr 17, 2018 Atari Games Decision Making
— Unverified 0On Inductive Biases in Deep Reinforcement Learning Jul 5, 2019 continuous-control Continuous Control
— Unverified 0On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality Oct 21, 2020 Multi-agent Reinforcement Learning Q-Learning
— Unverified 0