“Other-Play” for Zero-Shot Coordination Jan 1, 2020 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning Mar 20, 2025 Reinforcement Learning (RL)
OTTR: Off-Road Trajectory Tracking using Reinforcement Learning Oct 5, 2021 reinforcement-learning Reinforcement Learning
Outcome-Constrained Large Language Models for Countering Hate Speech Mar 25, 2024 Reinforcement Learning (RL) Text Generation
Outcome-Driven Reinforcement Learning via Variational Inference Apr 20, 2021 reinforcement-learning Reinforcement Learning
Outcome-Guided Counterfactuals for Reinforcement Learning Agents from a Jointly Trained Generative Latent Space Jul 15, 2022 counterfactual Reinforcement Learning (RL)
Outline Objects using Deep Reinforcement Learning Apr 10, 2018 Deep Reinforcement Learning Image Segmentation
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows May 6, 2024 Causal Inference counterfactual
Out-of-Distribution Detection for Neurosymbolic Autonomous Cyber Agents Dec 3, 2024 Out-of-Distribution Detection Reinforcement Learning (RL)
Out-of-distribution generalization of internal models is correlated with reward Mar 9, 2021 Out-of-Distribution Generalization reinforcement-learning
Out-of-the-box channel pruned networks Apr 30, 2020 Reinforcement Learning Reinforcement Learning (RL)
Output Feedback Adaptive Optimal Control of Affine Nonlinear systems with a Linear Measurement Model Oct 13, 2022 Model-based Reinforcement Learning reinforcement-learning
OVD-Explorer: A General Information-theoretic Exploration Approach for Reinforcement Learning Sep 29, 2021 MuJoCo reinforcement-learning
Overcoming Exploration: Deep Reinforcement Learning for Continuous Control in Cluttered Environments from Temporal Logic Specifications Jan 28, 2022 continuous-control Continuous Control
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning Aug 12, 2020 continuous-control Continuous Control
Overcoming Referential Ambiguity in Language-Guided Goal-Conditioned Reinforcement Learning Sep 26, 2022 Object reinforcement-learning
Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization Nov 12, 2024 Q-Learning Reinforcement Learning (RL)
Overcoming the Spectral Bias of Neural Value Approximation Jun 9, 2022 continuous-control Continuous Control
Over-communicate no more: Situated RL agents learn concise communication protocols Nov 2, 2022 Reinforcement Learning (RL)
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning Mar 1, 2024 Reinforcement Learning (RL)
Over-the-fiber Digital Predistortion Using Reinforcement Learning Jun 9, 2021 reinforcement-learning Reinforcement Learning
P4O: Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization Sep 29, 2021 Atari Games Deep Reinforcement Learning
PAC-Bayesian Model Selection for Reinforcement Learning Dec 1, 2010 model Model Selection
PAC-Bayesian Policy Evaluation for Reinforcement Learning Feb 14, 2012 Model Selection reinforcement-learning
PAC-Bayesian Randomized Value Function with Informative Prior Jan 1, 2021 Reinforcement Learning (RL)
PAC Guarantees for Cooperative Multi-Agent Reinforcement Learning with Restricted Communication May 23, 2019 Multi-agent Reinforcement Learning reinforcement-learning
Packet Routing with Graph Attention Multi-agent Reinforcement Learning Jul 28, 2021 Graph Attention Graph Neural Network
A Joint Planning and Learning Framework for Human-Aided Decision-Making Jun 17, 2019 Decision Making General Knowledge
PAC Reinforcement Learning Algorithm for General-Sum Markov Games Sep 5, 2020 Multi-agent Reinforcement Learning Q-Learning
PAC Reinforcement Learning for Predictive State Representations Jul 12, 2022 reinforcement-learning Reinforcement Learning
PAC Reinforcement Learning with Rich Observations Feb 8, 2016 Decision Making Multi-Armed Bandits
PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier Jun 12, 2025 Reinforcement Learning (RL)
PaintBot: A Reinforcement Learning Approach for Natural Media Painting Apr 3, 2019 reinforcement-learning Reinforcement Learning
Pairwise heuristic sequence alignment algorithm based on deep reinforcement learning Oct 26, 2020 Deep Reinforcement Learning Multiple Sequence Alignment
Adaptive Pairwise Weights for Temporal Credit Assignment Feb 9, 2021 Reinforcement Learning (RL)
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining Oct 19, 2022 reinforcement-learning Reinforcement Learning
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning Dec 22, 2023 AI Agent Reinforcement Learning (RL)
Pangu DeepDiver: Adaptive Search Intensity Scaling via Open-Web Reinforcement Learning May 30, 2025 Question Answering Reinforcement Learning (RL)
PEaRL: Personalized Privacy of Human-Centric Systems using Early-Exit Reinforcement Learning Mar 9, 2024 Reinforcement Learning (RL)
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations Oct 3, 2021 CPU GPU
Parallel Automatic History Matching Algorithm Using Reinforcement Learning Nov 14, 2022 reinforcement-learning Reinforcement Learning
Parallel bandit architecture based on laser chaos for reinforcement learning May 19, 2022 Decision Making Q-Learning
Parallelized Reverse Curriculum Generation Aug 4, 2021 Reinforcement Learning (RL)
Parallel Knowledge Transfer in Multi-Agent Reinforcement Learning Mar 29, 2020 Multi-agent Reinforcement Learning reinforcement-learning
Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation Sep 22, 2022 Navigate reinforcement-learning
Parameter-free Gradient Temporal Difference Learning May 10, 2021 reinforcement-learning Reinforcement Learning
Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework Jun 17, 2020 Decision Making Q-Learning
Parameterized Reinforcement Learning for Optical System Optimization Oct 9, 2020 Q-Learning reinforcement-learning
Parameter Optimization of LLC-Converter with multiple operation points using Reinforcement Learning Feb 28, 2023 reinforcement-learning Reinforcement Learning (RL)
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning Oct 1, 2017 Deep Reinforcement Learning Multi-agent Reinforcement Learning