Pseudo-Model-Free Hedging for Variable Annuities via Deep Reinforcement Learning Jul 7, 2021 Deep Reinforcement Learning reinforcement-learning
Pseudorehearsal in actor-critic agents Apr 17, 2017 reinforcement-learning Reinforcement Learning
Pseudorehearsal in actor-critic agents with neural network function approximation Dec 20, 2017 reinforcement-learning Reinforcement Learning
Pseudorehearsal in value function approximation Mar 21, 2017 Q-Learning reinforcement-learning
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning Mar 9, 2021 Autonomous Driving reinforcement-learning
Reinforcement Learning and its Connections with Neuroscience and Psychology Jun 25, 2020 Atari Games Decision Making
Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics Mar 16, 2023 Deep Reinforcement Learning reinforcement-learning
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning Oct 17, 2022 Learning-To-Rank Multi-agent Reinforcement Learning
Purpose in the Machine: Do Traffic Simulators Produce Distributionally Equivalent Outcomes for Reinforcement Learning Applications? Nov 14, 2023 Reinforcement Learning (RL)
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers May 7, 2025 Math Reinforcement Learning (RL)
PWM: Policy Learning with Multi-Task World Models Jul 2, 2024 continuous-control Continuous Control
Q-Cogni: An Integrated Causal Reinforcement Learning Framework Feb 26, 2023 Causal Inference Decision Making
Q-DATA: Enhanced Traffic Flow Monitoring in Software-Defined Networks applying Q-learning Sep 4, 2019 Management Q-Learning
QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration Aug 7, 2023 continuous-control Continuous Control
QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine Jun 8, 2025 Decision Making Quantization
QF-tuner: Breaking Tradition in Reinforcement Learning Feb 26, 2024 OpenAI Gym Q-Learning
Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning Jul 15, 2020 Deep Reinforcement Learning Q-Learning
Q-greedyUCB: a New Exploration Policy for Adaptive and Resource-efficient Scheduling Jun 10, 2020 Decision Making Q-Learning
Efficient Off-Policy Reinforcement Learning via Brain-Inspired Computing May 14, 2022 Decision Making Q-Learning
QKSA: Quantum Knowledge Seeking Agent -- resource-optimized reinforcement learning using quantum process tomography Dec 7, 2021 Quantum Machine Learning reinforcement-learning
Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes Dec 1, 2017 Decision Making Deep Reinforcement Learning
Q-Learning Algorithm for VoLTE Closed-Loop Power Control in Indoor Small Cells Jul 10, 2017 Q-Learning Reinforcement Learning
Q-Learning Based Aerial Base Station Placement for Fairness Enhancement in Mobile Networks Sep 10, 2019 Fairness Q-Learning
Q-learning Based Optimal False Data Injection Attack on Probabilistic Boolean Control Networks Nov 29, 2023 Q-Learning reinforcement-learning
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL Sep 8, 2022 D4RL Offline RL
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies Mar 25, 2019 Q-Learning Reinforcement Learning
q-Learning in Continuous Time Jul 2, 2022 Learning Theory Q-Learning
Q-Learning in Regularized Mean-field Games Mar 24, 2020 Q-Learning reinforcement-learning
Q-learning with online random forests Apr 7, 2022 Q-Learning reinforcement-learning
q-Munchausen Reinforcement Learning May 16, 2022 reinforcement-learning Reinforcement Learning
Q-NAV: NAV Setting Method based on Reinforcement Learning in Underwater Wireless Networks May 21, 2020 reinforcement-learning Reinforcement Learning (RL)
Q-Networks for Binary Vector Actions Dec 4, 2015 Q-Learning reinforcement-learning
QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning Jun 22, 2020 Multi-agent Reinforcement Learning reinforcement-learning
QoS and Jamming-Aware Wireless Networking Using Deep Reinforcement Learning Oct 13, 2019 Deep Reinforcement Learning reinforcement-learning
QoS-Aware Scheduling in New Radio Using Deep Reinforcement Learning Jul 14, 2021 Deep Reinforcement Learning reinforcement-learning
Q-Policy: Quantum-Enhanced Policy Evaluation for Scalable Reinforcement Learning May 17, 2025 Reinforcement Learning (RL)
QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning Sep 9, 2020 Multi-agent Reinforcement Learning quantile regression
qRRT: Quality-Biased Incremental RRT for Optimal Motion Planning in Non-Holonomic Systems Jan 7, 2021 Motion Planning Optimal Motion Planning
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning Nov 7, 2024 Offline RL Policy Gradient Methods
Q* Approximation Schemes for Batch Reinforcement Learning: A Theoretical Comparison Mar 9, 2020 reinforcement-learning Reinforcement Learning
QT-TDM: Planning With Transformer Dynamics Model and Autoregressive Q-Learning Jul 26, 2024 continuous-control Continuous Control
Quadratic Q-network for Learning Continuous Control for Autonomous Vehicles Nov 29, 2019 Autonomous Driving Autonomous Vehicles
Quadruped Locomotion on Non-Rigid Terrain using Reinforcement Learning Jul 7, 2021 reinforcement-learning Reinforcement Learning
QuadWBG: Generalizable Quadrupedal Whole-Body Grasping Nov 11, 2024 Reinforcement Learning (RL) Transparent objects
Qualitative Differences Between Evolutionary Strategies and Reinforcement Learning Methods for Control of Autonomous Agents May 16, 2022 reinforcement-learning Reinforcement Learning
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network Jun 14, 2018 OpenAI Gym reinforcement-learning
Quality-Aware Multimodal Saliency Detection via Deep Reinforcement Learning Nov 27, 2018 Decision Making Decoder
Quality-Driven Curation of Remote Sensing Vision-Language Data via Learned Scoring Models Mar 2, 2025 Reinforcement Learning (RL)
Quality of service based radar resource management using deep reinforcement learning Oct 20, 2020 Decision Making Deep Reinforcement Learning
Quality of syntactic implication of RL-based sentence summarization Dec 11, 2019 POS Reinforcement Learning