| Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | May 2, 2024 | Multi-Armed Bandits, Sequential Decision Making | —Unverified | 0 | 0 |
| Provably Efficient UCB-type Algorithms For Learning Predictive State Representations | Jul 1, 2023 | Computational Efficiency, Decision Making | —Unverified | 0 | 0 |
| Provably Learning Nash Policies in Constrained Markov Potential Games | Jun 13, 2023 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces | May 26, 2014 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Pure Exploration under Mediators' Feedback | Aug 29, 2023 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine | Jun 8, 2025 | Decision Making, Quantization | —Unverified | 0 | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Q-learning with temporal memory to navigate turbulence | Apr 26, 2024 | Decision Making, Navigate | —Unverified | 0 | 0 |
| Quantile Off-Policy Evaluation via Deep Conditional Generative Learning | Dec 29, 2022 | Decision Making, Off-policy evaluation | —Unverified | 0 | 0 |
| Quantum deep recurrent reinforcement learning | Oct 26, 2022 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness | Oct 28, 2024 | Computational Efficiency, Decision Making | —Unverified | 0 | 0 |
| RAISE: Reinforenced Adaptive Instruction Selection For Large Language Models | Apr 9, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |
| Real-Time Web Scale Event Summarization Using Sequential Decision Making | May 12, 2016 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Mar 11, 2025 | Navigate, Sequential Decision Making | —Unverified | 0 | 0 |
| Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks | Jul 13, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Rectifying Reinforcement Learning for Reward Matching | Jun 4, 2024 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions | Dec 26, 2021 | Decision Making, General Reinforcement Learning | —Unverified | 0 | 0 |
| Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization | May 1, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Regret Bounds for Online Portfolio Selection with a Cardinality Constraint | Dec 1, 2018 | Computational Efficiency, Decision Making | —Unverified | 0 | 0 |
| Regret-Minimization Algorithms for Multi-Agent Cooperative Learning Systems | Oct 30, 2023 | Cloud Computing, Decision Making | —Unverified | 0 | 0 |
| Regret Minimization via Saddle Point Optimization | Mar 15, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Regular Decision Processes for Grid Worlds | Nov 5, 2021 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 | 0 |
| Regularized Conditional Diffusion Model for Multi-Task Preference Alignment | Apr 7, 2024 | D4RL, Decision Making | —Unverified | 0 | 0 |
| Regularized Q-Learning with Linear Function Approximation | Jan 26, 2024 | Decision Making Under Uncertainty, Q-Learning | —Unverified | 0 | 0 |
| Reinforced Approximate Exploratory Data Analysis | Dec 12, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |