| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | Aug 16, 2020 | Multi-agent Reinforcement Learning, Offline RL | —Unverified | 0 | 0 |
| The Sample-Communication Complexity Trade-off in Federated Q-Learning | Aug 30, 2024 | Q-Learning | —Unverified | 0 | 0 |
| The Sample Complexity of Teaching-by-Reinforcement on Q-Learning | Jun 16, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| The tree reconstruction game: phylogenetic reconstruction using reinforcement learning | Mar 12, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| The Value of Chess Squares | Jul 8, 2023 | Game of Chess, Q-Learning | —Unverified | 0 | 0 |
| The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions | Sep 27, 2018 | Deep Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 | 0 |
| Throughput and Latency in the Distributed Q-Learning Random Access mMTC Networks | Oct 30, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis | Feb 12, 2021 | Natural Questions, Q-Learning | —Unverified | 0 | 0 |
| Time-Scale Separation in Q-Learning: Extending TD(Δ) for Action-Value Function Decomposition | Nov 21, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Towards a Deep Reinforcement Learning Approach for Tower Line Wars | Dec 17, 2017 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| A step toward a reinforcement learning de novo genome assembler | Feb 2, 2021 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Towards Characterizing Divergence in Deep Q-Learning | Mar 21, 2019 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Towards Learning to Speak and Hear Through Multi-Agent Communication over a Continuous Acoustic Channel | Nov 4, 2021 | Language Acquisition, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Towards Resilience for Multi-Agent QD-Learning | Apr 7, 2021 | All, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control | Mar 17, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Towards Secure and Efficient Data Scheduling for Vehicular Social Networks | Jun 28, 2024 | Q-Learning, Scheduling | —Unverified | 0 | 0 |
| Autonomous Airline Revenue Management: A Deep Reinforcement Learning Approach to Seat Inventory Control and Overbooking | Feb 18, 2019 | Deep Reinforcement Learning, Management | —Unverified | 0 | 0 |
| Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization | May 31, 2020 | counterfactual, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning | Sep 28, 2020 | counterfactual, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Towards Unknown-aware Deep Q-Learning | Sep 29, 2021 | Deep Reinforcement Learning, Out of Distribution (OOD) Detection | —Unverified | 0 | 0 |
| Toward Synergic Learning for Autonomous Manipulation of Deformable Tissues via Surgical Robots: An Approximate Q-Learning Approach | Oct 8, 2019 | Q-Learning | —Unverified | 0 | 0 |
| Trading the Twitter Sentiment with Reinforcement Learning | Jan 7, 2018 | BIG-bench Machine Learning, Q-Learning | —Unverified | 0 | 0 |
| Traffic Signal Control and Speed Offset Coordination Using Q-Learning for Arterial Road Networks | Apr 9, 2024 | Q-Learning, Traffic Signal Control | —Unverified | 0 | 0 |
| Transfer Learning in Multi-Agent Reinforcement Learning with Double Q-Networks for Distributed Resource Sharing in V2X Communication | Jul 13, 2021 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Transferred Q-learning | Feb 9, 2022 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| Transfer Reinforcement Learning under Unobserved Contextual Information | Mar 9, 2020 | Motion Planning, Q-Learning | —Unverified | 0 | 0 |
| Tuning Path Tracking Controllers for Autonomous Cars Using Reinforcement Learning | Jan 9, 2023 | Navigate, Q-Learning | —Unverified | 0 | 0 |
| Two Phase Q-learning for Bidding-based Vehicle Sharing | Sep 29, 2015 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| Two-stage WECC Composite Load Modeling: A Double Deep Q-Learning Networks Approach | Nov 8, 2019 | Q-Learning | —Unverified | 0 | 0 |
| Two-Step Q-Learning | Jul 2, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Two Timescale Convergent Q-learning for Sleep-Scheduling in Wireless Sensor Networks | Dec 27, 2013 | feature selection, Intrusion Detection | —Unverified | 0 | 0 |
| Two-Timescale Networks for Nonlinear Value Function Approximation | May 1, 2019 | Q-Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games | Dec 8, 2023 | Q-Learning, valid | —Unverified | 0 | 0 |
| UAV Aided Search and Rescue Operation Using Reinforcement Learning | Feb 19, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| UAV-Assisted Space-Air-Ground Integrated Networks: A Technical Review of Recent Learning Algorithms | Nov 27, 2022 | Fairness, Q-Learning | —Unverified | 0 | 0 |
| UAV Base Station Trajectory Optimization Based on Reinforcement Learning in Post-disaster Search and Rescue Operations | Feb 17, 2022 | Clustering, Q-Learning | —Unverified | 0 | 0 |
| UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning | Sep 21, 2023 | Q-Learning | —Unverified | 0 | 0 |
| UCB Exploration via Q-Ensembles | Jun 5, 2017 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Unbiased Methods for Multi-Goal Reinforcement Learning | Jun 16, 2021 | Multi-Goal Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Uncertainty Weighted Offline Reinforcement Learning | Jan 1, 2021 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective | Sep 26, 2022 | Imitation Learning, Multi-Goal Reinforcement Learning | —Unverified | 0 | 0 |
| Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization | Mar 31, 2023 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration | Apr 15, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Unified continuous-time q-learning for mean-field game and mean-field control problems | Jul 5, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Unified ODE Analysis of Smooth Q-Learning Algorithms | Apr 20, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Unified Reinforcement Q-Learning for Mean Field Game and Control Problems | Jun 24, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Unifying Ensemble Methods for Q-learning via Social Choice Theory | Feb 27, 2019 | Diversity, Q-Learning | —Unverified | 0 | 0 |
| Unifying Top-down and Bottom-up for Recurrent Visual Attention | Sep 29, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Universal Approximation Theorem for Deep Q-Learning via FBSDE System | May 9, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Universal Approximation Theorem of Deep Q-Networks | May 4, 2025 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |