| Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control | Dec 11, 2021 | OpenAI GymQ-Learning | —Unverified | 0 |
| Control-Tutored Reinforcement Learning: an application to the Herding Problem | Nov 26, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Approximate Global Convergence of Independent Learning in Multi-Agent Systems | May 30, 2024 | Q-Learning | —Unverified | 0 |
| Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback | Sep 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning | Sep 8, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability | Mar 22, 2021 | Q-Learning | —Unverified | 0 |
| Convergence of Recursive Stochastic Algorithms using Wasserstein Divergence | Mar 25, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Convergence Results For Q-Learning With Experience Replay | Dec 8, 2021 | Q-Learning | —Unverified | 0 |
| Convergent and Efficient Deep Q Learning Algorithm | Sep 29, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Convergent Reinforcement Learning with Function Approximation: A Bilevel Optimization Perspective | Sep 27, 2018 | Bilevel OptimizationQ-Learning | —Unverified | 0 |
| Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation | Dec 1, 2009 | Q-Learning | —Unverified | 0 |
| Convert Language Model into a Value-based Strategic Planner | May 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Convex Q Learning in a Stochastic Environment: Extended Version | Sep 10, 2023 | Q-Learning | —Unverified | 0 |
| Convex Q-Learning, Part 1: Deterministic Optimal Control | Aug 8, 2020 | Q-Learning | —Unverified | 0 |
| Cooperation and Reputation Dynamics with Reinforcement Learning | Feb 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Approximation of Convex Envelope Using Reinforcement Learning | Nov 24, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Cooperative Control of Mobile Robots with Stackelberg Learning | Aug 3, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Cooperative Deep Q-learning Framework for Environments Providing Image Feedback | Oct 28, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Cooperative Optimal Output Tracking for Discrete-Time Multiagent Systems: Stabilizing Policy Iteration Frameworks and Analysis | Jan 11, 2025 | Q-Learning | —Unverified | 0 |
| Cooperative Reward Shaping for Multi-Agent Pathfinding | Jul 15, 2024 | Collision AvoidanceMulti-agent Reinforcement Learning | —Unverified | 0 |
| Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework | Jan 24, 2025 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| Correct-by-synthesis reinforcement learning with temporal logic constraints | Mar 5, 2015 | Motion PlanningQ-Learning | —Unverified | 0 |
| Correlated Deep Q-learning based Microgrid Energy Management | Mar 6, 2021 | energy managementManagement | —Unverified | 0 |
| Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning | Nov 28, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation | Nov 26, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning | Oct 9, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network Optimization | Aug 29, 2024 | Q-Learning | —Unverified | 0 |
| Coverage-aware and Reinforcement Learning Using Multi-agent Approach for HD Map QoS in a Realistic Environment | Jul 19, 2024 | Q-Learning | —Unverified | 0 |
| Credit Assignment: Challenges and Opportunities in Developing Human-like AI Agents | Jul 16, 2023 | Learning TheoryQ-Learning | —Unverified | 0 |
| Credit-cognisant reinforcement learning for multi-agent cooperation | Nov 18, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning | Jan 13, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Cross Learning in Deep Q-Networks | Sep 29, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow | Oct 22, 2021 | Distributed OptimizationQ-Learning | —Unverified | 0 |
| Curriculum Q-Learning for Visual Vocabulary Acquisition | Nov 29, 2017 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Cycles and collusion in congestion games under Q-learning | Feb 26, 2025 | Q-Learning | —Unverified | 0 |
| A reinforcement learning approach to improve communication performance and energy utilization in fog-based IoT | Jun 1, 2021 | Industrial RobotsQ-Learning | —Unverified | 0 |
| DASA: Delay-Adaptive Multi-Agent Stochastic Approximation | Mar 25, 2024 | AvgQ-Learning | —Unverified | 0 |
| Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients | Dec 30, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Data-Driven H-infinity Control with a Real-Time and Efficient Reinforcement Learning Algorithm: An Application to Autonomous Mobility-on-Demand Systems | Sep 16, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning | Jan 14, 2025 | BenchmarkingManagement | —Unverified | 0 |
| Data-Driven Knowledge Transfer in Batch Q^* Learning | Apr 1, 2024 | Decision MakingMarketing | —Unverified | 0 |
| Data-efficient Deep Reinforcement Learning for Dexterous Manipulation | Apr 10, 2017 | continuous-controlContinuous Control | —Unverified | 0 |
| Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control | Nov 30, 2023 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Data-Efficient Quadratic Q-Learning Using LMIs | Sep 18, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| DDPG based on multi-scale strokes for financial time series trading strategy | Jun 5, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Breaking the Deadly Triad with a Target Network | Jan 21, 2021 | Q-Learning | —Unverified | 0 |
| DECAF: Learning to be Fair in Multi-agent Resource Allocation | Feb 6, 2025 | FairnessQ-Learning | —Unverified | 0 |
| Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion | Nov 21, 2023 | Q-Learning | —Unverified | 0 |
| An Attempt to Model Human Trust with Reinforcement Learning | Sep 29, 2021 | Decision MakingQ-Learning | —Unverified | 0 |