| Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper | Jun 29, 2020 | OpenAI GymQ-Learning | —Unverified | 0 | 0 |
| Configuring Transmission Thresholds in IIoT Alarm Scenarios for Energy-Efficient Event Reporting | Jul 4, 2024 | Q-LearningScheduling | —Unverified | 0 | 0 |
| Consecutive Task-oriented Dialog Policy Learning | Nov 16, 2021 | Continual LearningManagement | —Unverified | 0 | 0 |
| CoNSoLe: Convex Neural Symbolic Learning | Jun 1, 2022 | Q-Learning | —Unverified | 0 | 0 |
| Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation | Jan 25, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Constrained Model-Free Reinforcement Learning for Process Optimization | Nov 16, 2020 | modelModel Predictive Control | —Unverified | 0 | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | Jul 19, 2021 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Constructing narrative using a generative model and continuous action policies | Sep 1, 2017 | Paraphrase IdentificationQ-Learning | —Unverified | 0 | 0 |
| Contextual Conservative Q-Learning for Offline Reinforcement Learning | Jan 3, 2023 | MuJoCoQ-Learning | —Unverified | 0 | 0 |
| Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts | Feb 29, 2020 | Mixture-of-ExpertsOpenAI Gym | —Unverified | 0 | 0 |
| Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis | Sep 29, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy | Jul 4, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Continuous-time q-learning for mean-field control problems | Jun 28, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty | Apr 19, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control | Dec 11, 2021 | OpenAI GymQ-Learning | —Unverified | 0 | 0 |
| Control-Tutored Reinforcement Learning: an application to the Herding Problem | Nov 26, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback | Sep 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning | Sep 8, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability | Mar 22, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Convergence of Recursive Stochastic Algorithms using Wasserstein Divergence | Mar 25, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Convergence Results For Q-Learning With Experience Replay | Dec 8, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Convergent and Efficient Deep Q Learning Algorithm | Sep 29, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Convergent Reinforcement Learning with Function Approximation: A Bilevel Optimization Perspective | Sep 27, 2018 | Bilevel OptimizationQ-Learning | —Unverified | 0 | 0 |
| Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation | Dec 1, 2009 | Q-Learning | —Unverified | 0 | 0 |
| Convert Language Model into a Value-based Strategic Planner | May 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Convex Q Learning in a Stochastic Environment: Extended Version | Sep 10, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Convex Q-Learning, Part 1: Deterministic Optimal Control | Aug 8, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Cooperation and Reputation Dynamics with Reinforcement Learning | Feb 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Cooperative Control of Mobile Robots with Stackelberg Learning | Aug 3, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Cooperative Deep Q-learning Framework for Environments Providing Image Feedback | Oct 28, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Cooperative Optimal Output Tracking for Discrete-Time Multiagent Systems: Stabilizing Policy Iteration Frameworks and Analysis | Jan 11, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Cooperative Reward Shaping for Multi-Agent Pathfinding | Jul 15, 2024 | Collision AvoidanceMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework | Jan 24, 2025 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Correct-by-synthesis reinforcement learning with temporal logic constraints | Mar 5, 2015 | Motion PlanningQ-Learning | —Unverified | 0 | 0 |
| Correlated Deep Q-learning based Microgrid Energy Management | Mar 6, 2021 | energy managementManagement | —Unverified | 0 | 0 |
| Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning | Nov 28, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Coverage Analysis for Digital Cousin Selection -- Improving Multi-Environment Q-Learning | Nov 13, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network Optimization | Aug 29, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Coverage-aware and Reinforcement Learning Using Multi-agent Approach for HD Map QoS in a Realistic Environment | Jul 19, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Credit Assignment: Challenges and Opportunities in Developing Human-like AI Agents | Jul 16, 2023 | Learning TheoryQ-Learning | —Unverified | 0 | 0 |
| Credit-cognisant reinforcement learning for multi-agent cooperation | Nov 18, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning | Jan 13, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Cross Learning in Deep Q-Networks | Sep 29, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Curriculum Q-Learning for Visual Vocabulary Acquisition | Nov 29, 2017 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Cycles and collusion in congestion games under Q-learning | Feb 26, 2025 | Q-Learning | —Unverified | 0 | 0 |
| DASA: Delay-Adaptive Multi-Agent Stochastic Approximation | Mar 25, 2024 | AvgQ-Learning | —Unverified | 0 | 0 |
| Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients | Dec 30, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Data-Driven H-infinity Control with a Real-Time and Efficient Reinforcement Learning Algorithm: An Application to Autonomous Mobility-on-Demand Systems | Sep 16, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning | Jan 14, 2025 | BenchmarkingManagement | —Unverified | 0 | 0 |