| CAN ALTQ LEARN FASTER: EXPERIMENTS AND THEORY | Sep 25, 2019 | Atari GamesQ-Learning | —Unverified | 0 |
| C-Learning: Learning to Achieve Goals via Recursive Classification | Nov 17, 2020 | ClassificationDensity Estimation | —Unverified | 0 |
| An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems | Dec 6, 2023 | Q-Learning | —Unverified | 0 |
| Collaborative Deep Reinforcement Learning for Joint Object Search | Feb 18, 2017 | Active Object LocalizationDeep Reinforcement Learning | —Unverified | 0 |
| A Differentiable Physics Engine for Deep Learning in Robotics | Nov 5, 2016 | CPUDeep Learning | —Unverified | 0 |
| Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear | Nov 3, 2016 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| An MDP Model for Censoring in Harvesting Sensors: Optimal and Approximated Solutions | Feb 2, 2025 | Q-Learning | —Unverified | 0 |
| DASA: Delay-Adaptive Multi-Agent Stochastic Approximation | Mar 25, 2024 | AvgQ-Learning | —Unverified | 0 |
| Combining policy gradient and Q-learning | Nov 5, 2016 | Atari GamesQ-Learning | —Unverified | 0 |
| Combining Q-Learning and Search with Amortized Value Estimates | Dec 5, 2019 | Q-Learning | —Unverified | 0 |
| Caching Placement and Resource Allocation for Cache-Enabling UAV NOMA Networks | Aug 12, 2020 | Q-LearningScheduling | —Unverified | 0 |
| Comparative Analysis of Multi-Agent Reinforcement Learning Policies for Crop Planning Decision Support | Dec 3, 2024 | Computational EfficiencyFairness | —Unverified | 0 |
| Comparative Study of Q-Learning and NeuroEvolution of Augmenting Topologies for Self Driving Agents | Sep 19, 2022 | Autonomous DrivingEvolutionary Algorithms | —Unverified | 0 |
| Comparing NARS and Reinforcement Learning: An Analysis of ONA and Q-Learning Algorithms | Mar 17, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Cache-Aided NOMA Mobile Edge Computing: A Reinforcement Learning Approach | Jun 20, 2019 | Edge-computingQ-Learning | —Unverified | 0 |
| Compositional Reinforcement Learning for Discrete-Time Stochastic Control Systems | Aug 6, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems | Jun 9, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint | Feb 24, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Compressive Features in Offline Reinforcement Learning for Recommender Systems | Nov 16, 2021 | Q-LearningRecommendation Systems | —Unverified | 0 |
| A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle | Mar 22, 2022 | Q-Learning | —Unverified | 0 |
| Computation Offloading for Uncertain Marine Tasks by Cooperation of UAVs and Vessels | Feb 13, 2023 | Q-Learning | —Unverified | 0 |
| Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications | Feb 2, 2025 | counterfactualPolicy Gradient Methods | —Unverified | 0 |
| Concentration bounds for SSP Q-learning for average cost MDPs | Jun 7, 2022 | Q-Learning | —Unverified | 0 |
| Concentration of Contractive Stochastic Approximation and Reinforcement Learning | Jun 27, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise | Mar 28, 2023 | Q-Learning | —Unverified | 0 |
| Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper | Jun 29, 2020 | OpenAI GymQ-Learning | —Unverified | 0 |
| Configuring Transmission Thresholds in IIoT Alarm Scenarios for Energy-Efficient Event Reporting | Jul 4, 2024 | Q-LearningScheduling | —Unverified | 0 |
| A Novel Resource Allocation for Anti-jamming in Cognitive-UAVs: an Active Inference Approach | Aug 10, 2022 | Bayesian InferenceQ-Learning | —Unverified | 0 |
| An Efficient and Uncertainty-aware Reinforcement Learning Framework for Quality Assurance in Extrusion Additive Manufacturing | Mar 2, 2025 | Q-LearningUncertainty Quantification | —Unverified | 0 |
| Consecutive Task-oriented Dialog Policy Learning | Nov 16, 2021 | Continual LearningManagement | —Unverified | 0 |
| An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models | May 9, 2024 | Hierarchical Reinforcement LearningManagement | —Unverified | 0 |
| Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning With Iterated Q-Learning | Jun 4, 2025 | Q-Learning | —Unverified | 0 |
| A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation | Nov 26, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation | Jan 25, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Constrained Model-Free Reinforcement Learning for Process Optimization | Nov 16, 2020 | modelModel Predictive Control | —Unverified | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | Jul 19, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Constructing narrative using a generative model and continuous action policies | Sep 1, 2017 | Paraphrase IdentificationQ-Learning | —Unverified | 0 |
| Contextual Conservative Q-Learning for Offline Reinforcement Learning | Jan 3, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens | Jul 13, 2021 | Q-Learning | —Unverified | 0 |
| Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts | Feb 29, 2020 | Mixture-of-ExpertsOpenAI Gym | —Unverified | 0 |
| Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills | Apr 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games | Mar 17, 2025 | Atari GamesQ-Learning | —Unverified | 0 |
| Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis | Sep 29, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning | Oct 9, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Application of Deep Q-Network in Portfolio Management | Mar 13, 2020 | Deep Reinforcement LearningFace Recognition | —Unverified | 0 |
| Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy | Jul 4, 2024 | Q-Learning | —Unverified | 0 |
| Continuous-time q-learning for mean-field control problems | Jun 28, 2023 | Q-Learning | —Unverified | 0 |
| Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty | Apr 19, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| An Attempt to Model Human Trust with Reinforcement Learning | Sep 29, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents | Jun 17, 2021 | Deep Reinforcement LearningPosition | —Unverified | 0 |