| Advancing ECG Diagnosis Using Reinforcement Learning on Global Waveform Variations Related to P Wave and PR Interval | Jan 10, 2024 | Q-LearningRhythm | —Unverified | 0 | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | Jul 19, 2021 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Constrained Model-Free Reinforcement Learning for Process Optimization | Nov 16, 2020 | modelModel Predictive Control | —Unverified | 0 | 0 |
| AoI Minimization in Status Update Control with Energy Harvesting Sensors | Sep 9, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation | Jan 25, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| CoNSoLe: Convex Neural Symbolic Learning | Jun 1, 2022 | Q-Learning | —Unverified | 0 | 0 |
| Anypath Routing Protocol Design via Q-Learning for Underwater Sensor Networks | Feb 22, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models | Nov 9, 2023 | Algorithmic TradingQ-Learning | —Unverified | 0 | 0 |
| Accelerating Goal-Directed Reinforcement Learning by Model Characterization | Jan 4, 2019 | modelModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Multi-Objective Deep Reinforcement Learning for Optimisation in Autonomous Systems | Aug 2, 2024 | Deep Reinforcement LearningMulti-Objective Reinforcement Learning | —Unverified | 0 | 0 |
| Feature-Based Q-Learning for Two-Player Stochastic Games | Jun 2, 2019 | Q-LearningVocal Bursts Valence Prediction | —Unverified | 0 | 0 |
| A Reinforcement Learning Perspective on the Optimal Control of Mutation Probabilities for the (1+1) Evolutionary Algorithm: First Results on the OneMax Problem | May 9, 2019 | Evolutionary AlgorithmsQ-Learning | —Unverified | 0 | 0 |
| An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models | May 9, 2024 | Hierarchical Reinforcement LearningManagement | —Unverified | 0 | 0 |
| Consecutive Task-oriented Dialog Policy Learning | Nov 16, 2021 | Continual LearningManagement | —Unverified | 0 | 0 |
| A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning | Oct 9, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Configuring Transmission Thresholds in IIoT Alarm Scenarios for Energy-Efficient Event Reporting | Jul 4, 2024 | Q-LearningScheduling | —Unverified | 0 | 0 |
| A Novel Resource Allocation for Anti-jamming in Cognitive-UAVs: an Active Inference Approach | Aug 10, 2022 | Bayesian InferenceQ-Learning | —Unverified | 0 | 0 |
| Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper | Jun 29, 2020 | OpenAI GymQ-Learning | —Unverified | 0 | 0 |
| Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise | Mar 28, 2023 | Q-Learning | —Unverified | 0 | 0 |
| A Novel Reinforcement Learning Model for Post-Incident Malware Investigations | Oct 19, 2024 | Malware DetectionQ-Learning | —Unverified | 0 | 0 |
| Active Deep Q-learning with Demonstration | Dec 6, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Concentration of Contractive Stochastic Approximation and Reinforcement Learning | Jun 27, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Concentration bounds for SSP Q-learning for average cost MDPs | Jun 7, 2022 | Q-Learning | —Unverified | 0 | 0 |
| A Novel Multi-Objective Reinforcement Learning Algorithm for Pursuit-Evasion Game | Mar 9, 2025 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications | Feb 2, 2025 | counterfactualPolicy Gradient Methods | —Unverified | 0 | 0 |
| Computation Offloading for Uncertain Marine Tasks by Cooperation of UAVs and Vessels | Feb 13, 2023 | Q-Learning | —Unverified | 0 | 0 |
| A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments | Jul 2, 2021 | Deep Reinforcement LearningPrediction | —Unverified | 0 | 0 |
| Compressive Features in Offline Reinforcement Learning for Recommender Systems | Nov 16, 2021 | Q-LearningRecommendation Systems | —Unverified | 0 | 0 |
| A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle | Mar 22, 2022 | Q-Learning | —Unverified | 0 | 0 |
| An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems | Jun 9, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint | Feb 24, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Accelerated Value Iteration via Anderson Mixing | Sep 27, 2018 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Compositional Reinforcement Learning for Discrete-Time Stochastic Control Systems | Aug 6, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Comparing NARS and Reinforcement Learning: An Analysis of ONA and Q-Learning Algorithms | Mar 17, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Comparative Study of Q-Learning and NeuroEvolution of Augmenting Topologies for Self Driving Agents | Sep 19, 2022 | Autonomous DrivingEvolutionary Algorithms | —Unverified | 0 | 0 |
| An Optimal Online Method of Selecting Source Policies for Reinforcement Learning | Sep 24, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms | Mar 27, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Comparative Analysis of Multi-Agent Reinforcement Learning Policies for Crop Planning Decision Support | Dec 3, 2024 | Computational EfficiencyFairness | —Unverified | 0 | 0 |
| A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms | Feb 20, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Combining Q-Learning and Search with Amortized Value Estimates | Dec 5, 2019 | Q-Learning | —Unverified | 0 | 0 |
| Combining policy gradient and Q-learning | Nov 5, 2016 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Anomaly Detection via Learning-Based Sequential Controlled Sensing | Nov 30, 2023 | Anomaly DetectionDecision Making | —Unverified | 0 | 0 |
| Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query | Jun 24, 2023 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| An MDP Model for Censoring in Harvesting Sensors: Optimal and Approximated Solutions | Feb 2, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear | Nov 3, 2016 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| A Differentiable Physics Engine for Deep Learning in Robotics | Nov 5, 2016 | CPUDeep Learning | —Unverified | 0 | 0 |
| Collaborative Deep Reinforcement Learning for Joint Object Search | Feb 18, 2017 | Active Object LocalizationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking | Feb 19, 2024 | Q-LearningScheduling | —Unverified | 0 | 0 |
| C-Learning: Learning to Achieve Goals via Recursive Classification | Nov 17, 2020 | ClassificationDensity Estimation | —Unverified | 0 | 0 |
| An Independent Study of Reinforcement Learning and Autonomous Driving | Aug 20, 2021 | Autonomous DrivingOpenAI Gym | —Unverified | 0 | 0 |