| An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking | Feb 19, 2024 | Q-Learning, Scheduling | —Unverified | 0 |
| An Independent Study of Reinforcement Learning and Autonomous Driving | Aug 20, 2021 | Autonomous Driving, OpenAI Gym | —Unverified | 0 |
| Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy | Jul 4, 2024 | Q-Learning | —Unverified | 0 |
| Control-Tutored Reinforcement Learning: an application to the Herding Problem | Nov 26, 2019 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing | Oct 5, 2021 | Deep Reinforcement Learning, Fairness | —Unverified | 0 |
| An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning | May 10, 2020 | L2 Regularization, OpenAI Gym | —Unverified | 0 |
| Action Learning for 3D Point Cloud Based Organ Segmentation | Jun 14, 2018 | Organ Segmentation, Q-Learning | —Unverified | 0 |
| An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation | May 25, 2022 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| A new multilayer optical film optimal method based on deep q-learning | Dec 7, 2018 | Q-Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control | Nov 25, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Prioritized Sequence Experience Replay | May 25, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| A new convergent variant of Q-learning with linear function approximation | Dec 1, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward | Sep 24, 2020 | Decision Making, Q-Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization | Jul 1, 2024 | Deep Reinforcement Learning, energy management | —Unverified | 0 |
| A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support | May 11, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| CARL-DTN: Context Adaptive Reinforcement Learning based Routing Algorithm in Delay Tolerant Network | May 2, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| A Network Simulation of OTC Markets with Multiple Agents | May 3, 2024 | Q-Learning | —Unverified | 0 |
| Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors | Jul 22, 2018 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts | Feb 29, 2020 | Mixture-of-Experts, OpenAI Gym | —Unverified | 0 |
| A Nesterov's Accelerated quasi-Newton method for Global Routing using Deep Reinforcement Learning | Oct 15, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in Next-gen Networks | Feb 7, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Accelerated Multi-objective Task Learning using Modified Q-learning Algorithm | Sep 2, 2024 | Q-Learning | —Unverified | 0 |
| Can Q-learning solve Multi Armed Bantids? | Oct 21, 2021 | Decision Making, Q-Learning | —Unverified | 0 |
| An Empirical Investigation of Value-Based Multi-objective Reinforcement Learning for Stochastic Environments | Jan 6, 2024 | Multi-Objective Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Can Q-Learning be Improved with Advice? | Oct 25, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning | Aug 23, 2024 | Hallucination, Prompt Engineering | —Unverified | 0 |
| An Elementary Proof that Q-learning Converges Almost Surely | Aug 5, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback | Oct 3, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| RSRM: Reinforcement Symbolic Regression Machine | May 24, 2023 | Math, Q-Learning | —Unverified | 0 |
| CAN ALTQ LEARN FASTER: EXPERIMENTS AND THEORY | Sep 25, 2019 | Atari Games, Q-Learning | —Unverified | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Jun 8, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Dec 1, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| CAQL: Continuous Action Q-Learning | Sep 26, 2019 | continuous-control, Continuous Control | —Unverified | 0 |
| Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach | Sep 11, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems | Dec 6, 2023 | Q-Learning | —Unverified | 0 |
| Catalytic evolution of cooperation in a population with behavioural bimodality | Jun 17, 2024 | Q-Learning | —Unverified | 0 |
| An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS | May 26, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms | Feb 7, 2023 | Q-Learning | —Unverified | 0 |
| Causal Deep Reinforcement Learning Using Observational Data | Nov 28, 2022 | Autonomous Driving, Causal Inference | —Unverified | 0 |
| Causal Mean Field Multi-Agent Reinforcement Learning | Feb 20, 2025 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Caching Placement and Resource Allocation for Cache-Enabling UAV NOMA Networks | Aug 12, 2020 | Q-Learning, Scheduling | —Unverified | 0 |
| Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision | May 1, 2024 | Q-Learning | —Unverified | 0 |
| Cellular traffic offloading via Opportunistic Networking with Reinforcement Learning | Oct 1, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles | Oct 12, 2022 | Autonomous Vehicles, Q-Learning | —Unverified | 0 |
| Challenging On Car Racing Problem from OpenAI gym | Nov 2, 2019 | Car Racing, continuous-control | —Unverified | 0 |
| Channel Estimation via Successive Denoising in MIMO OFDM Systems: A Reinforcement Learning Approach | Jan 25, 2021 | Denoising, Q-Learning | —Unverified | 0 |
| Characterizing the Action-Generalization Gap in Deep Q-Learning | May 11, 2022 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Chemoreception and chemotaxis of a three-sphere swimmer | May 5, 2022 | Q-Learning | —Unverified | 0 |
| Chrome Dino Run using Reinforcement Learning | Aug 15, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Cache-Aided NOMA Mobile Edge Computing: A Reinforcement Learning Approach | Jun 20, 2019 | Edge-computing, Q-Learning | —Unverified | 0 |