| An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning | May 10, 2020 | L2 Regularization, OpenAI Gym | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing | Oct 5, 2021 | Deep Reinforcement Learning, Fairness | —Unverified | 0 | 0 |
| Channel Estimation via Successive Denoising in MIMO OFDM Systems: A Reinforcement Learning Approach | Jan 25, 2021 | Denoising, Q-Learning | —Unverified | 0 | 0 |
| GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits | Aug 19, 2024 | Multi-Armed Bandits, Q-Learning | —Unverified | 0 | 0 |
| G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning | Feb 25, 2020 | Management, Q-Learning | —Unverified | 0 | 0 |
| Enhancing reinforcement learning by a finite reward response filter with a case study in intelligent structural control | Oct 25, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Enhancing Q-Learning with Large Language Model Heuristics | May 6, 2024 | Decision Making, Language Modeling | —Unverified | 0 | 0 |
| Challenging On Car Racing Problem from OpenAI gym | Nov 2, 2019 | Car Racing, continuous-control | —Unverified | 0 | 0 |
| An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation | May 25, 2022 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Enhancing Classification Performance via Reinforcement Learning for Feature Selection | Mar 9, 2024 | Classification, feature selection | —Unverified | 0 | 0 |
| GraMeR: Graph Meta Reinforcement Learning for Multi-Objective Influence Maximization | May 30, 2022 | Computational Efficiency, Marketing | —Unverified | 0 | 0 |
| Enhancement of High-definition Map Update Service Through Coverage-aware and Reinforcement Learning | Feb 8, 2024 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 | 0 |
| Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery | Mar 8, 2022 | global-optimization, Motion Planning | —Unverified | 0 | 0 |
| Graph Exploration for Effective Multi-agent Q-Learning | Apr 19, 2023 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles | Oct 12, 2022 | Autonomous Vehicles, Q-Learning | —Unverified | 0 | 0 |
| Graph Q-Learning for Combinatorial Optimization | Jan 11, 2024 | Combinatorial Optimization, Decision Making | —Unverified | 0 | 0 |
| Greedy-Step Off-Policy Reinforcement Learning | Feb 23, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning | Sep 19, 2021 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Enhanced Rolling Horizon Evolution Algorithm with Opponent Model Learning: Results for the Fighting Game AI Competition | Mar 31, 2020 | Q-Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Enhanced Q-Learning Approach to Finite-Time Reachability with Maximum Probability for Probabilistic Boolean Control Networks | Dec 12, 2023 | Q-Learning, Transfer Learning | —Unverified | 0 | 0 |
| Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution | Apr 5, 2024 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Guiding Reinforcement Learning Exploration Using Natural Language | Jul 26, 2017 | Decoder, Machine Translation | —Unverified | 0 | 0 |
| On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension | Nov 11, 2020 | Matrix Completion, Q-Learning | —Unverified | 0 | 0 |
| Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time | Dec 23, 2019 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Cellular traffic offloading via Opportunistic Networking with Reinforcement Learning | Oct 1, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A new multilayer optical film optimal method based on deep q-learning | Dec 7, 2018 | Q-Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control | Nov 25, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Action Learning for 3D Point Cloud Based Organ Segmentation | Jun 14, 2018 | Organ Segmentation, Q-Learning | —Unverified | 0 | 0 |
| HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search | Nov 1, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Hedging of Financial Derivative Contracts via Monte Carlo Tree Search | Feb 11, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Hedging using reinforcement learning: Contextual k-Armed Bandit versus Q-learning | Jul 3, 2020 | Friction, Q-Learning | —Unverified | 0 | 0 |
| Enhanced Deep Q-Learning for 2D Self-Driving Cars: Implementation and Evaluation on a Custom Track Environment | Feb 13, 2024 | Q-Learning, Self-Driving Cars | —Unverified | 0 | 0 |
| Energy Sharing for Multiple Sensor Nodes with Finite Buffers | Mar 17, 2015 | Q-Learning | —Unverified | 0 | 0 |
| Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process | Sep 17, 2018 | Q-Learning | —Unverified | 0 | 0 |
| Hierarchical clustering with deep Q-learning | May 28, 2018 | Clustering, Q-Learning | —Unverified | 0 | 0 |
| Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision | May 1, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Hierarchical Deep Q-Learning Based Handover in Wireless Networks with Dual Connectivity | Jan 13, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Hierarchical Modular Reinforcement Learning Method and Knowledge Acquisition of State-Action Rule for Multi-target Problem | Apr 8, 2018 | Position, Q-Learning | —Unverified | 0 | 0 |
| Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization | Jun 24, 2020 | Combinatorial Optimization, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| High dimensional precision medicine from patient-derived xenografts | Dec 13, 2019 | Q-Learning, Vocal Bursts Intensity Prediction | —Unverified | 0 | 0 |
| High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning | Dec 9, 2021 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Highway Reinforcement Learning | May 28, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Hippocampal representations emerge when training recurrent neural networks on a memory dependent maze navigation task | Dec 2, 2020 | Hippocampus, Q-Learning | —Unverified | 0 | 0 |
| How to discretize continuous state-action spaces in Q-learning: A symbolic control approach | Jun 3, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Human and Multi-Agent collaboration in a human-MARL teaming framework | Jun 12, 2020 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Hybridizing the 1/5-th Success Rule with Q-Learning for Controlling the Mutation Rate of an Evolutionary Algorithm | Jun 19, 2020 | Evolutionary Algorithms, Q-Learning | —Unverified | 0 | 0 |
| Energy-Efficient Power Allocation and Q-Learning-Based Relay Selection for Relay-Aided D2D Communication | Apr 20, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Hybrid Policies Using Inverse Rewards for Reinforcement Learning | Sep 27, 2018 | OpenAI Gym, Q-Learning | —Unverified | 0 | 0 |
| Hybrid Q-Learning Applied to Ubiquitous recommender system | Mar 10, 2013 | Q-Learning, Recommendation Systems | —Unverified | 0 | 0 |
| A new convergent variant of Q-learning with linear function approximation | Dec 1, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |