| Generative Multi-Agent Q-Learning for Policy Optimization: Decentralized Wireless Networks | Mar 7, 2025 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Genetic Algorithm enhanced by Deep Reinforcement Learning in parent selection mechanism and mutation : Minimizing makespan in permutation flow shop scheduling problems | Nov 10, 2023 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control | Dec 11, 2021 | OpenAI GymQ-Learning | —Unverified | 0 |
| GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits | Aug 19, 2024 | Multi-Armed BanditsQ-Learning | —Unverified | 0 |
| G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning | Feb 25, 2020 | ManagementQ-Learning | —Unverified | 0 |
| Control-Tutored Reinforcement Learning: an application to the Herding Problem | Nov 26, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Spectral Q-learning with Application to Mobile Health | Jan 3, 2023 | Q-Learning | —Unverified | 0 |
| Approximate Global Convergence of Independent Learning in Multi-Agent Systems | May 30, 2024 | Q-Learning | —Unverified | 0 |
| Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning | Sep 6, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Deep SIMBAD: Active Landmark-based Self-localization Using Ranking -based Scene Descriptor | Sep 6, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| GraMeR: Graph Meta Reinforcement Learning for Multi-Objective Influence Maximization | May 30, 2022 | Computational EfficiencyMarketing | —Unverified | 0 |
| Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning | Sep 8, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery | Mar 8, 2022 | global-optimizationMotion Planning | —Unverified | 0 |
| Graph Exploration for Effective Multi-agent Q-Learning | Apr 19, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Graph Neural Network based Agent in Google Research Football | Apr 23, 2022 | Graph Neural NetworkQ-Learning | —Unverified | 0 |
| Graph Q-Learning for Combinatorial Optimization | Jan 11, 2024 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Greedy-Step Off-Policy Reinforcement Learning | Feb 23, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning | Sep 19, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Convergent and Efficient Deep Q Learning Algorithm | Sep 29, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Approximate Nash Equilibrium Learning for n-Player Markov Games in Dynamic Pricing | Jul 13, 2022 | Q-Learning | —Unverified | 0 |
| Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution | Apr 5, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Guiding Reinforcement Learning Exploration Using Natural Language | Jul 26, 2017 | DecoderMachine Translation | —Unverified | 0 |
| On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension | Nov 11, 2020 | Matrix CompletionQ-Learning | —Unverified | 0 |
| Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time | Dec 23, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Lifetime Extended Energy Management Strategy for Fuel Cell Hybrid Electric Vehicles via Self-Learning Fuzzy Reinforcement Learning | Feb 13, 2023 | energy managementManagement | —Unverified | 0 |
| Convert Language Model into a Value-based Strategic Planner | May 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration | Sep 13, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Deep Robot Sketching: An application of Deep Q-Learning Networks for human-like sketching | Feb 1, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search | Nov 1, 2024 | Q-Learning | —Unverified | 0 |
| Hedging of Financial Derivative Contracts via Monte Carlo Tree Search | Feb 11, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hedging using reinforcement learning: Contextual k-Armed Bandit versus Q-learning | Jul 3, 2020 | FrictionQ-Learning | —Unverified | 0 |
| Cooperation and Reputation Dynamics with Reinforcement Learning | Feb 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hidden Incentives for Auto-Induced Distributional Shift | Sep 19, 2020 | BIG-bench Machine LearningMeta-Learning | —Unverified | 0 |
| Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process | Sep 17, 2018 | Q-Learning | —Unverified | 0 |
| Hierarchical clustering with deep Q-learning | May 28, 2018 | ClusteringQ-Learning | —Unverified | 0 |
| Cooperative Control of Mobile Robots with Stackelberg Learning | Aug 3, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Hierarchical Deep Q-Learning Based Handover in Wireless Networks with Dual Connectivity | Jan 13, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hierarchical Modular Reinforcement Learning Method and Knowledge Acquisition of State-Action Rule for Multi-target Problem | Apr 8, 2018 | PositionQ-Learning | —Unverified | 0 |
| Cooperative Optimal Output Tracking for Discrete-Time Multiagent Systems: Stabilizing Policy Iteration Frameworks and Analysis | Jan 11, 2025 | Q-Learning | —Unverified | 0 |
| High dimensional precision medicine from patient-derived xenografts | Dec 13, 2019 | Q-LearningVocal Bursts Intensity Prediction | —Unverified | 0 |
| High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning | Dec 9, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Highway Reinforcement Learning | May 28, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hippocampal representations emerge when training recurrent neural networks on a memory dependent maze navigation task | Dec 2, 2020 | HippocampusQ-Learning | —Unverified | 0 |
| How to discretize continuous state-action spaces in Q-learning: A symbolic control approach | Jun 3, 2024 | Q-Learning | —Unverified | 0 |
| Human and Multi-Agent collaboration in a human-MARL teaming framework | Jun 12, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Hybridizing the 1/5-th Success Rule with Q-Learning for Controlling the Mutation Rate of an Evolutionary Algorithm | Jun 19, 2020 | Evolutionary AlgorithmsQ-Learning | —Unverified | 0 |
| Hybrid LLM-DDQN based Joint Optimization of V2I Communication and Autonomous Driving | Oct 11, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Hybrid Policies Using Inverse Rewards for Reinforcement Learning | Sep 27, 2018 | OpenAI GymQ-Learning | —Unverified | 0 |
| Hybrid Q-Learning Applied to Ubiquitous recommender system | Mar 10, 2013 | Q-LearningRecommendation Systems | —Unverified | 0 |
| A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts | Aug 15, 2024 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |