| A Discrete-Time Switching System Analysis of Q-learning | Feb 17, 2021 | Q-Learning | —Unverified | 0 |
| Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View | Jul 25, 2022 | Q-Learning | —Unverified | 0 |
| Final Iteration Convergence Bound of Q-Learning: Switching System Approach | May 11, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Finite-Time Convergence Rates of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning | Oct 28, 2020 | Multi-Task LearningQ-Learning | —Unverified | 0 |
| Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach | Jun 9, 2023 | Q-Learning | —Unverified | 0 |
| Finite-Time Analysis of Simultaneous Double Q-learning | Jun 14, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation | Oct 3, 2023 | Multi-Armed BanditsQ-Learning | —Unverified | 0 |
| Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise | Mar 24, 2025 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model | Feb 19, 2024 | modelQ-Learning | —Unverified | 0 |
| Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach | Mar 11, 2024 | Q-Learning | —Unverified | 0 |
| FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations | Sep 28, 2022 | Autonomous DrivingEdge-computing | —Unverified | 0 |
| Fire Threat Detection From Videos with Q-Rough Sets | Jan 21, 2021 | Q-LearningSegmentation | —Unverified | 0 |
| Fitted Q-Learning for Relational Domains | Jun 10, 2020 | Q-Learning | —Unverified | 0 |
| Learning in Discounted-cost and Average-cost Mean-field Games | Dec 31, 2019 | Q-Learning | —Unverified | 0 |
| Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning | Sep 9, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals | Sep 25, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game | Feb 1, 2024 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Forecasting and stabilizing chaotic regimes in two macroeconomic models via artificial intelligence technologies and control methods | Feb 20, 2023 | Decision MakingEvolutionary Algorithms | —Unverified | 0 |
| FPGA Architecture for Deep Learning and its application to Planetary Robotics | Jan 26, 2017 | CPUQ-Learning | —Unverified | 0 |
| FRAC-Q-Learning: A Reinforcement Learning with Boredom Avoidance Processes for Social Robots | Nov 26, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| From r to Q^*: Your Language Model is Secretly a Q-Function | Apr 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Frugal Reinforcement-based Active Learning | Dec 9, 2022 | Active LearningDiversity | —Unverified | 0 |
| Full Gradient Deep Reinforcement Learning for Average-Reward Criterion | Apr 7, 2023 | Deep Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 |
| Functional Stability of Discounted Markov Decision Processes Using Economic MPC Dissipativity Theory | Mar 31, 2022 | Model Predictive ControlQ-Learning | —Unverified | 0 |
| Gap-Dependent Bounds for Federated Q-learning | Feb 5, 2025 | Q-Learning | —Unverified | 0 |
| Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition | Oct 10, 2024 | Q-Learning | —Unverified | 0 |
| Gap-Dependent Bounds for Two-Player Markov Games | Jul 1, 2021 | Q-LearningVocal Bursts Valence Prediction | —Unverified | 0 |
| GenCos' Behaviors Modeling Based on Q Learning Improved by Dichotomy | Aug 4, 2020 | Q-Learning | —Unverified | 0 |
| Generative Multi-Agent Q-Learning for Policy Optimization: Decentralized Wireless Networks | Mar 7, 2025 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Genetic Algorithm enhanced by Deep Reinforcement Learning in parent selection mechanism and mutation : Minimizing makespan in permutation flow shop scheduling problems | Nov 10, 2023 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits | Aug 19, 2024 | Multi-Armed BanditsQ-Learning | —Unverified | 0 |
| G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning | Feb 25, 2020 | ManagementQ-Learning | —Unverified | 0 |
| Goal Reasoning by Selecting Subgoals with Deep Q-Learning | Dec 22, 2020 | Q-Learning | —Unverified | 0 |
| Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning | Sep 6, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| GraMeR: Graph Meta Reinforcement Learning for Multi-Objective Influence Maximization | May 30, 2022 | Computational EfficiencyMarketing | —Unverified | 0 |
| Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery | Mar 8, 2022 | global-optimizationMotion Planning | —Unverified | 0 |
| Graph Exploration for Effective Multi-agent Q-Learning | Apr 19, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Graph Neural Network based Agent in Google Research Football | Apr 23, 2022 | Graph Neural NetworkQ-Learning | —Unverified | 0 |
| Graph Q-Learning for Combinatorial Optimization | Jan 11, 2024 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Greedy-Step Off-Policy Reinforcement Learning | Feb 23, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning | Sep 19, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution | Apr 5, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Guiding Reinforcement Learning Exploration Using Natural Language | Jul 26, 2017 | DecoderMachine Translation | —Unverified | 0 |
| On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension | Nov 11, 2020 | Matrix CompletionQ-Learning | —Unverified | 0 |
| Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time | Dec 23, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration | Sep 13, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search | Nov 1, 2024 | Q-Learning | —Unverified | 0 |
| Hedging of Financial Derivative Contracts via Monte Carlo Tree Search | Feb 11, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hedging using reinforcement learning: Contextual k-Armed Bandit versus Q-learning | Jul 3, 2020 | FrictionQ-Learning | —Unverified | 0 |
| Hidden Incentives for Auto-Induced Distributional Shift | Sep 19, 2020 | BIG-bench Machine LearningMeta-Learning | —Unverified | 0 |