| Fictitious play in zero-sum stochastic games | Oct 8, 2020 | Q-Learning | —Unverified | 0 |
| Fidelity-based Probabilistic Q-learning for Control of Quantum Systems | Jun 8, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Final Adaptation Reinforcement Learning for N-Player Games | Nov 29, 2021 | Board GamesQ-Learning | —Unverified | 0 |
| Finding the best design parameters for optical nanostructures using reinforcement learning | Oct 18, 2018 | BIG-bench Machine LearningQ-Learning | —Unverified | 0 |
| Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids | Oct 27, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Finite-Sample Analysis for SARSA with Linear Function Approximation | Feb 6, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Finite Sample Analysis of Average-Reward TD Learning and Q-Learning | Dec 1, 2021 | Q-Learning | —Unverified | 0 |
| Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games | Dec 15, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Balancing Profit, Risk, and Sustainability for Portfolio Management | Jun 6, 2022 | ManagementPortfolio Optimization | —Unverified | 0 |
| Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation | Mar 1, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Finite-Time Analysis for Double Q-learning | Sep 29, 2020 | Q-Learning | —Unverified | 0 |
| Finite-Time Analysis of Asynchronous Stochastic Approximation and Q-Learning | Feb 1, 2020 | Q-Learning | —Unverified | 0 |
| A Discrete-Time Switching System Analysis of Q-learning | Feb 17, 2021 | Q-Learning | —Unverified | 0 |
| Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View | Jul 25, 2022 | Q-Learning | —Unverified | 0 |
| Deep Transfer Q-Learning for Offline Non-Stationary Reinforcement Learning | Jan 8, 2025 | Decision MakingInductive Learning | —Unverified | 0 |
| Finite-Time Convergence Rates of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning | Oct 28, 2020 | Multi-Task LearningQ-Learning | —Unverified | 0 |
| Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach | Jun 9, 2023 | Q-Learning | —Unverified | 0 |
| CoNSoLe: Convex Neural Symbolic Learning | Jun 1, 2022 | Q-Learning | —Unverified | 0 |
| Finite-Time Analysis of Simultaneous Double Q-learning | Jun 14, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation | Oct 3, 2023 | Multi-Armed BanditsQ-Learning | —Unverified | 0 |
| Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise | Mar 24, 2025 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model | Feb 19, 2024 | modelQ-Learning | —Unverified | 0 |
| Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach | Mar 11, 2024 | Q-Learning | —Unverified | 0 |
| FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations | Sep 28, 2022 | Autonomous DrivingEdge-computing | —Unverified | 0 |
| Fire Threat Detection From Videos with Q-Rough Sets | Jan 21, 2021 | Q-LearningSegmentation | —Unverified | 0 |
| Fitted Q-Learning for Relational Domains | Jun 10, 2020 | Q-Learning | —Unverified | 0 |
| Learning in Discounted-cost and Average-cost Mean-field Games | Dec 31, 2019 | Q-Learning | —Unverified | 0 |
| Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning | Sep 9, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Balancing a CartPole System with Reinforcement Learning -- A Tutorial | Jun 8, 2020 | OpenAI GymQ-Learning | —Unverified | 0 |
| ShiQ: Bringing back Bellman to LLMs | May 16, 2025 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals | Sep 25, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game | Feb 1, 2024 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets | Nov 3, 2021 | Q-Learning | —Unverified | 0 |
| Deep Surrogate Q-Learning for Autonomous Driving | Oct 21, 2020 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| FRAC-Q-Learning: A Reinforcement Learning with Boredom Avoidance Processes for Social Robots | Nov 26, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise | Nov 20, 2024 | Q-Learning | —Unverified | 0 |
| From r to Q^*: Your Language Model is Secretly a Q-Function | Apr 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis | Sep 29, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration | Sep 13, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Full Gradient Deep Reinforcement Learning for Average-Reward Criterion | Apr 7, 2023 | Deep Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 |
| Functional Stability of Discounted Markov Decision Processes Using Economic MPC Dissipativity Theory | Mar 31, 2022 | Model Predictive ControlQ-Learning | —Unverified | 0 |
| HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search | Nov 1, 2024 | Q-Learning | —Unverified | 0 |
| Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy | Jul 4, 2024 | Q-Learning | —Unverified | 0 |
| Gap-Dependent Bounds for Federated Q-learning | Feb 5, 2025 | Q-Learning | —Unverified | 0 |
| Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition | Oct 10, 2024 | Q-Learning | —Unverified | 0 |
| Gap-Dependent Bounds for Two-Player Markov Games | Jul 1, 2021 | Q-LearningVocal Bursts Valence Prediction | —Unverified | 0 |
| GenCos' Behaviors Modeling Based on Q Learning Improved by Dichotomy | Aug 4, 2020 | Q-Learning | —Unverified | 0 |
| Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty | Apr 19, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hidden Incentives for Auto-Induced Distributional Shift | Sep 19, 2020 | BIG-bench Machine LearningMeta-Learning | —Unverified | 0 |
| Deep Spectral Q-learning with Application to Mobile Health | Jan 3, 2023 | Q-Learning | —Unverified | 0 |