| An Independent Study of Reinforcement Learning and Autonomous Driving | Aug 20, 2021 | Autonomous DrivingOpenAI Gym | —Unverified | 0 | 0 |
| Fidelity-based Probabilistic Q-learning for Control of Quantum Systems | Jun 8, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Final Adaptation Reinforcement Learning for N-Player Games | Nov 29, 2021 | Board GamesQ-Learning | —Unverified | 0 | 0 |
| Finding the best design parameters for optical nanostructures using reinforcement learning | Oct 18, 2018 | BIG-bench Machine LearningQ-Learning | —Unverified | 0 | 0 |
| Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids | Oct 27, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Finite-Sample Analysis for SARSA with Linear Function Approximation | Feb 6, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Trader without Offline Training | Mar 1, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games | Dec 15, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Action-modulated midbrain dopamine activity arises from distributed control policies | Jul 1, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation | Mar 1, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Accelerated Target Updates for Q-learning | May 7, 2019 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Finite-Time Analysis of Asynchronous Stochastic Approximation and Q-Learning | Feb 1, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Equivalence Between Policy Gradients and Soft Q-Learning | Apr 21, 2017 | Policy Gradient MethodsQ-Learning | —Unverified | 0 | 0 |
| Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View | Jul 25, 2022 | Q-Learning | —Unverified | 0 | 0 |
| Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks | Sep 10, 2016 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Finite-Time Convergence Rates of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning | Oct 28, 2020 | Multi-Task LearningQ-Learning | —Unverified | 0 | 0 |
| Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning | Mar 7, 2023 | Continuous ControlOffline RL | —Unverified | 0 | 0 |
| Chrome Dino Run using Reinforcement Learning | Aug 15, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Finite-Time Analysis of Simultaneous Double Q-learning | Jun 14, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation | Oct 3, 2023 | Multi-Armed BanditsQ-Learning | —Unverified | 0 | 0 |
| Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise | Mar 24, 2025 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model | Feb 19, 2024 | modelQ-Learning | —Unverified | 0 | 0 |
| Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach | Mar 11, 2024 | Q-Learning | —Unverified | 0 | 0 |
| FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations | Sep 28, 2022 | Autonomous DrivingEdge-computing | —Unverified | 0 | 0 |
| Fire Threat Detection From Videos with Q-Rough Sets | Jan 21, 2021 | Q-LearningSegmentation | —Unverified | 0 | 0 |
| Fitted Q-Learning for Relational Domains | Jun 10, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Learning in Discounted-cost and Average-cost Mean-field Games | Dec 31, 2019 | Q-Learning | —Unverified | 0 | 0 |
| Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning | Sep 9, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning | May 18, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model | May 30, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals | Sep 25, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game | Feb 1, 2024 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Chemoreception and chemotaxis of a three-sphere swimmer | May 5, 2022 | Q-Learning | —Unverified | 0 | 0 |
| FPGA Architecture for Deep Learning and its application to Planetary Robotics | Jan 26, 2017 | CPUQ-Learning | —Unverified | 0 | 0 |
| Ensemble Bootstrapping for Q-Learning | Feb 28, 2021 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Characterizing the Action-Generalization Gap in Deep Q-Learning | May 11, 2022 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| From r to Q^*: Your Language Model is Secretly a Q-Function | Apr 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning | May 10, 2020 | L2 RegularizationOpenAI Gym | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing | Oct 5, 2021 | Deep Reinforcement LearningFairness | —Unverified | 0 | 0 |
| Full Gradient Deep Reinforcement Learning for Average-Reward Criterion | Apr 7, 2023 | Deep Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 | 0 |
| Channel Estimation via Successive Denoising in MIMO OFDM Systems: A Reinforcement Learning Approach | Jan 25, 2021 | DenoisingQ-Learning | —Unverified | 0 | 0 |
| Enhancing reinforcement learning by a finite reward response filter with a case study in intelligent structural control | Oct 25, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Enhancing Q-Learning with Large Language Model Heuristics | May 6, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 | 0 |
| Gap-Dependent Bounds for Federated Q-learning | Feb 5, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition | Oct 10, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Gap-Dependent Bounds for Two-Player Markov Games | Jul 1, 2021 | Q-LearningVocal Bursts Valence Prediction | —Unverified | 0 | 0 |
| GenCos' Behaviors Modeling Based on Q Learning Improved by Dichotomy | Aug 4, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Challenging On Car Racing Problem from OpenAI gym | Nov 2, 2019 | Car Racingcontinuous-control | —Unverified | 0 | 0 |
| An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation | May 25, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Enhancing Classification Performance via Reinforcement Learning for Feature Selection | Mar 9, 2024 | Classificationfeature selection | —Unverified | 0 | 0 |