| Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks | Sep 10, 2016 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Equivalence Between Policy Gradients and Soft Q-Learning | Apr 21, 2017 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Deep Transfer Q-Learning for Offline Non-Stationary Reinforcement Learning | Jan 8, 2025 | Decision MakingInductive Learning | —Unverified | 0 |
| Balancing a CartPole System with Reinforcement Learning -- A Tutorial | Jun 8, 2020 | OpenAI GymQ-Learning | —Unverified | 0 |
| C-Learning: Learning to Achieve Goals via Recursive Classification | Nov 17, 2020 | ClassificationDensity Estimation | —Unverified | 0 |
| Evaluating Load Models and Their Impacts on Power Transfer Limits | Aug 7, 2020 | Q-Learning | —Unverified | 0 |
| ShiQ: Bringing back Bellman to LLMs | May 16, 2025 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Evaluation of Reinforcement Learning Techniques for Trading on a Diverse Portfolio | Jun 28, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN | Jul 22, 2024 | Decision MakingQ-Learning | —Unverified | 0 |
| Collaborative Deep Reinforcement Learning for Joint Object Search | Feb 18, 2017 | Active Object LocalizationDeep Reinforcement Learning | —Unverified | 0 |
| Evolution of cooperation in the public goods game with Q-learning | Jul 29, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Evolution of Q Values for Deep Q Learning in Stable Baselines | Apr 24, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets | Nov 3, 2021 | Q-Learning | —Unverified | 0 |
| Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear | Nov 3, 2016 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Deep Surrogate Q-Learning for Autonomous Driving | Oct 21, 2020 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning | Feb 5, 2021 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Experimental Analysis of Reinforcement Learning Techniques for Spectrum Sharing Radar | Jan 6, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples | Jun 28, 2021 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Combining policy gradient and Q-learning | Nov 5, 2016 | Atari GamesQ-Learning | —Unverified | 0 |
| Exploiting Estimation Bias in Clipped Double Q-Learning for Continous Control Reinforcement Learning Tasks | Feb 14, 2024 | Computational Efficiencycontinuous-control | —Unverified | 0 |
| Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework | Jun 11, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Exploration, Exploitation, and Engagement in Multi-Armed Bandits with Abandonment | May 26, 2022 | Multi-Armed BanditsQ-Learning | —Unverified | 0 |
| Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise | Nov 20, 2024 | Q-Learning | —Unverified | 0 |
| Exploration in Knowledge Transfer Utilizing Reinforcement Learning | Jul 15, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Exploration via Epistemic Value Estimation | Mar 7, 2023 | Decision MakingEfficient Exploration | —Unverified | 0 |
| Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning | Jun 5, 2019 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Exploratory Control with Tsallis Entropy for Latent Factor Models | Nov 14, 2022 | Q-Learning | —Unverified | 0 |
| Exploring Competitive and Collusive Behaviors in Algorithmic Pricing with Deep Reinforcement Learning | Mar 14, 2025 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Comparing NARS and Reinforcement Learning: An Analysis of ONA and Q-Learning Algorithms | Mar 17, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation | Oct 3, 2023 | Multi-Armed BanditsQ-Learning | —Unverified | 0 |
| Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model | Feb 19, 2024 | modelQ-Learning | —Unverified | 0 |
| Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator | Jan 30, 2024 | Imitation LearningMuJoCo | —Unverified | 0 |
| Fair Loss: Margin-Aware Reinforcement Learning for Deep Face Recognition | Oct 1, 2019 | Face RecognitionQ-Learning | —Unverified | 0 |
| Fast Adaptive Anti-Jamming Channel Access via Deep Q Learning and Coarse-Grained Spectrum Prediction | Feb 7, 2025 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Fast Block Linear System Solver Using Q-Learning Schduling for Unified Dynamic Power System Simulations | Oct 12, 2021 | Q-LearningScheduling | —Unverified | 0 |
| Fast constraint satisfaction problem and learning-based algorithm for solving Minesweeper | May 10, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| GLSearch: Maximum Common Subgraph Detection via Learning to Search | Feb 8, 2020 | Cloud ComputingGraph Embedding | —Unverified | 0 |
| Faster Deep Q-learning using Neural Episodic Control | Jan 6, 2018 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Faster Non-asymptotic Convergence for Double Q-learning | Dec 1, 2021 | Q-Learning | —Unverified | 0 |
| Faster Q-Learning Algorithms for Restless Bandits | Sep 6, 2024 | Multi-Armed BanditsQ-Learning | —Unverified | 0 |
| Fastest Convergence for Q-learning | Jul 12, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Fast-Fading Channel and Power Optimization of the Magnetic Inductive Cellular Network | Jun 7, 2024 | Q-Learning | —Unverified | 0 |
| Federated Deep Q-Learning and 5G load balancing | Feb 10, 2024 | Q-Learning | —Unverified | 0 |
| Federated Double Deep Q-learning for Joint Delay and Energy Minimization in IoT networks | Apr 2, 2021 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated LearningOffline RL | —Unverified | 0 |
| Federated Q-Learning: Linear Regret Speedup with Low Communication Cost | Dec 22, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost | May 29, 2024 | Q-Learning | —Unverified | 0 |
| Federated Stochastic Approximation under Markov Noise and Heterogeneity: Applications in Reinforcement Learning | Jun 21, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| FedHQL: Federated Heterogeneous Q-Learning | Jan 26, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Fire Threat Detection From Videos with Q-Rough Sets | Jan 21, 2021 | Q-LearningSegmentation | —Unverified | 0 |