| Anomaly Detection via Learning-Based Sequential Controlled Sensing | Nov 30, 2023 | Anomaly Detection, Decision Making | —Unverified | 0 | 0 |
| Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query | Jun 24, 2023 | Atari Games, Decision Making | —Unverified | 0 | 0 |
| Experimental Analysis of Reinforcement Learning Techniques for Spectrum Sharing Radar | Jan 6, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Exclusively Penalized Q-learning for Offline Reinforcement Learning | May 23, 2024 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| An MDP Model for Censoring in Harvesting Sensors: Optimal and Approximated Solutions | Feb 2, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Evolution of Q Values for Deep Q Learning in Stable Baselines | Apr 24, 2020 | Q-Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Evolution of cooperation in the public goods game with Q-learning | Jul 29, 2024 | Decision Making, Imitation Learning | —Unverified | 0 | 0 |
| Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear | Nov 3, 2016 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| A Differentiable Physics Engine for Deep Learning in Robotics | Nov 5, 2016 | CPU, Deep Learning | —Unverified | 0 | 0 |
| Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN | Jul 22, 2024 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| Collaborative Deep Reinforcement Learning for Joint Object Search | Feb 18, 2017 | Active Object Localization, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Evaluation of Reinforcement Learning Techniques for Trading on a Diverse Portfolio | Jun 28, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour | Jul 17, 2025 | Autonomous Navigation, Q-Learning | —Unverified | 0 | 0 |
| An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking | Feb 19, 2024 | Q-Learning, Scheduling | —Unverified | 0 | 0 |
| Evaluating Load Models and Their Impacts on Power Transfer Limits | Aug 7, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning | Feb 5, 2021 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 | 0 |
| Escaping the State of Nature: A Hobbesian Approach to Cooperation in Multi-agent Reinforcement Learning | Jun 5, 2019 | Multi-agent Reinforcement Learning, Philosophy | —Unverified | 0 | 0 |
| Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples | Jun 28, 2021 | Deep Reinforcement Learning, Imitation Learning | —Unverified | 0 | 0 |
| Equivariant Offline Reinforcement Learning | Jun 20, 2024 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| C-Learning: Learning to Achieve Goals via Recursive Classification | Nov 17, 2020 | Classification, Density Estimation | —Unverified | 0 | 0 |
| Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework | Jun 11, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Exploration, Exploitation, and Engagement in Multi-Armed Bandits with Abandonment | May 26, 2022 | Multi-Armed Bandits, Q-Learning | —Unverified | 0 | 0 |
| An Independent Study of Reinforcement Learning and Autonomous Driving | Aug 20, 2021 | Autonomous Driving, OpenAI Gym | —Unverified | 0 | 0 |
| Exploration in Knowledge Transfer Utilizing Reinforcement Learning | Jul 15, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Exploration via Epistemic Value Estimation | Mar 7, 2023 | Decision Making, Efficient Exploration | —Unverified | 0 | 0 |
| Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning | Jun 5, 2019 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Exploratory Control with Tsallis Entropy for Latent Factor Models | Nov 14, 2022 | Q-Learning | —Unverified | 0 | 0 |
| Exploring Competitive and Collusive Behaviors in Algorithmic Pricing with Deep Reinforcement Learning | Mar 14, 2025 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Trader without Offline Training | Mar 1, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Action-modulated midbrain dopamine activity arises from distributed control policies | Jul 1, 2022 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Accelerated Target Updates for Q-learning | May 7, 2019 | Atari Games, Q-Learning | —Unverified | 0 | 0 |
| Equivalence Between Policy Gradients and Soft Q-Learning | Apr 21, 2017 | Policy Gradient Methods, Q-Learning | —Unverified | 0 | 0 |
| Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks | Sep 10, 2016 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Fast Adaptive Anti-Jamming Channel Access via Deep Q Learning and Coarse-Grained Spectrum Prediction | Feb 7, 2025 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning | Mar 7, 2023 | Continuous Control, Offline RL | —Unverified | 0 | 0 |
| Chrome Dino Run using Reinforcement Learning | Aug 15, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning | May 18, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Faster Deep Q-learning using Neural Episodic Control | Jan 6, 2018 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Faster Non-asymptotic Convergence for Double Q-learning | Dec 1, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Faster Q-Learning Algorithms for Restless Bandits | Sep 6, 2024 | Multi-Armed Bandits, Q-Learning | —Unverified | 0 | 0 |
| Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model | May 30, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Fast-Fading Channel and Power Optimization of the Magnetic Inductive Cellular Network | Jun 7, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Chemoreception and chemotaxis of a three-sphere swimmer | May 5, 2022 | Q-Learning | —Unverified | 0 | 0 |
| Federated Double Deep Q-learning for Joint Delay and Energy Minimization in IoT networks | Apr 2, 2021 | Deep Reinforcement Learning, Federated Learning | —Unverified | 0 | 0 |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated Learning, Offline RL | —Unverified | 0 | 0 |
| Federated Q-Learning: Linear Regret Speedup with Low Communication Cost | Dec 22, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost | May 29, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Federated Stochastic Approximation under Markov Noise and Heterogeneity: Applications in Reinforcement Learning | Jun 21, 2022 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| FedHQL: Federated Heterogeneous Q-Learning | Jan 26, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Ensemble Bootstrapping for Q-Learning | Feb 28, 2021 | Atari Games, Q-Learning | —Unverified | 0 | 0 |