| SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems | Apr 4, 2021 | Combinatorial Optimization, Decision Making | Unverified | 0 |
| Federated Double Deep Q-learning for Joint Delay and Energy Minimization in IoT networks | Apr 2, 2021 | Deep Reinforcement Learning, Federated Learning | Unverified | 0 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Mar 22, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Reinforcement Learning based on Scenario-tree MPC for ASVs | Mar 22, 2021 | Model Predictive Control, Point Tracking | Unverified | 0 |
| Variational quantum compiling with double Q-learning | Mar 22, 2021 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability | Mar 22, 2021 | Q-Learning | Unverified | 0 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | Mar 10, 2021 | Autonomous Driving, D4RL | Unverified | 0 |
| A Jointly Optimal Design of Control and Scheduling in Networked Systems under Denial-of-Service Attacks | Mar 10, 2021 | Q-Learning, Scheduling | Unverified | 0 |
| The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning | Mar 7, 2021 | Q-Learning, Transfer Learning | Unverified | 0 |
| Decentralized Microgrid Energy Management: A Multi-agent Correlated Q-learning Approach | Mar 6, 2021 | energy management, energy trading | Unverified | 0 |
| Correlated Deep Q-learning based Microgrid Energy Management | Mar 6, 2021 | energy management, Management | Unverified | 0 |
| UCB Momentum Q-learning: Correcting the bias without forgetting | Mar 1, 2021 | Q-Learning | Code Available | 0 |
| Ensemble Bootstrapping for Q-Learning | Feb 28, 2021 | Atari Games, Q-Learning | Unverified | 0 |
| Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach | Feb 26, 2021 | Hierarchical Reinforcement Learning, Q-Learning | Unverified | 0 |
| Reinforcement learning approach for resource allocation in humanitarian logistics | Feb 25, 2021 | Humanitarian, Q-Learning | Unverified | 0 |
| No-Regret Reinforcement Learning with Heavy-Tailed Rewards | Feb 25, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments | Feb 24, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 |
| Sequential Learning-based IaaS Composition | Feb 24, 2021 | Clustering, Q-Learning | Unverified | 0 |
| Greedy-Step Off-Policy Reinforcement Learning | Feb 23, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Understanding algorithmic collusion with experience replay | Feb 18, 2021 | Q-Learning | Code Available | 0 |
| A Discrete-Time Switching System Analysis of Q-learning | Feb 17, 2021 | Q-Learning | Unverified | 0 |
| Cooperation and Reputation Dynamics with Reinforcement Learning | Feb 15, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reversible Action Design for Combinatorial Optimization with Reinforcement Learning | Feb 14, 2021 | Combinatorial Optimization, Q-Learning | Unverified | 0 |
| Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis | Feb 12, 2021 | Natural Questions, Q-Learning | Unverified | 0 |
| Hedging of Financial Derivative Contracts via Monte Carlo Tree Search | Feb 11, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States | Feb 10, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Model-Augmented Q-learning | Feb 7, 2021 | model, Q-Learning | Unverified | 0 |
| Revisiting Prioritized Experience Replay: A Value Perspective | Feb 5, 2021 | Atari Games, Q-Learning | Code Available | 0 |
| Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning | Feb 5, 2021 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| A review of motion planning algorithms for intelligent robotics | Feb 4, 2021 | Motion Planning, Q-Learning | Unverified | 0 |
| Deep reinforcement learning-based image classification achieves perfect testing set accuracy for MRI brain tumors with a training set of only 30 images | Feb 4, 2021 | Classification, Deep Reinforcement Learning | Unverified | 0 |
| A step toward a reinforcement learning de novo genome assembler | Feb 2, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants | Feb 2, 2021 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| QoS-Aware Power Minimization of Distributed Many-Core Servers using Transfer Q-Learning | Feb 2, 2021 | Q-Learning | Unverified | 0 |
| Variation-resistant Q-learning: Controlling and Utilizing Estimation Bias in Reinforcement Learning for Better Performance | Feb 1, 2021 | Q-Learning, reinforcement-learning | Code Available | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision Making, Q-Learning | Unverified | 0 |
| Reinforcement Learning based Per-antenna Discrete Power Control for Massive MIMO Systems | Jan 28, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning Assisted Beamforming for Inter-cell Interference Mitigation in 5G Massive MIMO Networks | Jan 27, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Robust Android Malware Detection System against Adversarial Attacks using Q-Learning | Jan 27, 2021 | Adversarial Defense, Android Malware Detection | Unverified | 0 |
| Channel Estimation via Successive Denoising in MIMO OFDM Systems: A Reinforcement Learning Approach | Jan 25, 2021 | Denoising, Q-Learning | Unverified | 0 |
| Solving optimal stopping problems with Deep Q-Learning | Jan 24, 2021 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Fire Threat Detection From Videos with Q-Rough Sets | Jan 21, 2021 | Q-Learning, Segmentation | Unverified | 0 |
| Breaking the Deadly Triad with a Target Network | Jan 21, 2021 | Q-Learning | Unverified | 0 |
| Reinforcement learning based recommender systems: A survey | Jan 15, 2021 | Collaborative Filtering, Deep Reinforcement Learning | Unverified | 0 |
| Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time Systems | Jan 13, 2021 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Learning Augmented Index Policy for Optimal Service Placement at the Network Edge | Jan 10, 2021 | Q-Learning | Unverified | 0 |
| Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for MANETs | Jan 9, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Safe Coupled Deep Q-Learning for Recommendation Systems | Jan 8, 2021 | Q-Learning, Recommendation Systems | Unverified | 0 |
| Preventing Value Function Collapse in Ensemble Q-Learning by Maximizing Representation Diversity | Jan 1, 2021 | Diversity, Q-Learning | Unverified | 0 |
| Success-Rate Targeted Reinforcement Learning by Disorientation Penalty | Jan 1, 2021 | Decision Making, Q-Learning | Unverified | 0 |