| Smooth Q-learning: Accelerate Convergence of Q-learning Using Similarity | Jun 2, 2021 | Q-Learning | —Unverified | 0 |
| Design and Comparison of Reward Functions in Reinforcement Learning for Energy Management of Sensor Nodes | Jun 2, 2021 | energy managementManagement | —Unverified | 0 |
| Energy-aware optimization of UAV base stations placement via decentralized multi-agent Q-learning | Jun 1, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| A reinforcement learning approach to improve communication performance and energy utilization in fog-based IoT | Jun 1, 2021 | Industrial RobotsQ-Learning | —Unverified | 0 |
| SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | May 31, 2021 | FairnessMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model | May 28, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Reputation Bootstrapping for Composite Services using CP-nets | May 27, 2021 | Q-Learning | —Unverified | 0 |
| A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem | May 25, 2021 | PositionQ-Learning | CodeCode Available | 0 |
| Verification of Dissipativity and Evaluation of Storage Function in Economic Nonlinear MPC using Q-Learning | May 24, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Online Adaptive Optimal Control Algorithm Based on Synchronous Integral Reinforcement Learning With Explorations | May 19, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering | May 19, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization | May 18, 2021 | Atari GamesAutonomous Driving | —Unverified | 0 |
| Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning | May 17, 2021 | Offline RLQ-Learning | CodeCode Available | 1 |
| Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare | May 17, 2021 | Q-Learning | —Unverified | 0 |
| Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems | May 17, 2021 | Q-Learning | —Unverified | 0 |
| Interpretable performance analysis towards offline reinforcement learning: A dataset perspective | May 12, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Fast constraint satisfaction problem and learning-based algorithm for solving Minesweeper | May 10, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| Reinforcement Learning with Expert Trajectory For Quantitative Trading | May 9, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Survey on Multi-Agent Q-Learning frameworks for resource management in wireless sensor network | May 5, 2021 | ManagementQ-Learning | —Unverified | 0 |
| HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation | May 4, 2021 | Bayesian OptimizationQ-Learning | CodeCode Available | 1 |
| Robotic Surgery With Lean Reinforcement Learning | May 3, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks | May 3, 2021 | Q-Learning | CodeCode Available | 0 |
| CARL-DTN: Context Adaptive Reinforcement Learning based Routing Algorithm in Delay Tolerant Network | May 2, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| RP-DQN: An application of Q-Learning to Vehicle Routing Problems | Apr 25, 2021 | BIG-bench Machine LearningQ-Learning | —Unverified | 0 |
| Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks | Apr 21, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems | Apr 21, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Simulated Experiment to Explore Robotic Dialogue Strategies for People with Dementia | Apr 18, 2021 | Q-Learning | —Unverified | 0 |
| Low-rank State-action Value-function Approximation | Apr 18, 2021 | Q-Learning | CodeCode Available | 0 |
| Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills | Apr 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Prospect-theoretic Q-learning | Apr 12, 2021 | Q-Learning | —Unverified | 0 |
| Autoequivariant Network Search via Group Decomposition | Apr 10, 2021 | Inductive BiasNeural Architecture Search | CodeCode Available | 0 |
| Optimal Market Making by Reinforcement Learning | Apr 8, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| Towards Resilience for Multi-Agent QD-Learning | Apr 7, 2021 | AllMulti-agent Reinforcement Learning | —Unverified | 0 |
| Distributed Deep Reinforcement Learning for Collaborative Spectrum Sharing | Apr 6, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems | Apr 4, 2021 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Federated Double Deep Q-learning for Joint Delay and Energy Minimization in IoT networks | Apr 2, 2021 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 |
| Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability | Mar 22, 2021 | Q-Learning | —Unverified | 0 |
| Variational quantum compiling with double Q-learning | Mar 22, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Mar 22, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning based on Scenario-tree MPC for ASVs | Mar 22, 2021 | Model Predictive ControlPoint Tracking | —Unverified | 0 |
| A Jointly Optimal Design of Control and Scheduling in Networked Systems under Denial-of-Service Attacks | Mar 10, 2021 | Q-LearningScheduling | —Unverified | 0 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | Mar 10, 2021 | Autonomous DrivingD4RL | —Unverified | 0 |
| The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning | Mar 7, 2021 | Q-LearningTransfer Learning | —Unverified | 0 |
| Decentralized Microgrid Energy Management: A Multi-agent Correlated Q-learning Approach | Mar 6, 2021 | energy managementenergy trading | —Unverified | 0 |
| Correlated Deep Q-learning based Microgrid Energy Management | Mar 6, 2021 | energy managementManagement | —Unverified | 0 |
| UCB Momentum Q-learning: Correcting the bias without forgetting | Mar 1, 2021 | Q-Learning | CodeCode Available | 0 |
| Ensemble Bootstrapping for Q-Learning | Feb 28, 2021 | Atari GamesQ-Learning | —Unverified | 0 |
| Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach | Feb 26, 2021 | Hierarchical Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement learning approach for resource allocation in humanitarian logistics | Feb 25, 2021 | HumanitarianQ-Learning | —Unverified | 0 |
| No-Regret Reinforcement Learning with Heavy-Tailed Rewards | Feb 25, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |