| Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration | Sep 29, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | Sep 29, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis | Sep 29, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Density Estimation for Conservative Q-Learning | Sep 29, 2021 | Density EstimationQ-Learning | —Unverified | 0 |
| On the Estimation Bias in Double Q-Learning | Sep 29, 2021 | Q-LearningValue prediction | CodeCode Available | 0 |
| Convergent and Efficient Deep Q Learning Algorithm | Sep 29, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Online Robust Reinforcement Learning with Model Uncertainty | Sep 29, 2021 | modelQ-Learning | —Unverified | 0 |
| Deep Reinforcement Learning with Adjustments | Sep 28, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Smart Home Energy Management: Sequence-to-Sequence Load Forecasting and Q-Learning | Sep 25, 2021 | energy managementLoad Forecasting | —Unverified | 0 |
| Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients | Sep 24, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods | Sep 22, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning | Sep 22, 2021 | Deep Reinforcement LearningGaussian Processes | —Unverified | 0 |
| Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands | Sep 21, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| Search For Deep Graph Neural Networks | Sep 21, 2021 | DiversityQ-Learning | —Unverified | 0 |
| Regularize! Don't Mix: Multi-Agent Reinforcement Learning without Explicit Centralized Structures | Sep 19, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning | Sep 19, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing | Sep 16, 2021 | FairnessManagement | —Unverified | 0 |
| Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning | Sep 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback | Sep 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep hierarchical reinforcement agents for automated penetration testing | Sep 14, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Deep Active Inference for Pixel-Based Discrete Control: Evaluation on the Car Racing Problem | Sep 9, 2021 | Car RacingQ-Learning | CodeCode Available | 0 |
| Bootstrapped Meta-Learning | Sep 9, 2021 | Efficient ExplorationFew-Shot Learning | CodeCode Available | 0 |
| User Tampering in Reinforcement Learning Recommender Systems | Sep 9, 2021 | Q-LearningRecommendation Systems | —Unverified | 0 |
| Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning | Sep 8, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep SIMBAD: Active Landmark-based Self-localization Using Ranking -based Scene Descriptor | Sep 6, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Learning-Based Strategy Design for Robot-Assisted Reminiscence Therapy Based on a Developed Model for People with Dementia | Sep 6, 2021 | Q-Learning | —Unverified | 0 |
| Event-Based Communication in Distributed Q-Learning | Sep 3, 2021 | Q-Learning | —Unverified | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Dynamic Band Switch in Cellular-Connected UAV | Aug 26, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection | Aug 24, 2021 | Q-Learning | —Unverified | 0 |
| An Independent Study of Reinforcement Learning and Autonomous Driving | Aug 20, 2021 | Autonomous DrivingOpenAI Gym | —Unverified | 0 |
| DQ-GAT: Towards Safe and Efficient Autonomous Driving with Deep Q-Learning and Graph Attention Networks | Aug 11, 2021 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Maximizing Influence with Graph Neural Networks | Aug 10, 2021 | Combinatorial OptimizationComputational Efficiency | —Unverified | 0 |
| Modified Double DQN: addressing stability | Aug 9, 2021 | Q-Learning | —Unverified | 0 |
| An Elementary Proof that Q-learning Converges Almost Surely | Aug 5, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Offline Decentralized Multi-Agent Reinforcement Learning | Aug 4, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots | Aug 3, 2021 | Model Predictive ControlMotion Planning | CodeCode Available | 0 |
| A DQN-based Approach to Finding Precise Evidences for Fact Verification | Aug 1, 2021 | Claim VerificationFact Verification | CodeCode Available | 0 |
| Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings | Jul 28, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| A Distributed Intelligence Architecture for B5G Network Automation | Jul 28, 2021 | ManagementQ-Learning | —Unverified | 0 |
| Double Deep Q-learning Based Real-Time Optimization Strategy for Microgrids | Jul 27, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Integrating Deep Learning and Augmented Reality to Enhance Situational Awareness in Firefighting Environments | Jul 23, 2021 | Anomaly Detectionobject-detection | —Unverified | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | Jul 19, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Q-SMASH: Q-Learning-based Self-Adaptation of Human-Centered Internet of Things | Jul 13, 2021 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Transfer Learning in Multi-Agent Reinforcement Learning with Double Q-Networks for Distributed Resource Sharing in V2X Communication | Jul 13, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens | Jul 13, 2021 | Q-Learning | —Unverified | 0 |
| Reinforced Hybrid Genetic Algorithm for the Traveling Salesman Problem | Jul 9, 2021 | DiversityQ-Learning | —Unverified | 0 |
| Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning | Jul 8, 2021 | Hierarchical Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning | Jul 5, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| The Least Restriction for Offline Reinforcement Learning | Jul 5, 2021 | Offline RLQ-Learning | —Unverified | 0 |