| Final Adaptation Reinforcement Learning for N-Player Games | Nov 29, 2021 | Board GamesQ-Learning | —Unverified | 0 |
| Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning | Nov 28, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Q-Learning based Reinforcement Learning Approach for Network Intrusion Detection | Nov 27, 2021 | Intrusion DetectionNetwork Intrusion Detection | CodeCode Available | 0 |
| Multicrew Scheduling and Routing in Road Network Restoration Based on Deep Q-learning | Nov 24, 2021 | Q-LearningScheduling | —Unverified | 0 |
| Reversible Action Design for Combinatorial Optimization with ReinforcementLearning | Nov 24, 2021 | Combinatorial OptimizationQ-Learning | —Unverified | 0 |
| The Impact of Data Distribution on Q-learning with Function Approximation | Nov 23, 2021 | DiversityQ-Learning | CodeCode Available | 0 |
| Multi-agent Bayesian Deep Reinforcement Learning for Microgrid Energy Management under Communication Failures | Nov 22, 2021 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance | Nov 17, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Consecutive Task-oriented Dialog Policy Learning | Nov 16, 2021 | Continual LearningManagement | —Unverified | 0 |
| Compressive Features in Offline Reinforcement Learning for Recommender Systems | Nov 16, 2021 | Q-LearningRecommendation Systems | —Unverified | 0 |
| Where to Look: A Unified Attention Model for Visual Recognition with Reinforcement Learning | Nov 13, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity | Nov 12, 2021 | Q-LearningQuantization | —Unverified | 0 |
| On Assessing The Safety of Reinforcement Learning algorithms Using Formal Methods | Nov 8, 2021 | Autonomous VehiclesQ-Learning | —Unverified | 0 |
| Supervised Advantage Actor-Critic for Recommender Systems | Nov 5, 2021 | Q-LearningRecommendation Systems | —Unverified | 0 |
| Towards Learning to Speak and Hear Through Multi-Agent Communication over a Continuous Acoustic Channel | Nov 4, 2021 | Language AcquisitionMulti-agent Reinforcement Learning | —Unverified | 0 |
| Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets | Nov 3, 2021 | Q-Learning | —Unverified | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Nov 2, 2021 | D4RLData Augmentation | —Unverified | 0 |
| Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method | Oct 31, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Throughput and Latency in the Distributed Q-Learning Random Access mMTC Networks | Oct 30, 2021 | Q-Learning | —Unverified | 0 |
| Location-routing Optimisation for Urban Logistics Using Mobile Parcel Locker Based on Hybrid Q-Learning Algorithm | Oct 29, 2021 | Q-Learning | —Unverified | 0 |
| Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System | Oct 29, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates | Oct 28, 2021 | Q-LearningScheduling | —Unverified | 0 |
| Cooperative Deep Q-learning Framework for Environments Providing Image Feedback | Oct 28, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids | Oct 27, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL | Oct 27, 2021 | Medical Visual Question AnsweringQ-Learning | —Unverified | 0 |
| Multi-Agent Advisor Q-Learning | Oct 26, 2021 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Automating Control of Overestimation Bias for Reinforcement Learning | Oct 26, 2021 | Continuous ControlQ-Learning | —Unverified | 0 |
| Can Q-Learning be Improved with Advice? | Oct 25, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks | Oct 24, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow | Oct 22, 2021 | Distributed OptimizationQ-Learning | —Unverified | 0 |
| Can Q-learning solve Multi Armed Bantids? | Oct 21, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| Playing 2048 With Reinforcement Learning | Oct 20, 2021 | Playing the Game of 2048Q-Learning | CodeCode Available | 0 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| A Q-Learning-based Approach for Distributed Beam Scheduling in mmWave Networks | Oct 17, 2021 | ManagementQ-Learning | —Unverified | 0 |
| Online Target Q-learning with Reverse Experience Replay: Efficiently finding the Optimal Policy for Linear MDPs | Oct 16, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Value Penalized Q-Learning for Recommender Systems | Oct 15, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication | Oct 14, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games | Oct 12, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning | Oct 12, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Offline Reinforcement Learning with Implicit Q-Learning | Oct 12, 2021 | D4RLOffline RL | CodeCode Available | 1 |
| Fast Block Linear System Solver Using Q-Learning Schduling for Unified Dynamic Power System Simulations | Oct 12, 2021 | Q-LearningScheduling | —Unverified | 0 |
| Urban traffic dynamic rerouting framework: A DRL-based model with fog-cloud architecture | Oct 11, 2021 | Graph AttentionQ-Learning | —Unverified | 0 |
| Navigation In Urban Environments Amongst Pedestrians Using Multi-Objective Deep Reinforcement Learning | Oct 11, 2021 | Autonomous DrivingAutonomous Navigation | —Unverified | 0 |
| A Deep Learning Inference Scheme Based on Pipelined Matrix Multiplication Acceleration Design and Non-uniform Quantization | Oct 10, 2021 | Edge-computingQ-Learning | —Unverified | 0 |
| Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning | Oct 9, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Training Transition Policies via Distribution Matching for Complex Tasks | Oct 8, 2021 | Hierarchical Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations | Oct 6, 2021 | Decision MakingNavigate | —Unverified | 0 |
| A study of first-passage time minimization via Q-learning in heated gridworlds | Oct 5, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing | Oct 5, 2021 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Dropout Q-Functions for Doubly Efficient Reinforcement Learning | Oct 5, 2021 | Computational EfficiencyQ-Learning | CodeCode Available | 1 |