| Runtime Adaptation in Wireless Sensor Nodes Using Structured Learning | Jun 15, 2020 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Self-Imitation Learning via Generalized Lower Bound Q-learning | Jun 12, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| Safety-guaranteed Reinforcement Learning based on Multi-class Support Vector Machine | Jun 12, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Human and Multi-Agent collaboration in a human-MARL teaming framework | Jun 12, 2020 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning for Neural Control | Jun 12, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Decorrelated Double Q-learning | Jun 12, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework | Jun 11, 2020 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Zeroth-Order Supervised Policy Improvement | Jun 11, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| Q-greedyUCB: a New Exploration Policy for Adaptive and Resource-efficient Scheduling | Jun 10, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Privacy-Cost Management in Smart Meters with Mutual Information-Based Reinforcement Learning | Jun 10, 2020 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Multi-Agent Reinforcement Learning in a Realistic Limit Order Book Market Simulation | Jun 10, 2020 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints | Jun 10, 2020 | Q-Learning | Unverified | 0 |
| Fitted Q-Learning for Relational Domains | Jun 10, 2020 | Q-Learning | Unverified | 0 |
| Self-Supervised Reinforcement Learning for Recommender Systems | Jun 10, 2020 | Q-Learning, Recommendation Systems | Unverified | 0 |
| Reinforcement Learning-Based Joint Self-Optimisation Method for the Fuzzy Logic Handover Algorithm in 5G HetNets | Jun 9, 2020 | Clustering, Management | Unverified | 0 |
| Balancing a CartPole System with Reinforcement Learning -- A Tutorial | Jun 8, 2020 | OpenAI Gym, Q-Learning | Unverified | 0 |
| A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret | Jun 8, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Jun 8, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Conservative Q-Learning for Offline Reinforcement Learning | Jun 8, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks | Jun 6, 2020 | Q-Learning, Scheduling | Unverified | 0 |
| Logical Team Q-learning: An approach towards factored policies in cooperative MARL | Jun 5, 2020 | Q-Learning | Unverified | 0 |
| Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction | Jun 4, 2020 | Q-Learning | Unverified | 0 |
| A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines | Jun 4, 2020 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Multi-Agent Determinantal Q-Learning | Jun 2, 2020 | Q-Learning | Code Available | 1 |
| Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning | Jun 1, 2020 | Face Recognition, Fairness | Unverified | 0 |
| Hyperparameter optimization with REINFORCE and Transformers | Jun 1, 2020 | Benchmarking, Hyperparameter Optimization | Unverified | 0 |
| Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization | May 31, 2020 | counterfactual, Multi-agent Reinforcement Learning | Unverified | 0 |
| Learning-Based Joint User-AP Association and Resource Allocation in Ultra Dense Network | May 28, 2020 | Q-Learning | Unverified | 0 |
| Modeling Penetration Testing with Reinforcement Learning Using Capture-the-Flag Challenges: Trade-offs between Model-free Learning and A Priori Knowledge | May 26, 2020 | Q-Learning, reinforcement-learning | Code Available | 1 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning Based Power Allocation for D2D Network | May 25, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Should artificial agents ask for help in human-robot collaborative problem-solving? | May 25, 2020 | Q-Learning | Unverified | 0 |
| A reinforcement learning based decision support system in textile manufacturing process | May 20, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Safe Learning for Near Optimal Scheduling | May 19, 2020 | Q-Learning, Scheduling | Unverified | 0 |
| Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning | May 18, 2020 | Q-Learning | Unverified | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions | May 15, 2020 | Q-Learning | Unverified | 0 |
| A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support | May 11, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning | May 10, 2020 | L2 Regularization, OpenAI Gym | Unverified | 0 |
| Reinforcement Learning for Thermostatically Controlled Loads Control using Modelica and Python | May 9, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach | May 2, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Learning Efficient Parameter Server Synchronization Policies for Distributed SGD | May 1, 2020 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Implementing Inductive bias for different navigation tasks through diverse RNN attrractors | May 1, 2020 | Inductive Bias, Q-Learning | Unverified | 0 |
| Whittle index based Q-learning for restless bandits with average reward | Apr 29, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Evolution of Q Values for Deep Q Learning in Stable Baselines | Apr 24, 2020 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Learning Dialog Policies from Weak Demonstrations | Apr 23, 2020 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Energy-Efficient Power Allocation and Q-Learning-Based Relay Selection for Relay-Aided D2D Communication | Apr 20, 2020 | Q-Learning | Unverified | 0 |
| Intelligent Querying for Target Tracking in Camera Networks using Deep Q-Learning with n-Step Bootstrapping | Apr 20, 2020 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Spatial Action Maps for Mobile Manipulation | Apr 20, 2020 | Q-Learning, Value prediction | Code Available | 1 |
| Deep Reinforcement Learning for Adaptive Learning Systems | Apr 17, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |