| Fitted Q-Learning for Relational Domains | Jun 10, 2020 | Q-Learning | —Unverified | 0 |
| Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints | Jun 10, 2020 | Q-Learning | —Unverified | 0 |
| Multi-Agent Reinforcement Learning in a Realistic Limit Order Book Market Simulation | Jun 10, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning-Based Joint Self-Optimisation Method for the Fuzzy Logic Handover Algorithm in 5G HetNets | Jun 9, 2020 | ClusteringManagement | —Unverified | 0 |
| Balancing a CartPole System with Reinforcement Learning -- A Tutorial | Jun 8, 2020 | OpenAI GymQ-Learning | —Unverified | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Jun 8, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret | Jun 8, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks | Jun 6, 2020 | Q-LearningScheduling | —Unverified | 0 |
| Logical Team Q-learning: An approach towards factored policies in cooperative MARL | Jun 5, 2020 | Q-Learning | —Unverified | 0 |
| A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines | Jun 4, 2020 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction | Jun 4, 2020 | Q-Learning | —Unverified | 0 |
| Hyperparameter optimization with REINFORCE and Transformers | Jun 1, 2020 | BenchmarkingHyperparameter Optimization | —Unverified | 0 |
| Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning | Jun 1, 2020 | Face RecognitionFairness | —Unverified | 0 |
| Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization | May 31, 2020 | counterfactualMulti-agent Reinforcement Learning | —Unverified | 0 |
| Learning-Based Joint User-AP Association and Resource Allocation in Ultra Dense Network | May 28, 2020 | Q-Learning | —Unverified | 0 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| Should artificial agents ask for help in human-robot collaborative problem-solving? | May 25, 2020 | Q-Learning | —Unverified | 0 |
| Deep Reinforcement Learning Based Power Allocation for D2D Network | May 25, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A reinforcement learning based decision support system in textile manufacturing process | May 20, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| Safe Learning for Near Optimal Scheduling | May 19, 2020 | Q-LearningScheduling | —Unverified | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning | May 18, 2020 | Q-Learning | —Unverified | 0 |
| A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions | May 15, 2020 | Q-Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support | May 11, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning | May 10, 2020 | L2 RegularizationOpenAI Gym | —Unverified | 0 |
| Reinforcement Learning for Thermostatically Controlled Loads Control using Modelica and Python | May 9, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach | May 2, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Implementing Inductive bias for different navigation tasks through diverse RNN attrractors | May 1, 2020 | Inductive BiasQ-Learning | —Unverified | 0 |
| Learning Efficient Parameter Server Synchronization Policies for Distributed SGD | May 1, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Whittle index based Q-learning for restless bandits with average reward | Apr 29, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Evolution of Q Values for Deep Q Learning in Stable Baselines | Apr 24, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Learning Dialog Policies from Weak Demonstrations | Apr 23, 2020 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Intelligent Querying for Target Tracking in Camera Networks using Deep Q-Learning with n-Step Bootstrapping | Apr 20, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Energy-Efficient Power Allocation and Q-Learning-Based Relay Selection for Relay-Aided D2D Communication | Apr 20, 2020 | Q-Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Adaptive Learning Systems | Apr 17, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Show Us the Way: Learning to Manage Dialog from Demonstrations | Apr 17, 2020 | dialog state trackingManagement | —Unverified | 0 |
| K-spin Hamiltonian for quantum-resolvable Markov decision processes | Apr 13, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Self Punishment and Reward Backfill for Deep Q-Learning | Apr 10, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Zero-Shot Learning of Text Adventure Games with Sentence-Level Semantics | Apr 6, 2020 | ClusteringQ-Learning | —Unverified | 0 |
| Multi-agent Reinforcement Learning for Resource Allocation in IoT networks with Edge Computing | Apr 5, 2020 | Cloud ComputingDistributed Computing | —Unverified | 0 |
| Minimizing Age-of-Information for Fog Computing-supported Vehicular Networks with Deep Q-learning | Apr 4, 2020 | Autonomous DrivingQ-Learning | —Unverified | 0 |
| Reinforcement Learning for Mixed-Integer Problems Based on MPC | Apr 3, 2020 | Model Predictive ControlQ-Learning | —Unverified | 0 |
| Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality? | Apr 2, 2020 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning | Apr 1, 2020 | NegationQ-Learning | —Unverified | 0 |
| Augmented Q Imitation Learning (AQIL) | Mar 31, 2020 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Enhanced Rolling Horizon Evolution Algorithm with Opponent Model Learning: Results for the Fighting Game AI Competition | Mar 31, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Learning medical triage from clinicians using Deep Q-Learning | Mar 28, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms | Mar 27, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Robust Q-learning | Mar 27, 2020 | Q-Learningregression | —Unverified | 0 |
| Convergence of Recursive Stochastic Algorithms using Wasserstein Divergence | Mar 25, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |