| Stabilizing Transformer-Based Action Sequence Generation For Q-Learning | Oct 23, 2020 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Deep Surrogate Q-Learning for Autonomous Driving | Oct 21, 2020 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| Reinforcement learning using Deep Q Networks and Q learning accurately localizes brain tumors on MRI with very small training sets | Oct 21, 2020 | Keypoint Detection, Q-Learning | Unverified | 0 |
| On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality | Oct 21, 2020 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Logistic Q-Learning | Oct 21, 2020 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Language Inference with Multi-head Automata through Reinforcement Learning | Oct 20, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RL, Q-Learning | Unverified | 0 |
| Multi-Agent Collaboration via Reward Attribution Decomposition | Oct 16, 2020 | Dota 2, Multi-agent Reinforcement Learning | Code Available | 1 |
| A Nesterov's Accelerated quasi-Newton method for Global Routing using Deep Reinforcement Learning | Oct 15, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Model-Based Reinforcement Learning for Type 1 Diabetes Blood Glucose Control | Oct 13, 2020 | Model-based Reinforcement Learning, Q-Learning | Unverified | 0 |
| Parameterized Reinforcement Learning for Optical System Optimization | Oct 9, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models | Oct 9, 2020 | Deep Reinforcement Learning, Epidemiology | Code Available | 1 |
| Q-learning with Language Model for Edit-based Unsupervised Summarization | Oct 9, 2020 | Abstractive Text Summarization, Decoder | Code Available | 1 |
| Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments | Oct 9, 2020 | Incremental Learning, Q-Learning | Code Available | 0 |
| Fictitious play in zero-sum stochastic games | Oct 8, 2020 | Q-Learning | Unverified | 0 |
| Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control | Oct 7, 2020 | Computational Efficiency, Q-Learning | Unverified | 0 |
| Machine Learning Empowered Trajectory and Passive Beamforming Design in UAV-RIS Wireless Networks | Oct 6, 2020 | BIG-bench Machine Learning, Q-Learning | Unverified | 0 |
| Cross Learning in Deep Q-Networks | Sep 29, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Finite-Time Analysis for Double Q-learning | Sep 29, 2020 | Q-Learning | Unverified | 0 |
| Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning | Sep 28, 2020 | counterfactual, Multi-agent Reinforcement Learning | Unverified | 0 |
| Deep Jump Q-Evaluation for Offline Policy Evaluation in Continuous Action Space | Sep 28, 2020 | Off-policy evaluation, Q-Learning | Unverified | 0 |
| Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs | Sep 28, 2020 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward | Sep 24, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games | Sep 24, 2020 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Is Q-Learning Provably Efficient? An Extended Analysis | Sep 22, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Hidden Incentives for Auto-Induced Distributional Shift | Sep 19, 2020 | BIG-bench Machine Learning, Meta-Learning | Unverified | 0 |
| Energy-based Surprise Minimization for Multi-Agent Value Factorization | Sep 16, 2020 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Reinforcement Learning for Dynamic Resource Optimization in 5G Radio Access Network Slicing | Sep 14, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning for Option Replication and Hedging | Sep 9, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| AoI Minimization in Status Update Control with Energy Harvesting Sensors | Sep 9, 2020 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Deep Active Inference for Partially Observable MDPs | Sep 8, 2020 | Deep Reinforcement Learning, Q-Learning | Code Available | 1 |
| A Hybrid PAC Reinforcement Learning Algorithm | Sep 5, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Using Machine Teaching to Investigate Human Assumptions when Teaching Reinforcement Learners | Sep 5, 2020 | Q-Learning | Unverified | 0 |
| PAC Reinforcement Learning Algorithm for General-Sum Markov Games | Sep 5, 2020 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation | Sep 1, 2020 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Solving the single-track train scheduling problem via Deep Reinforcement Learning | Sep 1, 2020 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Deep Q-Learning: Theoretical Insights from an Asymptotic Analysis | Aug 25, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Table2Charts: Recommending Charts by Learning Shared Table Representations | Aug 24, 2020 | Q-Learning, Recommendation Systems | Code Available | 1 |
| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | Aug 16, 2020 | Multi-agent Reinforcement Learning, Offline RL | Unverified | 0 |
| Chrome Dino Run using Reinforcement Learning | Aug 15, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Decision-making at Unsignalized Intersection for Autonomous Vehicles: Left-turn Maneuver with Deep Reinforcement Learning | Aug 14, 2020 | Autonomous Vehicles, Decision Making | Unverified | 0 |
| Multi-Agent Double Deep Q-Learning for Beamforming in mmWave MIMO Networks | Aug 13, 2020 | Q-Learning | Unverified | 0 |
| Caching Placement and Resource Allocation for Cache-Enabling UAV NOMA Networks | Aug 12, 2020 | Q-Learning, Scheduling | Unverified | 0 |
| Convex Q-Learning, Part 1: Deterministic Optimal Control | Aug 8, 2020 | Q-Learning | Unverified | 0 |
| Evaluating Load Models and Their Impacts on Power Transfer Limits | Aug 7, 2020 | Q-Learning | Unverified | 0 |
| Deep Q-Network Based Multi-agent Reinforcement Learning with Binary Action Agents | Aug 6, 2020 | Multi-agent Reinforcement Learning, OpenAI Gym | Unverified | 0 |
| Robust Deep Reinforcement Learning through Adversarial Loss | Aug 5, 2020 | Adversarial Attack, Atari Games | Code Available | 1 |
| A Comparative Analysis of Deep Reinforcement Learning-enabled Freeway Decision-making for Automated Vehicles | Aug 4, 2020 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| Deep Inverse Q-learning with Constraints | Aug 4, 2020 | Q-Learning | Code Available | 1 |