| Learning Time Reduction Using Warm Start Methods for a Reinforcement Learning Based Supervisory Control in Hybrid Electric Vehicle Applications | Oct 27, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Energy Consumption and Battery Aging Minimization Using a Q-learning Strategy for a Battery/Ultracapacitor Electric Vehicle | Oct 27, 2020 | energy managementManagement | —Unverified | 0 |
| Energy and Service-priority aware Trajectory Design for UAV-BSs using Double Q-Learning | Oct 26, 2020 | Q-Learning | —Unverified | 0 |
| Enhancing reinforcement learning by a finite reward response filter with a case study in intelligent structural control | Oct 25, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| An Adiabatic Theorem for Policy Tracking with TD-learning | Oct 24, 2020 | Q-Learning | —Unverified | 0 |
| Stabilizing Transformer-Based Action Sequence Generation For Q-Learning | Oct 23, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Logistic Q-Learning | Oct 21, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Deep Surrogate Q-Learning for Autonomous Driving | Oct 21, 2020 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Reinforcement learning using Deep Q Networks and Q learning accurately localizes brain tumors on MRI with very small training sets | Oct 21, 2020 | Keypoint DetectionQ-Learning | —Unverified | 0 |
| On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality | Oct 21, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Language Inference with Multi-head Automata through Reinforcement Learning | Oct 20, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RLQ-Learning | —Unverified | 0 |
| A Nesterov's Accelerated quasi-Newton method for Global Routing using Deep Reinforcement Learning | Oct 15, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Model-Based Reinforcement Learning for Type 1Diabetes Blood Glucose Control | Oct 13, 2020 | Model-based Reinforcement LearningQ-Learning | —Unverified | 0 |
| Parameterized Reinforcement Learning for Optical System Optimization | Oct 9, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments | Oct 9, 2020 | Incremental LearningQ-Learning | CodeCode Available | 0 |
| Fictitious play in zero-sum stochastic games | Oct 8, 2020 | Q-Learning | —Unverified | 0 |
| Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control | Oct 7, 2020 | Computational EfficiencyQ-Learning | —Unverified | 0 |
| Machine Learning Empowered Trajectory and Passive Beamforming Design in UAV-RIS Wireless Networks | Oct 6, 2020 | BIG-bench Machine LearningQ-Learning | —Unverified | 0 |
| Finite-Time Analysis for Double Q-learning | Sep 29, 2020 | Q-Learning | —Unverified | 0 |
| Cross Learning in Deep Q-Networks | Sep 29, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Jump Q-Evaluation for Offline Policy Evaluation in Continuous Action Space | Sep 28, 2020 | Off-policy evaluationQ-Learning | —Unverified | 0 |
| Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning | Sep 28, 2020 | counterfactualMulti-agent Reinforcement Learning | —Unverified | 0 |
| Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs | Sep 28, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward | Sep 24, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games | Sep 24, 2020 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Is Q-Learning Provably Efficient? An Extended Analysis | Sep 22, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hidden Incentives for Auto-Induced Distributional Shift | Sep 19, 2020 | BIG-bench Machine LearningMeta-Learning | —Unverified | 0 |
| Reinforcement Learning for Dynamic Resource Optimization in 5G Radio Access Network Slicing | Sep 14, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Option Replication and Hedging | Sep 9, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| AoI Minimization in Status Update Control with Energy Harvesting Sensors | Sep 9, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| PAC Reinforcement Learning Algorithm for General-Sum Markov Games | Sep 5, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Using Machine Teaching to Investigate Human Assumptions when Teaching Reinforcement Learners | Sep 5, 2020 | Q-Learning | —Unverified | 0 |
| A Hybrid PAC Reinforcement Learning Algorithm | Sep 5, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation | Sep 1, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Solving the single-track train scheduling problem via Deep Reinforcement Learning | Sep 1, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| Deep Q-Learning: Theoretical Insights from an Asymptotic Analysis | Aug 25, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | Aug 16, 2020 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 |
| Chrome Dino Run using Reinforcement Learning | Aug 15, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Decision-making at Unsignalized Intersection for Autonomous Vehicles: Left-turn Maneuver with Deep Reinforcement Learning | Aug 14, 2020 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Multi-Agent Double Deep Q-Learning for Beamforming in mmWave MIMO Networks | Aug 13, 2020 | Q-Learning | —Unverified | 0 |
| Caching Placement and Resource Allocation for Cache-Enabling UAV NOMA Networks | Aug 12, 2020 | Q-LearningScheduling | —Unverified | 0 |
| Convex Q-Learning, Part 1: Deterministic Optimal Control | Aug 8, 2020 | Q-Learning | —Unverified | 0 |
| Evaluating Load Models and Their Impacts on Power Transfer Limits | Aug 7, 2020 | Q-Learning | —Unverified | 0 |
| Deep Q-Network Based Multi-agent Reinforcement Learning with Binary Action Agents | Aug 6, 2020 | Multi-agent Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| A Comparative Analysis of Deep Reinforcement Learning-enabled Freeway Decision-making for Automated Vehicles | Aug 4, 2020 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| GenCos' Behaviors Modeling Based on Q Learning Improved by Dichotomy | Aug 4, 2020 | Q-Learning | —Unverified | 0 |
| Cooperative Control of Mobile Robots with Stackelberg Learning | Aug 3, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Momentum Q-learning with Finite-Sample Convergence Guarantee | Jul 30, 2020 | Q-Learning | —Unverified | 0 |