| Blackwell Online Learning for Markov Decision Processes | Dec 28, 2020 | Learning TheoryQ-Learning | —Unverified | 0 |
| POPO: Pessimistic Offline Policy Optimization | Dec 26, 2020 | Offline RLQ-Learning | CodeCode Available | 0 |
| Assured RL: Reinforcement Learning with Almost Sure Constraints | Dec 24, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Goal Reasoning by Selecting Subgoals with Deep Q-Learning | Dec 22, 2020 | Q-Learning | —Unverified | 0 |
| Distributed Q-Learning with State Tracking for Multi-agent Networked Control | Dec 22, 2020 | Q-LearningState Estimation | —Unverified | 0 |
| Stabilizing Q Learning Via Soft Mellowmax Operator | Dec 17, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL | Dec 17, 2020 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation | Dec 16, 2020 | counterfactualData Augmentation | —Unverified | 0 |
| Deploying Reinforcement Learning in Water Transport | Dec 14, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Virtual Autonomous Driving with Reinforcement Learning | Dec 14, 2020 | Autonomous DrivingQ-Learning | —Unverified | 0 |
| Semi-Supervised Off Policy Reinforcement Learning | Dec 9, 2020 | ImputationQ-Learning | —Unverified | 0 |
| Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problem | Dec 8, 2020 | Combinatorial OptimizationQ-Learning | CodeCode Available | 1 |
| Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation | Dec 7, 2020 | Domain AdaptationQ-Learning | —Unverified | 0 |
| Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways | Dec 6, 2020 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Hippocampal representations emerge when training recurrent neural networks on a memory dependent maze navigation task | Dec 2, 2020 | HippocampusQ-Learning | —Unverified | 0 |
| Self-correcting Q-Learning | Dec 2, 2020 | Q-Learning | —Unverified | 0 |
| Agnostic Q-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity | Dec 1, 2020 | Q-Learning | —Unverified | 0 |
| A new convergent variant of Q-learning with linear function approximation | Dec 1, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Dec 1, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms | Dec 1, 2020 | Q-Learning | —Unverified | 0 |
| Robust Multi-Agent Reinforcement Learning with Model Uncertainty | Dec 1, 2020 | modelMulti-agent Reinforcement Learning | —Unverified | 0 |
| Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? | Dec 1, 2020 | Feature EngineeringQ-Learning | CodeCode Available | 1 |
| Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles | Nov 30, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Real-time Active Vision for a Humanoid Soccer Robot Using Deep Reinforcement Learning | Nov 27, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning-based Joint Path and Energy Optimization of Cellular-Connected Unmanned Aerial Vehicles | Nov 27, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation | Nov 25, 2020 | Imitation LearningQ-Learning | —Unverified | 0 |
| Solving The Lunar Lander Problem under Uncertainty using Reinforcement Learning | Nov 24, 2020 | NavigateQ-Learning | CodeCode Available | 0 |
| Learning Principle of Least Action with Reinforcement Learning | Nov 24, 2020 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Multi-Agent Reinforcement Learning for Markov Routing Games: A New Modeling Paradigm For Dynamic Traffic Assignment | Nov 22, 2020 | Autonomous VehiclesBilevel Optimization | —Unverified | 0 |
| Provable Multi-Objective Reinforcement Learning with Generative Models | Nov 19, 2020 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 |
| Adaptive Contention Window Design using Deep Q-learning | Nov 18, 2020 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| C-Learning: Learning to Achieve Goals via Recursive Classification | Nov 17, 2020 | ClassificationDensity Estimation | —Unverified | 0 |
| Constrained Model-Free Reinforcement Learning for Process Optimization | Nov 16, 2020 | modelModel Predictive Control | —Unverified | 0 |
| A deep Q-Learning based Path Planning and Navigation System for Firefighting Environments | Nov 12, 2020 | Q-Learning | —Unverified | 0 |
| On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension | Nov 11, 2020 | Matrix CompletionQ-Learning | —Unverified | 0 |
| Reinforced Deep Markov Models With Applications in Automatic Trading | Nov 9, 2020 | Q-Learning | —Unverified | 0 |
| Multi-Agent Reinforcement Learning for Channel Assignment and Power Allocation in Platoon-Based C-V2X Systems | Nov 9, 2020 | Autonomous VehiclesMulti-agent Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning for Assignment problem | Nov 8, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities | Nov 5, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Control with adaptive Q-learning | Nov 3, 2020 | OpenAI GymQ-Learning | CodeCode Available | 0 |
| Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings | Oct 29, 2020 | Change Point DetectionOff-policy evaluation | CodeCode Available | 0 |
| DeepFoldit -- A Deep Reinforcement Learning Neural Network Folding Proteins | Oct 28, 2020 | Deep Reinforcement LearningProtein Structure Prediction | —Unverified | 0 |
| Finite-Time Convergence Rates of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning | Oct 28, 2020 | Multi-Task LearningQ-Learning | —Unverified | 0 |
| Learning Time Reduction Using Warm Start Methods for a Reinforcement Learning Based Supervisory Control in Hybrid Electric Vehicle Applications | Oct 27, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Energy Consumption and Battery Aging Minimization Using a Q-learning Strategy for a Battery/Ultracapacitor Electric Vehicle | Oct 27, 2020 | energy managementManagement | —Unverified | 0 |
| Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls | Oct 27, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Energy and Service-priority aware Trajectory Design for UAV-BSs using Double Q-Learning | Oct 26, 2020 | Q-Learning | —Unverified | 0 |
| Enhancing reinforcement learning by a finite reward response filter with a case study in intelligent structural control | Oct 25, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| An Adiabatic Theorem for Policy Tracking with TD-learning | Oct 24, 2020 | Q-Learning | —Unverified | 0 |
| Learning Guidance Rewards with Trajectory-space Smoothing | Oct 23, 2020 | AttributeDeep Reinforcement Learning | CodeCode Available | 1 |