| Dropout Q-Functions for Doubly Efficient Reinforcement Learning | Oct 5, 2021 | Computational EfficiencyQ-Learning | CodeCode Available | 1 |
| Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble | Oct 4, 2021 | Adroid door-clonedAdroid door-human | CodeCode Available | 1 |
| A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes | Oct 4, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Cellular traffic offloading via Opportunistic Networking with Reinforcement Learning | Oct 1, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Learning the Markov Decision Process in the Sparse Gaussian Elimination | Sep 30, 2021 | Combinatorial OptimizationQ-Learning | CodeCode Available | 1 |
| Learning Explicit Credit Assignment for Multi-agent Joint Q-learning | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| Polyphonic Music Composition: An Adversarial Inverse Reinforcement Learning Approach | Sep 29, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Q-Learning Scheduler for Multi-Task Learning through the use of Histogram of Task Uncertainty | Sep 29, 2021 | Multi-Task LearningQ-Learning | —Unverified | 0 |
| Unifying Top-down and Bottom-up for Recurrent Visual Attention | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| Value Refinement Network (VRN) | Sep 29, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis | Sep 29, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| An Attempt to Model Human Trust with Reinforcement Learning | Sep 29, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| Robust and Data-efficient Q-learning by Composite Value-estimation | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| ^2-exploration for Reinforcement Learning | Sep 29, 2021 | General Reinforcement LearningQ-Learning | —Unverified | 0 |
| Bootstrapped Hindsight Experience replay with Counterintuitive Prioritization | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | Sep 29, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Offline Reinforcement Learning with In-sample Q-Learning | Sep 29, 2021 | D4RLOffline RL | CodeCode Available | 1 |
| Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration | Sep 29, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Towards Unknown-aware Deep Q-Learning | Sep 29, 2021 | Deep Reinforcement LearningOut of Distribution (OOD) Detection | —Unverified | 0 |
| Q-learning for real time control of heterogeneous microagent collectives | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| Convergent and Efficient Deep Q Learning Algorithm | Sep 29, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Density Estimation for Conservative Q-Learning | Sep 29, 2021 | Density EstimationQ-Learning | —Unverified | 0 |
| Text Generation with Efficient (Soft) Q-Learning | Sep 29, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Untangling Braids with Multi-agent Q-Learning | Sep 29, 2021 | OpenAI GymQ-Learning | —Unverified | 0 |
| Online Robust Reinforcement Learning with Model Uncertainty | Sep 29, 2021 | modelQ-Learning | —Unverified | 0 |
| Deep Reinforcement Q-Learning for Intelligent Traffic Signal Control with Partial Detection | Sep 29, 2021 | Q-LearningTraffic Signal Control | CodeCode Available | 1 |
| On the Estimation Bias in Double Q-Learning | Sep 29, 2021 | Q-LearningValue prediction | CodeCode Available | 0 |
| Deep Reinforcement Learning with Adjustments | Sep 28, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Smart Home Energy Management: Sequence-to-Sequence Load Forecasting and Q-Learning | Sep 25, 2021 | energy managementLoad Forecasting | —Unverified | 0 |
| Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients | Sep 24, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods | Sep 22, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning | Sep 22, 2021 | Deep Reinforcement LearningGaussian Processes | —Unverified | 0 |
| Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands | Sep 21, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| Search For Deep Graph Neural Networks | Sep 21, 2021 | DiversityQ-Learning | —Unverified | 0 |
| Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning | Sep 19, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Regularize! Don't Mix: Multi-Agent Reinforcement Learning without Explicit Centralized Structures | Sep 19, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing | Sep 16, 2021 | FairnessManagement | —Unverified | 0 |
| Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback | Sep 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning | Sep 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep hierarchical reinforcement agents for automated penetration testing | Sep 14, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Bootstrapped Meta-Learning | Sep 9, 2021 | Efficient ExplorationFew-Shot Learning | CodeCode Available | 0 |
| User Tampering in Reinforcement Learning Recommender Systems | Sep 9, 2021 | Q-LearningRecommendation Systems | —Unverified | 0 |
| Deep Active Inference for Pixel-Based Discrete Control: Evaluation on the Car Racing Problem | Sep 9, 2021 | Car RacingQ-Learning | CodeCode Available | 0 |
| Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning | Sep 8, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep SIMBAD: Active Landmark-based Self-localization Using Ranking -based Scene Descriptor | Sep 6, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Learning-Based Strategy Design for Robot-Assisted Reminiscence Therapy Based on a Developed Model for People with Dementia | Sep 6, 2021 | Q-Learning | —Unverified | 0 |
| Event-Based Communication in Distributed Q-Learning | Sep 3, 2021 | Q-Learning | —Unverified | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Dynamic Band Switch in Cellular-Connected UAV | Aug 26, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection | Aug 24, 2021 | Q-Learning | —Unverified | 0 |