| An Independent Study of Reinforcement Learning and Autonomous Driving | Aug 20, 2021 | Autonomous Driving, OpenAI Gym | Unverified | 0 |
| DQ-GAT: Towards Safe and Efficient Autonomous Driving with Deep Q-Learning and Graph Attention Networks | Aug 11, 2021 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| Maximizing Influence with Graph Neural Networks | Aug 10, 2021 | Combinatorial Optimization, Computational Efficiency | Unverified | 0 |
| Modified Double DQN: addressing stability | Aug 9, 2021 | Q-Learning | Unverified | 0 |
| An Elementary Proof that Q-learning Converges Almost Surely | Aug 5, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Offline Decentralized Multi-Agent Reinforcement Learning | Aug 4, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots | Aug 3, 2021 | Model Predictive Control, Motion Planning | Code Available | 0 |
| A DQN-based Approach to Finding Precise Evidences for Fact Verification | Aug 1, 2021 | Claim Verification, Fact Verification | Code Available | 0 |
| A Distributed Intelligence Architecture for B5G Network Automation | Jul 28, 2021 | Management, Q-Learning | Unverified | 0 |
| Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings | Jul 28, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| Double Deep Q-learning Based Real-Time Optimization Strategy for Microgrids | Jul 27, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Integrating Deep Learning and Augmented Reality to Enhance Situational Awareness in Firefighting Environments | Jul 23, 2021 | Anomaly Detection, object-detection | Unverified | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | Jul 19, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens | Jul 13, 2021 | Q-Learning | Unverified | 0 |
| Transfer Learning in Multi-Agent Reinforcement Learning with Double Q-Networks for Distributed Resource Sharing in V2X Communication | Jul 13, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Q-SMASH: Q-Learning-based Self-Adaptation of Human-Centered Internet of Things | Jul 13, 2021 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| Backprop-Free Reinforcement Learning with Active Neural Generative Coding | Jul 10, 2021 | Q-Learning, reinforcement-learning | Code Available | 1 |
| Reinforced Hybrid Genetic Algorithm for the Traveling Salesman Problem | Jul 9, 2021 | Diversity, Q-Learning | Unverified | 0 |
| Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning | Jul 8, 2021 | Hierarchical Reinforcement Learning, Q-Learning | Code Available | 0 |
| Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning | Jul 5, 2021 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| The Least Restriction for Offline Reinforcement Learning | Jul 5, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments | Jul 2, 2021 | Deep Reinforcement Learning, Prediction | Unverified | 0 |
| Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow | Jul 1, 2021 | Decision Making, Marketing | Unverified | 0 |
| Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation | Jul 1, 2021 | Data Augmentation, Q-Learning | Code Available | 1 |
| Distilling Reinforcement Learning Tricks for Video Games | Jul 1, 2021 | Q-Learning, reinforcement-learning | Code Available | 1 |
| Gap-Dependent Bounds for Two-Player Markov Games | Jul 1, 2021 | Q-Learning, Vocal Bursts Valence Prediction | Unverified | 0 |
| Towards self-organized control: Using neural cellular automata to robustly control a cart-pole agent | Jun 29, 2021 | Q-Learning | Code Available | 1 |
| DRILL-- Deep Reinforcement Learning for Refinement Operators in ALC | Jun 29, 2021 | Deep Reinforcement Learning, Knowledge Graphs | Unverified | 0 |
| Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples | Jun 28, 2021 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning | Jun 28, 2021 | Q-Learning | Unverified | 0 |
| Concentration of Contractive Stochastic Approximation and Reinforcement Learning | Jun 27, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning for Mean Field Games, with Applications to Economics | Jun 25, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality | Jun 24, 2021 | Q-Learning | Unverified | 0 |
| Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation | Jun 23, 2021 | Continuous Control, Q-Learning | Code Available | 1 |
| IQ-Learn: Inverse soft-Q Learning for Imitation | Jun 23, 2021 | Atari Games, Continuous Control | Code Available | 1 |
| Q-Learning Lagrange Policies for Multi-Action Restless Bandits | Jun 22, 2021 | Multi-Armed Bandits, Q-Learning | Code Available | 0 |
| Reinforcement Learning for Physical Layer Communications | Jun 22, 2021 | Deep Reinforcement Learning, Multi-Armed Bandits | Code Available | 0 |
| Distributed Heuristic Multi-Agent Path Finding with Communication | Jun 21, 2021 | Multi-Agent Path Finding, Q-Learning | Code Available | 1 |
| Reinforcement Learning for Resource Allocation in Steerable Laser-based Optical Wireless Systems | Jun 21, 2021 | Management, Q-Learning | Unverified | 0 |
| Analytically Tractable Bayesian Deep Q-Learning | Jun 21, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Boosting Offline Reinforcement Learning with Residual Generative Modeling | Jun 19, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| Deep reinforcement learning with automated label extraction from clinical reports accurately classifies 3D MRI brain volumes | Jun 17, 2021 | Classification, Deep Reinforcement Learning | Unverified | 0 |
| A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents | Jun 17, 2021 | Deep Reinforcement Learning, Position | Unverified | 0 |
| A Q-Learning-Based Topology-Aware Routing Protocol for Flying Ad Hoc Networks | Jun 16, 2021 | Q-Learning | Unverified | 0 |
| Unbiased Methods for Multi-Goal Reinforcement Learning | Jun 16, 2021 | Multi-Goal Reinforcement Learning, Q-Learning | Unverified | 0 |
| Efficient (Soft) Q-Learning for Text Generation with Limited Good Data | Jun 14, 2021 | Q-Learning, Reinforcement Learning (RL) | Code Available | 1 |
| TempoRL: Learning When to Act | Jun 9, 2021 | Continuous Control, Q-Learning | Code Available | 1 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement Learning, Offline RL | Code Available | 1 |
| Decentralized Q-Learning in Zero-sum Markov Games | Jun 4, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Bridging the Gap Between Target Networks and Functional Regularization | Jun 4, 2021 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |