| A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments | Jul 2, 2021 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| Gap-Dependent Bounds for Two-Player Markov Games | Jul 1, 2021 | Q-LearningVocal Bursts Valence Prediction | —Unverified | 0 |
| Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow | Jul 1, 2021 | Decision MakingMarketing | —Unverified | 0 |
| DRILL-- Deep Reinforcement Learning for Refinement Operators in ALC | Jun 29, 2021 | Deep Reinforcement LearningKnowledge Graphs | —Unverified | 0 |
| Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples | Jun 28, 2021 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning | Jun 28, 2021 | Q-Learning | —Unverified | 0 |
| Concentration of Contractive Stochastic Approximation and Reinforcement Learning | Jun 27, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning for Mean Field Games, with Applications to Economics | Jun 25, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality | Jun 24, 2021 | Q-Learning | —Unverified | 0 |
| Q-Learning Lagrange Policies for Multi-Action Restless Bandits | Jun 22, 2021 | Multi-Armed BanditsQ-Learning | CodeCode Available | 0 |
| Reinforcement Learning for Physical Layer Communications | Jun 22, 2021 | Deep Reinforcement LearningMulti-Armed Bandits | CodeCode Available | 0 |
| Reinforcement Learning for Resource Allocation in Steerable Laser-based Optical Wireless Systems | Jun 21, 2021 | ManagementQ-Learning | —Unverified | 0 |
| Analytically Tractable Bayesian Deep Q-Learning | Jun 21, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Boosting Offline Reinforcement Learning with Residual Generative Modeling | Jun 19, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents | Jun 17, 2021 | Deep Reinforcement LearningPosition | —Unverified | 0 |
| Deep reinforcement learning with automated label extraction from clinical reports accurately classifies 3D MRI brain volumes | Jun 17, 2021 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| A Q-Learning-Based Topology-Aware Routing Protocol for Flying Ad Hoc Networks | Jun 16, 2021 | Q-Learning | —Unverified | 0 |
| Unbiased Methods for Multi-Goal Reinforcement Learning | Jun 16, 2021 | Multi-Goal Reinforcement LearningQ-Learning | —Unverified | 0 |
| Decentralized Q-Learning in Zero-sum Markov Games | Jun 4, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Bridging the Gap Between Target Networks and Functional Regularization | Jun 4, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Design and Comparison of Reward Functions in Reinforcement Learning for Energy Management of Sensor Nodes | Jun 2, 2021 | energy managementManagement | —Unverified | 0 |
| Smooth Q-learning: Accelerate Convergence of Q-learning Using Similarity | Jun 2, 2021 | Q-Learning | —Unverified | 0 |
| Energy-aware optimization of UAV base stations placement via decentralized multi-agent Q-learning | Jun 1, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| A reinforcement learning approach to improve communication performance and energy utilization in fog-based IoT | Jun 1, 2021 | Industrial RobotsQ-Learning | —Unverified | 0 |
| Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model | May 28, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Reputation Bootstrapping for Composite Services using CP-nets | May 27, 2021 | Q-Learning | —Unverified | 0 |
| A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem | May 25, 2021 | PositionQ-Learning | CodeCode Available | 0 |
| Verification of Dissipativity and Evaluation of Storage Function in Economic Nonlinear MPC using Q-Learning | May 24, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering | May 19, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Online Adaptive Optimal Control Algorithm Based on Synchronous Integral Reinforcement Learning With Explorations | May 19, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization | May 18, 2021 | Atari GamesAutonomous Driving | —Unverified | 0 |
| Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems | May 17, 2021 | Q-Learning | —Unverified | 0 |
| Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare | May 17, 2021 | Q-Learning | —Unverified | 0 |
| Interpretable performance analysis towards offline reinforcement learning: A dataset perspective | May 12, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Fast constraint satisfaction problem and learning-based algorithm for solving Minesweeper | May 10, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| Reinforcement Learning with Expert Trajectory For Quantitative Trading | May 9, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Survey on Multi-Agent Q-Learning frameworks for resource management in wireless sensor network | May 5, 2021 | ManagementQ-Learning | —Unverified | 0 |
| Robotic Surgery With Lean Reinforcement Learning | May 3, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks | May 3, 2021 | Q-Learning | CodeCode Available | 0 |
| CARL-DTN: Context Adaptive Reinforcement Learning based Routing Algorithm in Delay Tolerant Network | May 2, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| RP-DQN: An application of Q-Learning to Vehicle Routing Problems | Apr 25, 2021 | BIG-bench Machine LearningQ-Learning | —Unverified | 0 |
| Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks | Apr 21, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems | Apr 21, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Low-rank State-action Value-function Approximation | Apr 18, 2021 | Q-Learning | CodeCode Available | 0 |
| A Simulated Experiment to Explore Robotic Dialogue Strategies for People with Dementia | Apr 18, 2021 | Q-Learning | —Unverified | 0 |
| Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills | Apr 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Prospect-theoretic Q-learning | Apr 12, 2021 | Q-Learning | —Unverified | 0 |
| Autoequivariant Network Search via Group Decomposition | Apr 10, 2021 | Inductive BiasNeural Architecture Search | CodeCode Available | 0 |
| Towards Resilience for Multi-Agent QD-Learning | Apr 7, 2021 | AllMulti-agent Reinforcement Learning | —Unverified | 0 |
| Distributed Deep Reinforcement Learning for Collaborative Spectrum Sharing | Apr 6, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |