| Model-based versus model-free feeding control and water quality monitoring for fish growth tracking in aquaculture systems | Jun 14, 2023 | modelModel Predictive Control | —Unverified | 0 |
| Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care | Jun 13, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach | Jun 9, 2023 | Q-Learning | —Unverified | 0 |
| Approximate information state based convergence analysis of recurrent Q-learning | Jun 9, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Active Inference in Hebbian Learning Networks | Jun 8, 2023 | OpenAI GymQ-Learning | —Unverified | 0 |
| Agent Performing Autonomous Stock Trading under Good and Bad Situations | Jun 6, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Reinforcement Learning-Based Control of CrazyFlie 2.X Quadrotor | Jun 6, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task | Jun 2, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control | Jun 1, 2023 | D4RLModel-based Reinforcement Learning | —Unverified | 0 |
| Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse | May 29, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| VA-learning as a more efficient alternative to Q-learning | May 29, 2023 | Q-Learning | —Unverified | 0 |
| Sample Complexity of Variance-reduced Distributionally Robust Q-learning | May 28, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| A Comparative Analysis of Portfolio Optimization Using Mean-Variance, Hierarchical Risk Parity, and Reinforcement Learning Approaches on the Indian Stock Market | May 27, 2023 | Portfolio OptimizationQ-Learning | —Unverified | 0 |
| Reinforcement Learning With Reward Machines in Stochastic Games | May 27, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| MADiff: Offline Multi-agent Learning with Diffusion Models | May 27, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks | May 25, 2023 | Q-Learning | —Unverified | 0 |
| RSRM: Reinforcement Symbolic Regression Machine | May 24, 2023 | MathQ-Learning | —Unverified | 0 |
| OER: Offline Experience Replay for Continual Offline Reinforcement Learning | May 23, 2023 | Continual LearningMuJoCo | —Unverified | 0 |
| When should we prefer Decision Transformers for Offline Reinforcement Learning? | May 23, 2023 | D4RLImitation Learning | CodeCode Available | 1 |
| A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks | May 20, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Bayesian Risk-Averse Q-Learning with Streaming Observations | May 18, 2023 | Q-Learning | —Unverified | 0 |
| The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond | May 18, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Model-Free Robust Average-Reward Reinforcement Learning | May 17, 2023 | modelQ-Learning | —Unverified | 0 |
| Smart Home Energy Management: VAE-GAN synthetic dataset generator and Q-learning | May 14, 2023 | energy managementGenerative Adversarial Network | —Unverified | 0 |
| Mastering Percolation-like Games with Deep Learning | May 12, 2023 | Deep LearningQ-Learning | CodeCode Available | 0 |
| On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm | May 11, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Q-Learning-based Distribution Network Reconfiguration for Reliability Improvement | May 2, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Mixed-Integer Optimal Control via Reinforcement Learning: A Case Study on Hybrid Electric Vehicle Energy Management | May 2, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments | Apr 30, 2023 | Motion PlanningQ-Learning | CodeCode Available | 0 |
| BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading | Apr 27, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Safe Q-learning for continuous-time linear systems | Apr 26, 2023 | Q-Learning | —Unverified | 0 |
| Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach | Apr 25, 2023 | Q-LearningScheduling | —Unverified | 0 |
| Learned Collusion | Apr 25, 2023 | Q-Learning | —Unverified | 0 |
| IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies | Apr 20, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Deep-Q Learning with Hybrid Quantum Neural Network on Solving Maze Problems | Apr 20, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Graph Exploration for Effective Multi-agent Q-Learning | Apr 19, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Quantum deep Q learning with distributed prioritized experience replay | Apr 19, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A study on a Q-Learning algorithm application to a manufacturing assembly problem | Apr 17, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Collaborative Multi-BS Power Management for Dense Radio Access Network using Deep Reinforcement Learning | Apr 17, 2023 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| Exploring the Noise Resilience of Successor Features and Predecessor Features Algorithms in One and Two-Dimensional Environments | Apr 14, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Deep reinforcement learning applied to an assembly sequence planning problem with user preferences | Apr 13, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| RELS-DQN: A Robust and Efficient Local Search Framework for Combinatorial Optimization | Apr 11, 2023 | Combinatorial OptimizationMarketing | —Unverified | 0 |
| Reinforcement Learning Based Minimum State-flipped Control for the Reachability of Boolean Control Networks | Apr 11, 2023 | Q-LearningTransfer Learning | —Unverified | 0 |
| Automaton-Guided Curriculum Generation for Reinforcement Learning Agents | Apr 11, 2023 | Decision MakingQ-Learning | CodeCode Available | 0 |
| Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks | Apr 8, 2023 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 |
| Full Gradient Deep Reinforcement Learning for Average-Reward Criterion | Apr 7, 2023 | Deep Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 |
| Deep Reinforcement Learning Based Optimal Infinite-Horizon Control of Probabilistic Boolean Control Networks | Apr 7, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Quantitative Trading using Deep Q Learning | Apr 3, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Tutorial Introduction to Reinforcement Learning | Apr 3, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization | Mar 31, 2023 | Offline RLQ-Learning | —Unverified | 0 |