| Q-Learning based system for path planning with unmanned aerial vehicles swarms in obstacle environments | Mar 30, 2023 | Q-Learning | —Unverified | 0 |
| Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise | Mar 28, 2023 | Q-Learning | —Unverified | 0 |
| Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization | Mar 28, 2023 | D4RL, Offline RL | Code Available | 1 |
| Distributed Multi-Agent Deep Q-Learning for Fast Roaming in IEEE 802.11ax Wi-Fi Systems | Mar 25, 2023 | Q-Learning | —Unverified | 0 |
| Specific investments under negotiated transfer pricing: effects of different surplus sharing parameters on managerial performance: An agent-based simulation with fuzzy Q-learning agents | Mar 25, 2023 | Q-Learning | —Unverified | 0 |
| Robust Path Following on Rivers Using Bootstrapped Reinforcement Learning | Mar 24, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Artificial Intelligence and Dual Contract | Mar 22, 2023 | Q-Learning | —Unverified | 0 |
| Comparing NARS and Reinforcement Learning: An Analysis of ONA and Q-Learning Algorithms | Mar 17, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control | Mar 17, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Self-Inspection Method of Unmanned Aerial Vehicles in Power Plants Using Deep Q-Network Reinforcement Learning | Mar 16, 2023 | Autonomous Navigation, Q-Learning | —Unverified | 0 |
| Smoothed Q-learning | Mar 15, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera | Mar 13, 2023 | Privacy Preserving, Q-Learning | Code Available | 0 |
| The tree reconstruction game: phylogenetic reconstruction using reinforcement learning | Mar 12, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Ignorance is Bliss: Robust Control via Information Gating | Mar 10, 2023 | Inductive Bias, Q-Learning | —Unverified | 0 |
| Digital Twin-Assisted Knowledge Distillation Framework for Heterogeneous Federated Learning | Mar 10, 2023 | Federated Learning, Knowledge Distillation | —Unverified | 0 |
| Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Mar 9, 2023 | Offline RL, Q-Learning | Code Available | 1 |
| Learning Strategic Value and Cooperation in Multi-Player Stochastic Games through Side Payments | Mar 9, 2023 | Form, Q-Learning | —Unverified | 0 |
| Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning | Mar 7, 2023 | Continuous Control, Offline RL | —Unverified | 0 |
| Exploration via Epistemic Value Estimation | Mar 7, 2023 | Decision Making, Efficient Exploration | —Unverified | 0 |
| Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control | Mar 4, 2023 | MuJoCo, Q-Learning | —Unverified | 0 |
| Double A3C: Deep Reinforcement Learning on OpenAI Gym Games | Mar 4, 2023 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 |
| Intelligent O-RAN Traffic Steering for URLLC Through Deep Reinforcement Learning | Mar 3, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning | Mar 2, 2023 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 |
| A Deep Reinforcement Learning Trader without Offline Training | Mar 1, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| The Point to Which Soft Actor-Critic Converges | Mar 1, 2023 | Q-Learning | —Unverified | 0 |
| Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation | Mar 1, 2023 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning | Mar 1, 2023 | Continuous Control, Imitation Learning | Code Available | 1 |
| Minimizing the Outage Probability in a Markov Decision Process | Feb 28, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| A Finite Sample Complexity Bound for Distributionally Robust Q-learning | Feb 26, 2023 | Q-Learning | —Unverified | 0 |
| Q-Cogni: An Integrated Causal Reinforcement Learning Framework | Feb 26, 2023 | Causal Inference, Decision Making | —Unverified | 0 |
| Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation | Feb 25, 2023 | Offline RL, Q-Learning | —Unverified | 0 |
| On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process | Feb 25, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Robust Auto-landing Control of an agile Regional Jet Using Fuzzy Q-learning | Feb 21, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes | Feb 21, 2023 | Learning Theory, Medical Diagnosis | —Unverified | 0 |
| Learning to Play Text-based Adventure Games with Maximum Entropy Reinforcement Learning | Feb 21, 2023 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Forecasting and stabilizing chaotic regimes in two macroeconomic models via artificial intelligence technologies and control methods | Feb 20, 2023 | Decision Making, Evolutionary Algorithms | —Unverified | 0 |
| Logit-Q Dynamics for Efficient Learning in Stochastic Teams | Feb 20, 2023 | Q-Learning | —Unverified | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision Making, Management | —Unverified | 0 |
| Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data | Feb 15, 2023 | Q-Learning, valid | —Unverified | 0 |
| Computation Offloading for Uncertain Marine Tasks by Cooperation of UAVs and Vessels | Feb 13, 2023 | Q-Learning | —Unverified | 0 |
| A Lifetime Extended Energy Management Strategy for Fuel Cell Hybrid Electric Vehicles via Self-Learning Fuzzy Reinforcement Learning | Feb 13, 2023 | energy management, Management | —Unverified | 0 |
| Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading | Feb 9, 2023 | Edge-computing, Q-Learning | —Unverified | 0 |
| MACOptions: Multi-Agent Learning with Centralized Controller and Options Framework | Feb 7, 2023 | Q-Learning | —Unverified | 0 |
| Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms | Feb 7, 2023 | Q-Learning | —Unverified | 0 |
| Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage | Feb 5, 2023 | Offline RL, Q-Learning | —Unverified | 0 |
| Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition | Feb 2, 2023 | Diversity, Q-Learning | —Unverified | 0 |
| Best Possible Q-Learning | Feb 2, 2023 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Sample Complexity of Kernel-Based Q-Learning | Feb 1, 2023 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Analyzing Robustness of the Deep Reinforcement Learning Algorithm in Ramp Metering Applications Considering False Data Injection Attack and Defense | Jan 28, 2023 | Adversarial Attack, Deep Reinforcement Learning | —Unverified | 0 |
| RCsearcher: Reaction Center Identification in Retrosynthesis via Deep Q-Learning | Jan 28, 2023 | Deep Reinforcement Learning, Graph Neural Network | —Unverified | 0 |