| Logit-Q Dynamics for Efficient Learning in Stochastic Teams | Feb 20, 2023 | Q-Learning | —Unverified | 0 |
| Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data | Feb 15, 2023 | Q-Learningvalid | —Unverified | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision MakingManagement | —Unverified | 0 |
| Computation Offloading for Uncertain Marine Tasks by Cooperation of UAVs and Vessels | Feb 13, 2023 | Q-Learning | —Unverified | 0 |
| A Lifetime Extended Energy Management Strategy for Fuel Cell Hybrid Electric Vehicles via Self-Learning Fuzzy Reinforcement Learning | Feb 13, 2023 | energy managementManagement | —Unverified | 0 |
| Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading | Feb 9, 2023 | Edge-computingQ-Learning | —Unverified | 0 |
| MACOptions: Multi-Agent Learning with Centralized Controller and Options Framework | Feb 7, 2023 | Q-Learning | —Unverified | 0 |
| Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms | Feb 7, 2023 | Q-Learning | —Unverified | 0 |
| Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage | Feb 5, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition | Feb 2, 2023 | DiversityQ-Learning | —Unverified | 0 |
| Best Possible Q-Learning | Feb 2, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Sample Complexity of Kernel-Based Q-Learning | Feb 1, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| The impact of surplus sharing on the outcomes of specific investments under negotiated transfer pricing: An agent-based simulation with fuzzy Q-learning agents | Jan 28, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Analyzing Robustness of the Deep Reinforcement Learning Algorithm in Ramp Metering Applications Considering False Data Injection Attack and Defense | Jan 28, 2023 | Adversarial AttackDeep Reinforcement Learning | —Unverified | 0 |
| RCsearcher: Reaction Center Identification in Retrosynthesis via Deep Q-Learning | Jan 28, 2023 | Deep Reinforcement LearningGraph Neural Network | —Unverified | 0 |
| Single-Trajectory Distributionally Robust Reinforcement Learning | Jan 27, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Learning from Multiple Independent Advisors in Multi-agent Reinforcement Learning | Jan 26, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| FedHQL: Federated Heterogeneous Q-Learning | Jan 26, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics | Jan 23, 2023 | Q-Learning | —Unverified | 0 |
| Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets | Jan 20, 2023 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures | Jan 14, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Decentralized model-free reinforcement learning in stochastic games with average-reward objective | Jan 13, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hierarchical Deep Q-Learning Based Handover in Wireless Networks with Dual Connectivity | Jan 13, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Multi-Power Level Q-Learning Algorithm for Random Access in NOMA mMTC Systems | Jan 12, 2023 | Q-Learning | —Unverified | 0 |
| Tuning Path Tracking Controllers for Autonomous Cars Using Reinforcement Learning | Jan 9, 2023 | NavigateQ-Learning | —Unverified | 0 |
| Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning | Jan 3, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Contextual Conservative Q-Learning for Offline Reinforcement Learning | Jan 3, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| Deep Spectral Q-learning with Application to Mobile Health | Jan 3, 2023 | Q-Learning | —Unverified | 0 |
| NARS vs. Reinforcement learning: ONA vs. Q-Learning | Dec 23, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Decoding surface codes with deep reinforcement learning and probabilistic policy reuse | Dec 22, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Control of Continuous Quantum Systems with Many Degrees of Freedom based on Convergent Reinforcement Learning | Dec 21, 2022 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Bandit approach to conflict-free multi-agent Q-learning in view of photonic implementation | Dec 20, 2022 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Taming Lagrangian Chaos with Multi-Objective Reinforcement Learning | Dec 19, 2022 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 |
| Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling | Dec 16, 2022 | MuJoCoQ-Learning | —Unverified | 0 |
| Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet | Dec 15, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| VOQL: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation | Dec 12, 2022 | Q-Learningregression | —Unverified | 0 |
| Frugal Reinforcement-based Active Learning | Dec 9, 2022 | Active LearningDiversity | —Unverified | 0 |
| Reinforcement Learning for Resilient Power Grids | Dec 8, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| PALMER: Perception-Action Loop with Memory for Long-Horizon Planning | Dec 8, 2022 | Q-LearningRepresentation Learning | —Unverified | 0 |
| EASpace: Enhanced Action Space for Policy Transfer | Dec 7, 2022 | Q-LearningTransfer Learning | CodeCode Available | 0 |
| A Machine with Short-Term, Episodic, and Semantic Memory Systems | Dec 5, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Automata Learning meets Shielding | Dec 4, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Welfare and Fairness in Multi-objective Reinforcement Learning | Nov 30, 2022 | FairnessMulti-Objective Reinforcement Learning | CodeCode Available | 0 |
| Automatic Discovery of Multi-perspective Process Model using Reinforcement Learning | Nov 30, 2022 | Model DiscoveryQ-Learning | —Unverified | 0 |
| State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| QLAMMP: A Q-Learning Agent for Optimizing Fees on Automated Market Making Protocols | Nov 28, 2022 | Q-Learning | —Unverified | 0 |
| Causal Deep Reinforcement Learning Using Observational Data | Nov 28, 2022 | Autonomous DrivingCausal Inference | —Unverified | 0 |
| Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| UAV-Assisted Space-Air-Ground Integrated Networks: A Technical Review of Recent Learning Algorithms | Nov 27, 2022 | FairnessQ-Learning | —Unverified | 0 |
| Explainable and Safe Reinforcement Learning for Autonomous Air Mobility | Nov 24, 2022 | Adversarial AttackDeep Reinforcement Learning | CodeCode Available | 0 |