| The impact of surplus sharing on the outcomes of specific investments under negotiated transfer pricing: An agent-based simulation with fuzzy Q-learning agents | Jan 28, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Single-Trajectory Distributionally Robust Reinforcement Learning | Jan 27, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| FedHQL: Federated Heterogeneous Q-Learning | Jan 26, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Learning from Multiple Independent Advisors in Multi-agent Reinforcement Learning | Jan 26, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics | Jan 23, 2023 | Q-Learning | —Unverified | 0 |
| Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets | Jan 20, 2023 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures | Jan 14, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Decentralized model-free reinforcement learning in stochastic games with average-reward objective | Jan 13, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hierarchical Deep Q-Learning Based Handover in Wireless Networks with Dual Connectivity | Jan 13, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems | Jan 13, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Multi-Power Level Q-Learning Algorithm for Random Access in NOMA mMTC Systems | Jan 12, 2023 | Q-Learning | —Unverified | 0 |
| Tuning Path Tracking Controllers for Autonomous Cars Using Reinforcement Learning | Jan 9, 2023 | NavigateQ-Learning | —Unverified | 0 |
| Extreme Q-Learning: MaxEnt RL without Entropy | Jan 5, 2023 | D4RLDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning a Generic Value-Selection Heuristic Inside a Constraint Programming Solver | Jan 5, 2023 | Graph Neural NetworkQ-Learning | CodeCode Available | 1 |
| Contextual Conservative Q-Learning for Offline Reinforcement Learning | Jan 3, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning | Jan 3, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Deep Spectral Q-learning with Application to Mobile Health | Jan 3, 2023 | Q-Learning | —Unverified | 0 |
| NARS vs. Reinforcement learning: ONA vs. Q-Learning | Dec 23, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Decoding surface codes with deep reinforcement learning and probabilistic policy reuse | Dec 22, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Control of Continuous Quantum Systems with Many Degrees of Freedom based on Convergent Reinforcement Learning | Dec 21, 2022 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Bandit approach to conflict-free multi-agent Q-learning in view of photonic implementation | Dec 20, 2022 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Taming Lagrangian Chaos with Multi-Objective Reinforcement Learning | Dec 19, 2022 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 |
| Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling | Dec 16, 2022 | MuJoCoQ-Learning | —Unverified | 0 |
| Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet | Dec 15, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| VOQL: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation | Dec 12, 2022 | Q-Learningregression | —Unverified | 0 |
| Frugal Reinforcement-based Active Learning | Dec 9, 2022 | Active LearningDiversity | —Unverified | 0 |
| PALMER: Perception-Action Loop with Memory for Long-Horizon Planning | Dec 8, 2022 | Q-LearningRepresentation Learning | —Unverified | 0 |
| Reinforcement Learning for Resilient Power Grids | Dec 8, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| EASpace: Enhanced Action Space for Policy Transfer | Dec 7, 2022 | Q-LearningTransfer Learning | CodeCode Available | 0 |
| A Machine with Short-Term, Episodic, and Semantic Memory Systems | Dec 5, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Automata Learning meets Shielding | Dec 4, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Welfare and Fairness in Multi-objective Reinforcement Learning | Nov 30, 2022 | FairnessMulti-Objective Reinforcement Learning | CodeCode Available | 0 |
| Automatic Discovery of Multi-perspective Process Model using Reinforcement Learning | Nov 30, 2022 | Model DiscoveryQ-Learning | —Unverified | 0 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| QLAMMP: A Q-Learning Agent for Optimizing Fees on Automated Market Making Protocols | Nov 28, 2022 | Q-Learning | —Unverified | 0 |
| Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Causal Deep Reinforcement Learning Using Observational Data | Nov 28, 2022 | Autonomous DrivingCausal Inference | —Unverified | 0 |
| State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| UAV-Assisted Space-Air-Ground Integrated Networks: A Technical Review of Recent Learning Algorithms | Nov 27, 2022 | FairnessQ-Learning | —Unverified | 0 |
| Double Deep Q-Learning in Opponent Modeling | Nov 24, 2022 | Mixture-of-ExpertsQ-Learning | —Unverified | 0 |
| Explainable and Safe Reinforcement Learning for Autonomous Air Mobility | Nov 24, 2022 | Adversarial AttackDeep Reinforcement Learning | CodeCode Available | 0 |
| Learning Self-Awareness Models for Physical Layer Security in Cognitive and AI-enabled Radios | Nov 23, 2022 | Q-Learning | —Unverified | 0 |
| Reinforcement Causal Structure Learning on Order Graph | Nov 22, 2022 | Causal DiscoveryQ-Learning | —Unverified | 0 |
| Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks | Nov 21, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Simultaneously Updating All Persistence Values in Reinforcement Learning | Nov 21, 2022 | AllAtari Games | —Unverified | 0 |
| Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit | Nov 18, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Credit-cognisant reinforcement learning for multi-agent cooperation | Nov 18, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| A Reinforcement Learning Approach for Process Parameter Optimization in Additive Manufacturing | Nov 17, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Planning Irregular Object Packing via Hierarchical Reinforcement Learning | Nov 17, 2022 | Hierarchical Reinforcement LearningObject | —Unverified | 0 |
| Addressing the issue of stochastic environments and local decision-making in multi-objective reinforcement learning | Nov 16, 2022 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 |