| A finite time analysis of distributed Q-learning | May 23, 2024 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 |
| Exclusively Penalized Q-learning for Offline Reinforcement Learning | May 23, 2024 | Offline RL, Q-Learning | —Unverified | 0 |
| Learning To Play Atari Games Using Dueling Q-Learning and Hebbian Plasticity | May 22, 2024 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Stochastic Q-learning for Large Discrete Action Spaces | May 16, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning | May 14, 2024 | Q-Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments | May 14, 2024 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning | May 10, 2024 | Misconceptions, Multi-agent Reinforcement Learning | —Unverified | 0 |
| An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models | May 9, 2024 | Hierarchical Reinforcement Learning, Management | —Unverified | 0 |
| SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems | May 7, 2024 | CPU, GPU | Code Available | 0 |
| Enhancing Q-Learning with Large Language Model Heuristics | May 6, 2024 | Decision Making, Language Modeling | —Unverified | 0 |
| A Network Simulation of OTC Markets with Multiple Agents | May 3, 2024 | Q-Learning | —Unverified | 0 |
| Regularized Q-learning through Robust Averaging | May 3, 2024 | Q-Learning | Code Available | 0 |
| Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach | May 3, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| LOQA: Learning with Opponent Q-Learning Awareness | May 2, 2024 | Q-Learning | —Unverified | 0 |
| Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision | May 1, 2024 | Q-Learning | —Unverified | 0 |
| Numeric Reward Machines | Apr 30, 2024 | Q-Learning | —Unverified | 0 |
| Reinforcement Learning Problem Solving with Large Language Models | Apr 29, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Using Deep Q-Learning to Dynamically Toggle between Push/Pull Actions in Computational Trust Mechanisms | Apr 28, 2024 | Q-Learning | —Unverified | 0 |
| Q-learning with temporal memory to navigate turbulence | Apr 26, 2024 | Decision Making, Navigate | —Unverified | 0 |
| Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation | Apr 24, 2024 | Q-Learning, Scheduling | —Unverified | 0 |
| AFU: Actor-Free critic Updates in off-policy RL for continuous control | Apr 24, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| Recursive Backwards Q-Learning in Deterministic Environments | Apr 24, 2024 | Q-Learning | —Unverified | 0 |
| Research on Robot Path Planning Based on Reinforcement Learning | Apr 22, 2024 | Q-Learning, reinforcement-learning | Code Available | 1 |
| Unified ODE Analysis of Smooth Q-Learning Algorithms | Apr 20, 2024 | Q-Learning | —Unverified | 0 |
| Data-Incremental Continual Offline Reinforcement Learning | Apr 19, 2024 | Continual Learning, Offline RL | —Unverified | 0 |
| Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty | Apr 19, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| From r to Q^*: Your Language Model is Secretly a Q-Function | Apr 18, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Apr 15, 2024 | GPU, Offline RL | —Unverified | 0 |
| Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement | Apr 12, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Traffic Signal Control and Speed Offset Coordination Using Q-Learning for Arterial Road Networks | Apr 9, 2024 | Q-Learning, Traffic Signal Control | —Unverified | 0 |
| Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive SA | Apr 9, 2024 | Q-Learning | —Unverified | 0 |
| Deep Reinforcement Learning Control for Disturbance Rejection in a Nonlinear Dynamic System with Parametric Uncertainty | Apr 6, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution | Apr 5, 2024 | continuous-control, Continuous Control | —Unverified | 0 |
| Superior Genetic Algorithms for the Target Set Selection Problem Based on Power-Law Parameter Choices and Simple Greedy Heuristics | Apr 5, 2024 | Q-Learning | Code Available | 0 |
| Laser Learning Environment: A new environment for coordination-critical multi-agent tasks | Apr 4, 2024 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Data-Driven Knowledge Transfer in Batch Q^* Learning | Apr 1, 2024 | Decision Making, Marketing | —Unverified | 0 |
| Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning | Mar 31, 2024 | Atari Games, Q-Learning | —Unverified | 0 |
| EnCoMP: Enhanced Covert Maneuver Planning with Adaptive Threat-Aware Visibility Estimation using Offline Reinforcement Learning | Mar 29, 2024 | Navigate, Q-Learning | —Unverified | 0 |
| From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries | Mar 27, 2024 | Autonomous Navigation, Decision Making | Code Available | 0 |
| Compressed Federated Reinforcement Learning with a Generative Model | Mar 26, 2024 | model, Q-Learning | Code Available | 0 |
| Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints | Mar 25, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| DASA: Delay-Adaptive Multi-Agent Stochastic Approximation | Mar 25, 2024 | Avg, Q-Learning | —Unverified | 0 |
| A Fairness-Oriented Reinforcement Learning Approach for the Operation and Control of Shared Micromobility Services | Mar 23, 2024 | Fairness, Q-Learning | Code Available | 0 |
| Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study | Mar 20, 2024 | Autonomous Driving, Q-Learning | —Unverified | 0 |
| State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Mar 18, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Neural-Kernel Conditional Mean Embeddings | Mar 16, 2024 | Deep Learning, Density Estimation | —Unverified | 0 |
| A Reinforcement Learning Approach to Dairy Farm Battery Management using Q Learning | Mar 14, 2024 | Management, Q-Learning | —Unverified | 0 |
| Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning | Mar 13, 2024 | Q-Learning | —Unverified | 0 |
| Strategizing against Q-learners: A Control-theoretical Approach | Mar 13, 2024 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services | Mar 12, 2024 | Q-Learning | —Unverified | 0 |