| Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks | Jun 26, 2024 | Autonomous Vehicles, Decision Making | —Unverified | 0 |
| MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention | Jun 24, 2024 | Imitation Learning, Q-Learning | —Unverified | 0 |
| EduQate: Generating Adaptive Curricula through RMABs in Education Settings | Jun 20, 2024 | Multi-Armed Bandits, Q-Learning | —Unverified | 0 |
| Equivariant Offline Reinforcement Learning | Jun 20, 2024 | Offline RL, Q-Learning | —Unverified | 0 |
| Learning to Select Goals in Automated Planning with Deep-Q Learning | Jun 20, 2024 | Q-Learning | —Unverified | 0 |
| A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithms | Jun 20, 2024 | Learning Theory, Q-Learning | —Unverified | 0 |
| Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry | Jun 18, 2024 | Q-Learning | Code Available | 0 |
| Optimal Transport-Assisted Risk-Sensitive Q-Learning | Jun 17, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Catalytic evolution of cooperation in a population with behavioural bimodality | Jun 17, 2024 | Q-Learning | —Unverified | 0 |
| Finite-Time Analysis of Simultaneous Double Q-learning | Jun 14, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning | Jun 14, 2024 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors | Jun 12, 2024 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker-Planck Equation | Jun 12, 2024 | Q-Learning | Code Available | 0 |
| Online Frequency Scheduling by Learning Parallel Actions | Jun 7, 2024 | Graph Neural Network, Q-Learning | —Unverified | 0 |
| Fast-Fading Channel and Power Optimization of the Magnetic Inductive Cellular Network | Jun 7, 2024 | Q-Learning | —Unverified | 0 |
| Stabilizing Extreme Q-learning by Maclaurin Expansion | Jun 7, 2024 | D4RL, Offline RL | Code Available | 0 |
| Bootstrapping Expectiles in Reinforcement Learning | Jun 6, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Age of Trust (AoT): A Continuous Verification Framework for Wireless Networks | Jun 4, 2024 | Philosophy, Q-Learning | —Unverified | 0 |
| Tabular and Deep Learning for the Whittle Index | Jun 4, 2024 | Deep Learning, Q-Learning | —Unverified | 0 |
| Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning | Jun 4, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| How to discretize continuous state-action spaces in Q-learning: A symbolic control approach | Jun 3, 2024 | Q-Learning | —Unverified | 0 |
| Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation | May 31, 2024 | Q-Learning | Code Available | 0 |
| Q-learning as a monotone scheme | May 30, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Approximate Global Convergence of Independent Learning in Multi-Agent Systems | May 30, 2024 | Q-Learning | —Unverified | 0 |
| Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost | May 29, 2024 | Q-Learning | —Unverified | 0 |
| Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression | May 28, 2024 | Imitation Learning, MuJoCo | Code Available | 0 |
| Mutation-Bias Learning in Games | May 28, 2024 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Highway Reinforcement Learning | May 28, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RL, Offline RL | Code Available | 0 |
| Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games | May 27, 2024 | Q-Learning | —Unverified | 0 |
| Reinforcement Learning for Jump-Diffusions, with Financial Applications | May 26, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS | May 26, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine | May 24, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Extracting Heuristics from Large Language Models for Reward Shaping in Reinforcement Learning | May 24, 2024 | Language Modelling, Large Language Model | —Unverified | 0 |
| SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning | May 24, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| A finite time analysis of distributed Q-learning | May 23, 2024 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 |
| Exclusively Penalized Q-learning for Offline Reinforcement Learning | May 23, 2024 | Offline RL, Q-Learning | —Unverified | 0 |
| Learning To Play Atari Games Using Dueling Q-Learning and Hebbian Plasticity | May 22, 2024 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Stochastic Q-learning for Large Discrete Action Spaces | May 16, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments | May 14, 2024 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning | May 14, 2024 | Q-Learning | —Unverified | 0 |
| An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning | May 10, 2024 | Misconceptions, Multi-agent Reinforcement Learning | —Unverified | 0 |
| An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models | May 9, 2024 | Hierarchical Reinforcement Learning, Management | —Unverified | 0 |
| SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems | May 7, 2024 | CPU, GPU | Code Available | 0 |
| Enhancing Q-Learning with Large Language Model Heuristics | May 6, 2024 | Decision Making, Language Modeling | —Unverified | 0 |
| Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach | May 3, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| A Network Simulation of OTC Markets with Multiple Agents | May 3, 2024 | Q-Learning | —Unverified | 0 |
| Regularized Q-learning through Robust Averaging | May 3, 2024 | Q-Learning | Code Available | 0 |
| LOQA: Learning with Opponent Q-Learning Awareness | May 2, 2024 | Q-Learning | —Unverified | 0 |
| Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision | May 1, 2024 | Q-Learning | —Unverified | 0 |