| MFC-EQ: Mean-Field Control with Envelope Q-Learning for Moving Decentralized Agents in Formation | Oct 15, 2024 | Multi-Agent Path FindingQ-Learning | —Unverified | 0 |
| Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning | Feb 24, 2020 | Distributional Reinforcement LearningQ-Learning | —Unverified | 0 |
| Mimicking Human Intuition: Cognitive Belief-Driven Q-Learning | Oct 2, 2024 | Decision MakingQ-Learning | —Unverified | 0 |
| Minimax Optimal Q Learning with Nearest Neighbors | Aug 3, 2023 | Q-Learning | —Unverified | 0 |
| Minimizing Age-of-Information for Fog Computing-supported Vehicular Networks with Deep Q-learning | Apr 4, 2020 | Autonomous DrivingQ-Learning | —Unverified | 0 |
| Minimizing the Outage Probability in a Markov Decision Process | Feb 28, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Misspecified Q-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error | Jul 18, 2024 | Q-Learning | —Unverified | 0 |
| Mitigate Bias in Face Recognition using Skewness-Aware Reinforcement Learning | Nov 25, 2019 | Face RecognitionFairness | —Unverified | 0 |
| Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning | Jun 1, 2020 | Face RecognitionFairness | —Unverified | 0 |
| Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning | Nov 17, 2024 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Mixed-Precision Conjugate Gradient Solvers with RL-Driven Precision Tuning | Apr 19, 2025 | Computational EfficiencyQ-Learning | —Unverified | 0 |
| Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning | Jun 14, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks | Apr 21, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Model-Augmented Q-learning | Feb 7, 2021 | modelQ-Learning | —Unverified | 0 |
| Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Sweeping | Jan 15, 2020 | Model-based Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Model-Based Reinforcement Learning for Type 1Diabetes Blood Glucose Control | Oct 13, 2020 | Model-based Reinforcement LearningQ-Learning | —Unverified | 0 |
| Model-based versus model-free feeding control and water quality monitoring for fish growth tracking in aquaculture systems | Jun 14, 2023 | modelModel Predictive Control | —Unverified | 0 |
| Provably Efficient Model-Free Algorithm for MDPs with Peak Constraints | Mar 11, 2020 | Q-LearningScheduling | —Unverified | 0 |
| Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints | Jun 10, 2020 | Q-Learning | —Unverified | 0 |
| Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games | Aug 17, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Model-Free Characterizations of the Hamilton-Jacobi-Bellman Equation and Convex Q-Learning in Continuous Time | Oct 14, 2022 | Q-Learning | —Unverified | 0 |
| Model-free Control of Chaos with Continuous Deep Q-learning | Jul 16, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning | Oct 28, 2019 | General Reinforcement LearningQ-Learning | —Unverified | 0 |
| Model-free optimal controller for discrete-time Markovian jump linear systems: A Q-learning approach | Aug 6, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Model-free Posterior Sampling via Learning Rate Randomization | Oct 27, 2023 | modelQ-Learning | —Unverified | 0 |
| Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care | Jan 11, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation | Feb 26, 2022 | Edge-computingQ-Learning | —Unverified | 0 |
| Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning | Mar 13, 2024 | Q-Learning | —Unverified | 0 |
| Model-Free Robust Average-Reward Reinforcement Learning | May 17, 2023 | modelQ-Learning | —Unverified | 0 |
| Modeling Fake News in Social Networks with Deep Multi-Agent Reinforcement Learning | Sep 25, 2019 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Modelling Bahdanau Attention using Election methods aided by Q-Learning | Nov 10, 2019 | DecoderMachine Translation | —Unverified | 0 |
| Modelling Stock-market Investors as Reinforcement Learning Agents [Correction] | Sep 20, 2016 | Decision MakingQ-Learning | —Unverified | 0 |
| Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach | Dec 1, 2019 | Q-Learning | —Unverified | 0 |
| Modified Double DQN: addressing stability | Aug 9, 2021 | Q-Learning | —Unverified | 0 |
| MODRL-TA:A Multi-Objective Deep Reinforcement Learning Framework for Traffic Allocation in E-Commerce Search | Jul 22, 2024 | Data AugmentationDeep Reinforcement Learning | —Unverified | 0 |
| Momentum Q-learning with Finite-Sample Convergence Guarantee | Jul 30, 2020 | Q-Learning | —Unverified | 0 |
| Multi-agent Assessment with QoS Enhancement for HD Map Updates in a Vehicular Network | Jul 31, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Multi-agent Bayesian Deep Reinforcement Learning for Microgrid Energy Management under Communication Failures | Nov 22, 2021 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Multi-Agent Deep Reinforcement Learning for Energy Efficient Multi-Hop STAR-RIS-Assisted Transmissions | Jul 26, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Multi Agent DeepRL based Joint Power and Subchannel Allocation in IAB networks | Aug 31, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Multi-Agent Double Deep Q-Learning for Beamforming in mmWave MIMO Networks | Aug 13, 2020 | Q-Learning | —Unverified | 0 |
| Multi-Agent Inverse Q-Learning from Demonstrations | Mar 6, 2025 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Multi-Agent Q-Learning Dynamics in Random Networks: Convergence due to Exploration and Sparsity | Mar 13, 2025 | Q-LearningStochastic Block Model | —Unverified | 0 |
| Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids | Aug 25, 2017 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Multi-Agent Q-Learning for Real-Time Load Balancing User Association and Handover in Mobile Networks | Dec 22, 2024 | Q-Learning | —Unverified | 0 |
| Multi-Agent Reinforcement Learning Based Resource Allocation for UAV Networks | Oct 24, 2018 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs | Feb 5, 2024 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Multi-agent Reinforcement Learning for Resource Allocation in IoT networks with Edge Computing | Apr 5, 2020 | Cloud ComputingDistributed Computing | —Unverified | 0 |
| Multi-Agent Reinforcement Learning for Markov Routing Games: A New Modeling Paradigm For Dynamic Traffic Assignment | Nov 22, 2020 | Autonomous VehiclesBilevel Optimization | —Unverified | 0 |
| Multi-Agent Reinforcement Learning for Channel Assignment and Power Allocation in Platoon-Based C-V2X Systems | Nov 9, 2020 | Autonomous VehiclesMulti-agent Reinforcement Learning | —Unverified | 0 |