| Learning Sharing Behaviors with Arbitrary Numbers of Agents | Dec 10, 2018 | Q-Learning | —Unverified | 0 |
| Learning Strategic Value and Cooperation in Multi-Player Stochastic Games through Side Payments | Mar 9, 2023 | Form, Q-Learning | —Unverified | 0 |
| Learning through Probing: a decentralized reinforcement learning architecture for social dilemmas | Sep 26, 2018 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | —Unverified | 0 |
| Learning Time Reduction Using Warm Start Methods for a Reinforcement Learning Based Supervisory Control in Hybrid Electric Vehicle Applications | Oct 27, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Learning to Charge More: A Theoretical Study of Collusion by Q-Learning Agents | May 28, 2025 | Q-Learning | —Unverified | 0 |
| Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System | Oct 29, 2021 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Learning to Cooperate and Communicate Over Imperfect Channels | Nov 24, 2023 | Q-Learning | —Unverified | 0 |
| Learning to Cooperate via Policy Search | Aug 7, 2014 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems | Jul 1, 2018 | Multi-Armed Bandits, Q-Learning | —Unverified | 0 |
| Learning to Dynamically Coordinate Multi-Robot Teams in Graph Attention Networks | Dec 4, 2019 | Combinatorial Optimization, Graph Attention | —Unverified | 0 |
| Learning to Explore via Meta-Policy Gradient | Jul 1, 2018 | continuous-control, Continuous Control | —Unverified | 0 |
| Learning to Explore with Meta-Policy Gradient | Mar 13, 2018 | Q-Learning, Reinforcement Learning | —Unverified | 0 |
| Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning | May 20, 2017 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Learning to Learn from Noisy Web Videos | Jun 9, 2017 | Action Recognition, Q-Learning | —Unverified | 0 |
| Maximizing Influence with Graph Neural Networks | Aug 10, 2021 | Combinatorial Optimization, Computational Efficiency | —Unverified | 0 |
| Learning to Play Video Games with Intuitive Physics Priors | Sep 20, 2024 | Decision Making, Object | —Unverified | 0 |
| Learning to predict where to look in interactive environments using deep recurrent q-learning | Dec 17, 2016 | Atari Games, Q-Learning | —Unverified | 0 |
| Learning to Reason | Oct 12, 2018 | Automated Theorem Proving, Q-Learning | —Unverified | 0 |
| Learning to Represent Haptic Feedback for Partially-Observable Tasks | May 17, 2017 | Q-Learning | —Unverified | 0 |
| Learning to Select Goals in Automated Planning with Deep-Q Learning | Jun 20, 2024 | Q-Learning | —Unverified | 0 |
| Learning to Sketch with Deep Q Networks and Demonstrated Strokes | Oct 14, 2018 | Q-Learning | —Unverified | 0 |
| Learning Value Functions from Undirected State-only Experience | Apr 26, 2022 | Future prediction, Imitation Learning | —Unverified | 0 |
| Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare | May 17, 2021 | Q-Learning | —Unverified | 0 |
| Lifting the Veil: Unlocking the Power of Depth in Q-learning | Oct 27, 2023 | Learning Theory, Management | —Unverified | 0 |
| Linear Q-Learning Does Not Diverge: Convergence Rates to a Bounded Set | Jan 31, 2025 | Q-Learning | —Unverified | 0 |
| Listwise Learning to Rank with Deep Q-Networks | Feb 13, 2020 | Decision Making, Learning-To-Rank | —Unverified | 0 |
| LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning | Jul 5, 2023 | Offline RL, Q-Learning | —Unverified | 0 |
| Location-routing Optimisation for Urban Logistics Using Mobile Parcel Locker Based on Hybrid Q-Learning Algorithm | Oct 29, 2021 | Q-Learning | —Unverified | 0 |
| Logical Team Q-learning: An approach towards factored policies in cooperative MARL | Jun 5, 2020 | Q-Learning | —Unverified | 0 |
| Logistic Q-Learning | Oct 21, 2020 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Long and Short Memory Balancing in Visual Co-Tracking using Q-Learning | Feb 14, 2019 | Q-Learning, Reinforcement Learning | —Unverified | 0 |
| Long-term Fairness in Ride-Hailing Platform | Jul 25, 2024 | Fairness, Q-Learning | —Unverified | 0 |
| Long-term planning, short-term adjustments | Sep 25, 2019 | Deep Reinforcement Learning, Prediction | —Unverified | 0 |
| LOQA: Learning with Opponent Q-Learning Awareness | May 2, 2024 | Q-Learning | —Unverified | 0 |
| MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning | Sep 17, 2022 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Machine learning-based decentralized TDMA for VLC IoT networks | Nov 23, 2023 | Collision Avoidance, Q-Learning | —Unverified | 0 |
| Machine Learning Empowered Trajectory and Passive Beamforming Design in UAV-RIS Wireless Networks | Oct 6, 2020 | BIG-bench Machine Learning, Q-Learning | —Unverified | 0 |
| MACOptions: Multi-Agent Learning with Centralized Controller and Options Framework | Feb 7, 2023 | Q-Learning | —Unverified | 0 |
| Managing App Install Ad Campaigns in RTB: A Q-Learning Approach | Nov 11, 2018 | Q-Learning | —Unverified | 0 |
| Manipulating Reinforcement Learning: Poisoning Attacks on Cost Signals | Feb 7, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Many-Goals Reinforcement Learning | Jun 22, 2018 | All, Q-Learning | —Unverified | 0 |
| Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow | Jul 1, 2021 | Decision Making, Marketing | —Unverified | 0 |
| MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures | Aug 27, 2018 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Maximizing User Connectivity in AI-Enabled Multi-UAV Networks: A Distributed Strategy Generalized to Arbitrary User Distributions | Nov 7, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Maximum entropy GFlowNets with soft Q-learning | Dec 21, 2023 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning | Dec 1, 2024 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 |
| MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning | Sep 22, 2021 | Deep Reinforcement Learning, Gaussian Processes | —Unverified | 0 |
| MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention | Jun 24, 2024 | Imitation Learning, Q-Learning | —Unverified | 0 |
| Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation | May 7, 2025 | Disentanglement, Lightweight Deployment | —Unverified | 0 |
| Meta-Gradient Reinforcement Learning with an Objective Discovered Online | Jul 16, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |