| Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles with Uncertainties | Mar 30, 2020 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| Model-Reference Reinforcement Learning for Collision-Free Tracking Control of Autonomous Surface Vehicles | Aug 17, 2020 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| Modified Actor-Critics | Jul 2, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| MODRL/D-AM: Multiobjective Deep Reinforcement Learning Algorithm Using Decomposition and Attention Model for Multiobjective Optimization | Feb 13, 2020 | Deep Reinforcement LearningMultiobjective Optimization | —Unverified | 0 |
| MODRL/D-EL: Multiobjective Deep Reinforcement Learning with Evolutionary Learning for Multiobjective Optimization | Jul 16, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| MODRL-TA:A Multi-Objective Deep Reinforcement Learning Framework for Traffic Allocation in E-Commerce Search | Jul 22, 2024 | Data AugmentationDeep Reinforcement Learning | —Unverified | 0 |
| Modular Architecture for StarCraft II with Deep Reinforcement Learning | Nov 8, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Modular Deep Q Networks for Sim-to-real Transfer of Visuo-motor Policies | Oct 21, 2016 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Modular Neural Network Policies for Learning In-Flight Object Catching with a Robot Hand-Arm System | Dec 21, 2023 | Deep Reinforcement LearningObject | —Unverified | 0 |
| MoET: Interpretable and Verifiable Reinforcement Learning via Mixture of Expert Trees | Sep 25, 2019 | Deep Reinforcement LearningGame of Go | —Unverified | 0 |
| Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning | Apr 29, 2020 | Deep Reinforcement LearningDrug Design | —Unverified | 0 |
| Mollification Effects of Policy Gradient Methods | May 28, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Monotonic Robust Policy Optimization with Model Discrepancy | Jan 1, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations | Oct 14, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Monte-Carlo Tree Search for Policy Optimization | Dec 23, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | Jun 11, 2025 | D4RLDeep Reinforcement Learning | —Unverified | 0 |
| Motion Control in Multi-Rotor Aerial Robots Using Deep Reinforcement Learning | Feb 9, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments | Oct 22, 2020 | Contact-rich ManipulationDeep Reinforcement Learning | —Unverified | 0 |
| Motion Prediction on Self-driving Cars: A Review | Nov 6, 2020 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| Movable Antenna-Aided Cooperative ISAC Network with Time Synchronization error and Imperfect CSI | Jan 26, 2025 | Deep Reinforcement LearningIntegrated sensing and communication | —Unverified | 0 |
| Movable Antenna-Equipped UAV for Data Collection in Backscatter Sensor Networks: A Deep Reinforcement Learning-based Approach | Nov 21, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Movable Cell-Free Massive MIMO For High-Speed Train Communications: A PPO-Based Antenna Position Optimization | Mar 16, 2025 | Deep Reinforcement LearningPosition | —Unverified | 0 |
| MoveLight: Enhancing Traffic Signal Control through Movement-Centric Deep Reinforcement Learning | Jul 24, 2024 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| MP3: Movement Primitive-Based (Re-)Planning Policy | Jun 22, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| MR-iNet Gym: Framework for Edge Deployment of Deep Reinforcement Learning on Embedded Software Defined Radio | Apr 9, 2022 | Deep Reinforcement LearningGPU | —Unverified | 0 |