MODRL/D-AM: Multiobjective Deep Reinforcement Learning Algorithm Using Decomposition and Attention Model for Multiobjective Optimization Feb 13, 2020 Deep Reinforcement Learning Multiobjective Optimization
— Unverified 0MODRL/D-EL: Multiobjective Deep Reinforcement Learning with Evolutionary Learning for Multiobjective Optimization Jul 16, 2021 Combinatorial Optimization Deep Reinforcement Learning
— Unverified 0Modular Architecture for StarCraft II with Deep Reinforcement Learning Nov 8, 2018 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Modularity benefits reinforcement learning agents with competing homeostatic drives Apr 13, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment Jun 28, 2021 Decision Making Policy Gradient Methods
— Unverified 0Modulated Policy Hierarchies Nov 30, 2018 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Modulating Reservoir Dynamics via Reinforcement Learning for Efficient Robot Skill Synthesis Nov 17, 2024 Reinforcement Learning (RL)
— Unverified 0MoET: Interpretable and Verifiable Reinforcement Learning via Mixture of Expert Trees Sep 25, 2019 Deep Reinforcement Learning Game of Go
— Unverified 0Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning Apr 29, 2020 Deep Reinforcement Learning Drug Design
— Unverified 0Molecular Generative Adversarial Network with Multi-Property Optimization Mar 29, 2024 Drug Discovery Generative Adversarial Network
— Unverified 0Mollification Effects of Policy Gradient Methods May 28, 2024 continuous-control Continuous Control
— Unverified 0Momentum in Reinforcement Learning Oct 21, 2019 Atari Games reinforcement-learning
— Unverified 0MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking Jan 22, 2025 Reinforcement Learning (RL)
— Unverified 0MONAS: Multi-Objective Neural Architecture Search using Reinforcement Learning Jun 27, 2018 General Classification Neural Architecture Search
— Unverified 0MONEYBaRL: Exploiting pitcher decision-making using Reinforcement Learning Jul 31, 2014 BIG-bench Machine Learning Decision Making
— Unverified 0Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials Feb 26, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Monte-Carlo Planning and Learning with Language Action Value Estimates Jan 1, 2021 Natural Language Understanding reinforcement-learning
— Unverified 0Monte Carlo Planning with Large Language Model for Text-Based Game Agents Apr 23, 2025 Language Modeling Language Modelling
— Unverified 0Monte-Carlo Siamese Policy on Actor for Satellite Image Super Resolution Apr 8, 2020 Image Super-Resolution reinforcement-learning
— Unverified 0Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning Nov 23, 2022 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 0Monte-Carlo Tree Search for Policy Optimization Dec 23, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Moody Learners -- Explaining Competitive Behaviour of Reinforcement Learning Agents Jul 30, 2020 Decision Making reinforcement-learning
— Unverified 0MOORe: Model-based Offline-to-Online Reinforcement Learning Jan 25, 2022 D4RL model
— Unverified 0Moral reinforcement learning using actual causation May 17, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences Oct 20, 2021 Efficient Exploration Open-Ended Question Answering
— Unverified 0More Efficient Off-Policy Evaluation through Regularized Targeted Learning Dec 13, 2019 Causal Inference Off-policy evaluation
— Unverified 0(More) Efficient Reinforcement Learning via Posterior Sampling Jun 4, 2013 Efficient Exploration reinforcement-learning
— Unverified 0MOReL: Model-Based Offline Reinforcement Learning Dec 1, 2020 model Offline RL
— Unverified 0More Robust Doubly Robust Off-policy Evaluation Feb 10, 2018 Multi-Armed Bandits Off-policy evaluation
— Unverified 0MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models Mar 11, 2025 Large Language Model Mixture-of-Experts
— Unverified 0MOT: A Mixture of Actors Reinforcement Learning Method by Optimal Transport for Algorithmic Trading Jun 3, 2024 Algorithmic Trading Imitation Learning
— Unverified 0MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding Feb 18, 2020 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Motion Perception in Reinforcement Learning with Dynamic Objects Jan 10, 2019 continuous-control Continuous Control
— Unverified 0Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments Oct 22, 2020 Contact-rich Manipulation Deep Reinforcement Learning
— Unverified 0Motion Planning by Reinforcement Learning for an Unmanned Aerial Vehicle in Virtual Open Space with Static Obstacles Sep 24, 2020 Motion Planning reinforcement-learning
— Unverified 0Motion Planning for Autonomous Vehicles in the Presence of Uncertainty Using Reinforcement Learning Oct 1, 2021 Autonomous Driving Autonomous Vehicles
— Unverified 0Motion Prediction on Self-driving Cars: A Review Nov 6, 2020 Autonomous Vehicles Deep Reinforcement Learning
— Unverified 0MotionRL: Align Text-to-Motion Generation to Human Preferences with Multi-Reward Reinforcement Learning Oct 9, 2024 Motion Generation reinforcement-learning
— Unverified 0Motivating Physical Activity via Competitive Human-Robot Interaction Feb 14, 2022 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0MP3: Movement Primitive-Based (Re-)Planning Policy Jun 22, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0MPC4RL -- A Software Package for Reinforcement Learning based on Model Predictive Control Jan 27, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0MPC-based Reinforcement Learning for a Simplified Freight Mission of Autonomous Surface Vehicles Jun 16, 2021 Model Predictive Control Position
— Unverified 0MPC-based Reinforcement Learning for Economic Problems with Application to Battery Storage Apr 6, 2021 Model Predictive Control reinforcement-learning
— Unverified 0MQES: Max-Q Entropy Search for Efficient Exploration in Continuous Reinforcement Learning Jan 1, 2021 Efficient Exploration MuJoCo
— Unverified 0MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server Apr 22, 2018 BIG-bench Machine Learning Quantization
— Unverified 0MRAC-RL: A Framework for On-Line Policy Adaptation Under Parametric Model Uncertainty Nov 20, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0MSDF: A Deep Reinforcement Learning Framework for Service Function Chain Migration Nov 12, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0MS-Ranker: Accumulating Evidence from Potentially Correct Candidates for Answer Selection Oct 10, 2020 Answer Selection Reinforcement Learning (RL)
— Unverified 0MSRL: Distributed Reinforcement Learning with Dataflow Fragments Oct 3, 2022 CPU GPU
— Unverified 0MSVIPER: Improved Policy Distillation for Reinforcement-Learning-Based Robot Navigation Sep 19, 2022 Imitation Learning reinforcement-learning
— Unverified 0