MAP Inference for Bayesian Inverse Reinforcement Learning Dec 1, 2011 reinforcement-learning Reinforcement Learning
MAPPER: Multi-Agent Path Planning with Evolutionary Reinforcement Learning in Mixed Dynamic Environments Jul 30, 2020 reinforcement-learning Reinforcement Learning
MAPS: Multi-agent Reinforcement Learning-based Portfolio Management System Jul 10, 2020 Management Multi-agent Reinforcement Learning
Marginalized Importance Sampling for Off-Environment Policy Evaluation Sep 4, 2023 Reinforcement Learning (RL)
Marginalized Operators for Off-policy Reinforcement Learning Mar 30, 2022 Off-policy evaluation reinforcement-learning
MarineFormer: A Spatio-Temporal Attention Model for USV Navigation in Dynamic Marine Environments Oct 17, 2024 Collision Avoidance Graph Attention
MarineGym: A High-Performance Reinforcement Learning Platform for Underwater Robotics Mar 12, 2025 Benchmarking GPU
Market Making via Reinforcement Learning in China Commodity Market May 18, 2022 reinforcement-learning Reinforcement Learning
Markov Chain Concentration with an Application in Reinforcement Learning Jan 7, 2023 reinforcement-learning Reinforcement Learning
Markov Chain Monte Carlo Policy Optimization Jan 4, 2021 continuous-control Continuous Control
Markov Chain Variance Estimation: A Stochastic Approximation Approach Sep 9, 2024 Reinforcement Learning (RL)
Markov Cricket: Using Forward and Inverse Reinforcement Learning to Model, Predict And Optimize Batting Performance in One-Day International Cricket Mar 7, 2021 reinforcement-learning Reinforcement Learning (RL)
Markov Decision Processes with Continuous Side Information Nov 15, 2017 PAC learning Reinforcement Learning
Markovian Interference in Experiments Jun 6, 2022 Off-policy evaluation Reinforcement Learning (RL)
MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks Feb 2, 2023 Reinforcement Learning (RL)
MARNET: Backdoor Attacks against Value-Decomposition Multi-Agent Reinforcement Learning Sep 29, 2021 Backdoor Attack Deep Reinforcement Learning
MARTI-4: new model of human brain, considering neocortex and basal ganglia -- learns to play Atari game by reinforcement learning on a single CPU Aug 18, 2022 CPU OpenAI Gym
Mask Atari for Deep Reinforcement Learning as POMDP Benchmarks Mar 31, 2022 Atari Games Deep Reinforcement Learning
Masked Deep Q-Recommender for Effective Question Scheduling Dec 19, 2021 Knowledge Tracing Reinforcement Learning (RL)
Masked Generative Priors Improve World Models Sequence Modelling Capabilities Oct 10, 2024 continuous-control Continuous Control
Masked World Models for Visual Control Jun 28, 2022 Model-based Reinforcement Learning Reinforcement Learning (RL)
MASP: Scalable GNN-based Planning for Multi-Agent Navigation Dec 5, 2023 Reinforcement Learning (RL) Zero-shot Generalization
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning Dec 20, 2019 AI Agent Deep Reinforcement Learning
Mastering Pair Trading with Risk-Aware Recurrent Reinforcement Learning Apr 1, 2023 PAIR TRADING reinforcement-learning
Mastering Spatial Graph Prediction of Road Networks Oct 3, 2022 Prediction Reinforcement Learning (RL)
Mastering the Digital Art of War: Developing Intelligent Combat Simulation Agents for Wargaming Using Hierarchical Reinforcement Learning Aug 23, 2024 Computational Efficiency Decision Making
Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning Jun 30, 2022 Board Games Decision Making
Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning Jun 12, 2022 Continual Learning Hierarchical Reinforcement Learning
(N,K)-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model Mar 11, 2024 Benchmarking Language Modeling
B-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis Oct 4, 2023 Code Generation Deep Reinforcement Learning
Mathematics and Machine Creativity: A Survey on Bridging Mathematics with AI Dec 21, 2024 Reinforcement Learning (RL) Survey
Deep Policy Iteration with Integer Programming for Inventory Management Dec 4, 2021 Decision Making Management
MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning Sep 10, 2019 Deep Reinforcement Learning reinforcement-learning
Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure May 24, 2023 Matrix Completion reinforcement-learning
MAVIPER: Learning Decision Tree Policies for Interpretable Multi-Agent Reinforcement Learning May 25, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Maximizing Confidence Alone Improves Reasoning May 28, 2025 GSM8K Math
Maximizing Ensemble Diversity in Deep Reinforcement Learning Sep 29, 2021 Atari Games Decision Making
Maximizing Information Gain in Partially Observable Environments via Prediction Reward May 11, 2020 Prediction Question Answering
Reward Tweaking: Maximizing the Total Reward While Planning for Short Horizons Feb 9, 2020 continuous-control Continuous Control
Maximum Entropy Diverse Exploration: Disentangling Maximum Entropy Reinforcement Learning Nov 3, 2019 Diversity reinforcement-learning
Maximum Entropy Dueling Network Architecture in Atari Domain Jul 30, 2021 reinforcement-learning Reinforcement Learning (RL)
Maximum entropy GFlowNets with soft Q-learning Dec 21, 2023 Q-Learning Reinforcement Learning (RL)
Maximum Entropy Hindsight Experience Replay Oct 31, 2024 reinforcement-learning Reinforcement Learning
Adversarial Inverse Reinforcement Learning for Mean Field Games Apr 29, 2021 reinforcement-learning Reinforcement Learning
Maximum Entropy Model-based Reinforcement Learning Dec 2, 2021 Dota 2 model
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors Jun 8, 2020 model Model-based Reinforcement Learning
Maximum Entropy Reinforcement Learning with Mixture Policies Mar 18, 2021 continuous-control Continuous Control
Maximum Entropy RL (Provably) Solves Some Robust RL Problems Mar 10, 2021 Reinforcement Learning (RL)
Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning Sep 12, 2019 reinforcement-learning Reinforcement Learning
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees Oct 4, 2022 counterfactual Imitation Learning