MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization Dec 16, 2024 Multi-Armed Bandits Reinforcement Learning (RL) — Unverified 0
MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for Recommender Systems Nov 6, 2019 counterfactual Deep Reinforcement Learning — Unverified 0
MBMF: Model-Based Priors for Model-Free Reinforcement Learning Sep 10, 2017 model reinforcement-learning — Unverified 0
Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods Nov 4, 2020 Deep Reinforcement Learning Policy Gradient Methods — Unverified 0
A parallel-network continuous quantitative trading model with GARCH and PPO May 8, 2021 Decision Making Deep Reinforcement Learning — Unverified 0
MDDL: A Framework for Reinforcement Learning-based Position Allocation in Multi-Channel Feed Apr 17, 2023 Imitation Learning Position — Unverified 0
Option Transfer and SMDP Abstraction with Successor Features Oct 18, 2021 Reinforcement Learning (RL) — Unverified 0
MDPFuzz: Testing Models Solving Markov Decision Processes Dec 6, 2021 Autonomous Driving Collision Avoidance — Unverified 0
MDP Playground: Controlling Orthogonal Dimensions of Hardness in Toy Environments Sep 28, 2020 OpenAI Gym Reinforcement Learning (RL) — Unverified 0
Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL) Sep 15, 2022 Multi-agent Reinforcement Learning reinforcement-learning — Unverified 0
Mean Field Games Flock! The Reinforcement Learning Way May 17, 2021 Deep Reinforcement Learning reinforcement-learning — Unverified 0
Mean Field MARL Based Bandwidth Negotiation Method for Massive Devices Spectrum Sharing Apr 30, 2021 Decision Making Distributed Optimization — Unverified 0
Mean-Field Multi-Agent Reinforcement Learning: A Decentralized Network Approach Aug 5, 2021 Multi-agent Reinforcement Learning reinforcement-learning — Unverified 0
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning Jun 15, 2022 Autonomous Driving continuous-control — Unverified 0
Mean-Variance Portfolio Selection by Continuous-Time Reinforcement Learning: Algorithms, Regret Analysis, and Empirical Study Dec 8, 2024 Reinforcement Learning (RL) — Unverified 0
Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning May 29, 2025 Deep Reinforcement Learning MuJoCo — Unverified 0
Measurement-based adaptation protocol with quantum reinforcement learning in a Rigetti quantum computer Nov 19, 2018 reinforcement-learning Reinforcement Learning — Unverified 0
Measurement-based adaptation protocol with quantum reinforcement learning Mar 14, 2018 reinforcement-learning Reinforcement Learning — Unverified 0
Measurement-based Online Available Bandwidth Estimation employing Reinforcement Learning Jun 5, 2019 reinforcement-learning Reinforcement Learning — Unverified 0
Measurement Optimization under Uncertainty using Deep Reinforcement Learning Mar 17, 2023 Deep Reinforcement Learning reinforcement-learning — Unverified 0
Measuring and Characterizing Generalization in Deep Reinforcement Learning Dec 7, 2018 Deep Reinforcement Learning reinforcement-learning — Unverified 0
Measuring Data Quality for Dataset Selection in Offline Reinforcement Learning Nov 26, 2021 reinforcement-learning Reinforcement Learning — Unverified 0
How does Your RL Agent Explore? An Optimal Transport Analysis of Occupancy Measure Trajectories Feb 14, 2024 reinforcement-learning Reinforcement Learning — Unverified 0
Measuring Progress in Deep Reinforcement Learning Sample Efficiency Feb 9, 2021 Atari Games continuous-control — Unverified 0
Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark Mar 29, 2021 Deep Reinforcement Learning reinforcement-learning — Unverified 0
Mechanic Maker 2.0: Reinforcement Learning for Evaluating Generated Rules Sep 18, 2023 Game Design reinforcement-learning — Unverified 0
MedAttacker: Exploring Black-Box Adversarial Attacks on Risk Prediction Models in Healthcare Dec 11, 2021 Adversarial Attack Position — Unverified 0
MedDreamer: Model-Based Reinforcement Learning with Latent Imagination on Complex EHRs for Clinical Decision Support May 26, 2025 Imputation Model-based Reinforcement Learning — Unverified 0
Medical Knowledge Integration into Reinforcement Learning Algorithms for Dynamic Treatment Regimes Jun 29, 2024 reinforcement-learning Reinforcement Learning (RL) — Unverified 0
Medium Access using Distributed Reinforcement Learning for IoTs with Low-Complexity Wireless Transceivers Apr 29, 2021 reinforcement-learning Reinforcement Learning (RL) — Unverified 0
MEETING BOT: Reinforcement Learning for Dialogue Based Meeting Scheduling Dec 28, 2018 reinforcement-learning Reinforcement Learning — Unverified 0
Memory Lens: How Much Memory Does an Agent Use? Nov 21, 2016 reinforcement-learning Reinforcement Learning — Unverified 0
Memristor Hardware-Friendly Reinforcement Learning Jan 20, 2020 reinforcement-learning Reinforcement Learning — Unverified 0
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning Oct 19, 2024 Deep Reinforcement Learning Mixture-of-Experts — Unverified 0
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning Sep 22, 2021 Deep Reinforcement Learning Gaussian Processes — Unverified 0
MERLIN -- Malware Evasion with Reinforcement LearnINg Mar 24, 2022 Malware Detection reinforcement-learning — Unverified 0
MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance Dec 7, 2021 continuous-control Continuous Control — Unverified 0
Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning May 22, 2025 Reinforcement Learning (RL) — Unverified 0
Message-Dropout: An Efficient Training Method for Multi-Agent Deep Reinforcement Learning Feb 18, 2019 Deep Reinforcement Learning Multi-agent Reinforcement Learning — Unverified 0
Meta Attention For Off-Policy Actor-Critic Sep 29, 2021 continuous-control Continuous Control — Unverified 0
Meta-Cognition. An Inverse-Inverse Reinforcement Learning Approach for Cognitive Radars May 3, 2022 reinforcement-learning Reinforcement Learning (RL) — Unverified 0
Meta-CPR: Generalize to Unseen Large Number of Agents with Communication Pattern Recognition Module Dec 14, 2021 Meta Reinforcement Learning reinforcement-learning — Unverified 0
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL May 31, 2023 MuJoCo Reinforcement Learning (RL) — Unverified 0
MetaEMS: A Meta Reinforcement Learning-based Control Framework for Building Energy Management System Oct 23, 2022 energy management Management — Unverified 0
Meta-Gradient Reinforcement Learning with an Objective Discovered Online Jul 16, 2020 Deep Reinforcement Learning Q-Learning — Unverified 0
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning Jun 27, 2024 Reinforcement Learning (RL) — Unverified 0
Meta Inverse Reinforcement Learning via Maximum Reward Sharing for Human Motion Analysis Oct 7, 2017 reinforcement-learning Reinforcement Learning — Unverified 0
Meta-learners' learning dynamics are unlike learners' May 3, 2019 Meta-Learning Multi-Armed Bandits — Unverified 0
Meta-Learning for Multi-objective Reinforcement Learning Nov 8, 2018 Computational Efficiency continuous-control — Unverified 0
Meta-Learning surrogate models for sequential decision making Mar 28, 2019 Bayesian Optimisation Decision Making — Unverified 0