| Learn a Prior for RHEA for Better Online Planning | Feb 14, 2019 | Evolutionary Algorithms, MuJoCo | —Unverified | 0 | 0 |
| Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations | Mar 29, 2023 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 | 0 |
| Learning Efficient and Effective Exploration Policies with Counterfactual Meta Policy | May 28, 2019 | counterfactual, Efficient Exploration | —Unverified | 0 | 0 |
| Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning | Nov 28, 2022 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Learning from Observations Using a Single Video Demonstration and Human Feedback | Sep 29, 2019 | MuJoCo | —Unverified | 0 | 0 |
| Learning Constraint Network from Demonstrations via Positive-Unlabeled Learning with Memory Replay | Jul 23, 2024 | MuJoCo | —Unverified | 0 | 0 |
| Learning Intrinsic Symbolic Rewards in Reinforcement Learning | Oct 8, 2020 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 | 0 |
| Learning Latent Representations for Inverse Dynamics using Generalized Experiences | Sep 25, 2019 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 | 0 |
| Learning Loss Landscapes in Preference Optimization | Nov 10, 2024 | MuJoCo | —Unverified | 0 | 0 |
| Learning rigid-body simulators over implicit shapes for large-scale scenes and vision | May 22, 2024 | MuJoCo | —Unverified | 0 | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Learning to enhance multi-legged robot on rugged landscapes | Sep 14, 2024 | MuJoCo | —Unverified | 0 | 0 |
| Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning | Feb 20, 2017 | Car Racing, Decision Making | —Unverified | 0 | 0 |
| Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping | Nov 5, 2020 | MuJoCo, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Learning Transferable Friction Models and LuGre Identification via Physics Informed Neural Networks | Apr 16, 2025 | Computational Efficiency, Friction | —Unverified | 0 | 0 |
| Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains | Feb 9, 2024 | Depth Estimation, MuJoCo | —Unverified | 0 | 0 |
| LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios | Oct 12, 2023 | Board Games, Decision Making | —Unverified | 0 | 0 |
| Likelihood Reward Redistribution | Mar 20, 2025 | MuJoCo | —Unverified | 0 | 0 |
| LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models | May 21, 2025 | MuJoCo, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning | Feb 8, 2025 | MuJoCo, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Lyceum: An efficient and scalable ecosystem for robot learning | Jan 21, 2020 | Model Predictive Control, MuJoCo | —Unverified | 0 | 0 |
| MANGA: Method Agnostic Neural-policy Generalization and Adaptation | Nov 19, 2019 | Imitation Learning, MuJoCo | —Unverified | 0 | 0 |
| Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning | Aug 17, 2024 | Density Estimation, Imitation Learning | —Unverified | 0 | 0 |
| Markov flow policy -- deep MC | May 1, 2024 | MuJoCo | —Unverified | 0 | 0 |
| Masked Imitation Learning: Discovering Environment-Invariant Modalities in Multimodal Demonstrations | Sep 16, 2022 | Decision Making, Imitation Learning | —Unverified | 0 | 0 |
| Maximizing Ensemble Diversity in Deep Reinforcement Learning | Sep 29, 2021 | Atari Games, Decision Making | —Unverified | 0 | 0 |
| Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation | Jul 25, 2024 | MuJoCo | —Unverified | 0 | 0 |
| Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees | Oct 4, 2022 | counterfactual, Imitation Learning | —Unverified | 0 | 0 |
| Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning | Jun 15, 2022 | Autonomous Driving, continuous-control | —Unverified | 0 | 0 |
| Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning | May 29, 2025 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 | 0 |
| Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents | Jun 18, 2024 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure | May 1, 2024 | Efficient Exploration, MuJoCo | —Unverified | 0 | 0 |
| MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL | May 31, 2023 | MuJoCo, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning | Apr 29, 2023 | Meta Reinforcement Learning, MuJoCo | —Unverified | 0 | 0 |
| Meta-Reinforcement Learning via Exploratory Task Clustering | Feb 15, 2023 | Clustering, Meta Reinforcement Learning | —Unverified | 0 | 0 |
| Meta Reinforcement Learning with Distribution of Exploration Parameters Learned by Evolution Strategies | Dec 29, 2018 | Meta-Learning, Meta Reinforcement Learning | —Unverified | 0 | 0 |
| Mind's Eye: Grounded Language Model Reasoning through Simulation | Oct 11, 2022 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Model-based Adversarial Imitation Learning | Dec 7, 2016 | Imitation Learning, model | —Unverified | 0 | 0 |
| Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments | Oct 4, 2024 | MuJoCo | —Unverified | 0 | 0 |
| Model-Invariant State Abstractions for Model-Based Reinforcement Learning | Feb 19, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| MQES: Max-Q Entropy Search for Efficient Exploration in Continuous Reinforcement Learning | Jan 1, 2021 | Efficient Exploration, MuJoCo | —Unverified | 0 | 0 |
| Multi-Object Grasping in the Plane | Jun 1, 2022 | MuJoCo, Object | —Unverified | 0 | 0 |
| Multi-Objective Algorithms for Learning Open-Ended Robotic Problems | Nov 11, 2024 | Diversity, Evolutionary Algorithms | —Unverified | 0 | 0 |
| Multi-Path Policy Optimization | Nov 11, 2019 | Deep Reinforcement Learning, Efficient Exploration | —Unverified | 0 | 0 |
| Multi-step Greedy Reinforcement Learning Algorithms | Oct 7, 2019 | Continuous Control, Game of Go | —Unverified | 0 | 0 |
| Multi-task Reinforcement Learning with a Planning Quasi-Metric | Feb 8, 2020 | MuJoCo, reinforcement-learning | —Unverified | 0 | 0 |
| Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning | Sep 11, 2019 | MuJoCo, Q-Learning | —Unverified | 0 | 0 |
| NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning | Dec 21, 2018 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Neural Episodic Control with State Abstraction | Jan 27, 2023 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 | 0 |
| Neural Population Learning beyond Symmetric Zero-sum Games | Jan 10, 2024 | MuJoCo, Transfer Learning | —Unverified | 0 | 0 |