SOTAVerified

MuJoCo

Papers

Showing 126150 of 677 papers

TitleStatusHype
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy OptimizationCode2
Adaptive Q-Network: On-the-fly Target Selection for Deep Reinforcement Learning0
Diffusion Actor-Critic with Entropy RegulatorCode2
Variational Delayed Policy OptimizationCode0
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing FlowCode1
Learning rigid-body simulators over implicit shapes for large-scale scenes and vision0
Pure Planning to Pure Policies and In Between with a Recursive Tree Planner0
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?Code0
Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action SpaceCode0
Adaptive Exploration for Data-Efficient General Value Function EvaluationsCode0
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline0
Hard-Thresholding Meets Evolution Strategies in Reinforcement LearningCode0
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient ManipulationCode5
S^2AC: Energy-Based Reinforcement Learning with Stein Soft Actor CriticCode1
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure0
Markov flow policy -- deep MC0
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPOCode1
UCB-driven Utility Function Search for Multi-objective Reinforcement LearningCode1
Closed Loop Interactive Embodied Reasoning for Robot Manipulation0
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis0
Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real TransferCode5
DIDA: Denoised Imitation Learning based on Domain Adaptation0
Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process0
Robust Model Based Reinforcement Learning Using L_1 Adaptive Control0
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization0
Show:102550
← PrevPage 6 of 28Next →

No leaderboard results yet.