SOTAVerified

MuJoCo

Papers

Showing 1-50 of 677 papers

| Title | Status | Hype |
| --- | --- | --- |
| Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback | | 0 |
| Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound | | 0 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Code | 1 |
| Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study | | 0 |
| Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation | | 0 |
| Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains | Code | 0 |
| rQdia: Regularizing Q-Value Distributions With Image Augmentation | | 0 |
| Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration | | 0 |
| Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Code | 0 |
| ADDQ: Adaptive Distributional Double Q-Learning | Code | 0 |
| Hard Contacts with Soft Gradients: Refining Differentiable Simulators for Learning and Control | | 0 |
| The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning | | 0 |
| Wasserstein Barycenter Soft Actor-Critic | | 0 |
| Modular Recurrence in Contextual MDPs for Universal Morphology Control | | 0 |
| MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning | Code | 0 |
| Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation | | 0 |
| LLMs for sensory-motor control: Combining in-context and iterative learning | Code | 0 |
| Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning | | 0 |
| ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning | | 0 |
| Enhanced DACER Algorithm with High Diffusion Efficiency | | 0 |
| FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control | | 0 |
| Collision- and Reachability-Aware Multi-Robot Control with Grounded LLM Planners | | 0 |
| Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network | | 0 |
| Reinforcement Learning for Ballbot Navigation in Uneven Terrain | Code | 1 |
| LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models | | 0 |
| Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning | | 0 |
| Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | | 0 |
| Offline Multi-agent Reinforcement Learning via Score Decomposition | | 0 |
| Model Tensor Planning | Code | 1 |
| Directly Forecasting Belief for Reinforcement Learning with Delays | Code | 0 |
| Variational OOD State Correction for Offline Reinforcement Learning | | 0 |
| Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision | | 0 |
| Learning Transferable Friction Models and LuGre Identification via Physics Informed Neural Networks | | 0 |
| Adapting World Models with Latent-State Dynamics Residuals | | 0 |
| Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning | | 0 |
| Handling Delay in Real-Time Reinforcement Learning | Code | 0 |
| Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation | Code | 0 |
| Adventurer: Exploration with BiGAN for Deep Reinforcement Learning | | 0 |
| CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning | | 0 |
| Likelihood Reward Redistribution | | 0 |
| Application of linear regression method to the deep reinforcement learning in continuous action cases | Code | 0 |
| Residual Policy Gradient: A Reward View of KL-regularized Objective | | 0 |
| An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation | Code | 1 |
| AVG-DICE: Stationary Distribution Correction by Regression | | 0 |
| SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning | | 0 |
| IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic | | 0 |
| Offline Reinforcement Learning via Inverse Optimization | Code | 0 |
| RIZE: Regularized Imitation Learning via Distributional Reinforcement Learning | Code | 0 |
| Yes, Q-learning Helps Offline In-Context RL | | 0 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Code | 0 |

No leaderboard results yet.