SOTAVerified

MuJoCo

Papers

Showing 151200 of 677 papers

TitleStatusHype
CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture0
Careful at Estimation and Bold at Exploration0
Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines?0
CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning0
A Computational Theory of Learning Flexible Reward-Seeking Behavior with Place Cells0
Formal Language Constrained Markov Decision Processes0
Gaussian Process Policy Optimization0
Good Better Best: Self-Motivated Imitation Learning for noisy Demonstrations0
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning0
Multiagent Model-based Credit Assignment for Continuous Control0
Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains0
Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO)0
Adaptive Q-Network: On-the-fly Target Selection for Deep Reinforcement Learning0
An Intelligent Social Learning-based Optimization Strategy for Black-box Robotic Control with Reinforcement Learning0
Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization0
A Computational Model of Learning Flexible Navigation in a Maze by Layout-Conforming Replay of Place Cells0
First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation0
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals0
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States0
BlockPuzzle - A Challenge in Physical Reasoning and Generalization for Robot Learning0
Sim2Sim Evaluation of a Novel Data-Efficient Differentiable Physics Engine for Tensegrity Robots0
Adaptive N-step Bootstrapping with Off-policy Data0
Biased Estimates of Advantages over Path Ensembles0
Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble0
Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts0
An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients0
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem0
FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control0
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration0
Diverse Imitation Learning via Self-OrganizingGenerative Models0
Distributionally Robust Statistical Verification with Imprecise Neural Networks0
Fast Convergence of Softmax Policy Mirror Ascent0
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments0
Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming0
Gradientless Descent: High-Dimensional Zeroth-Order Optimization0
Distributional Decision Transformer for Hindsight Information Matching0
DisTop: Discovering a Topological representation to learn diverse and rewarding skills0
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning0
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction0
Benchmarking the Sim-to-Real Gap in Cloth Manipulation0
ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation0
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback0
Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning0
DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes0
Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis0
Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation0
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning0
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning0
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization0
DIDA: Denoised Imitation Learning based on Domain Adaptation0
Show:102550
← PrevPage 4 of 14Next →

No leaderboard results yet.