SOTAVerified

MuJoCo

Papers

Showing 351400 of 677 papers

TitleStatusHype
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting0
Meta-Reinforcement Learning via Exploratory Task Clustering0
CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning0
Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy OptimizationCode0
Online Reinforcement Learning in Non-Stationary Context-Driven EnvironmentsCode0
Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO)0
Neural Episodic Control with State Abstraction0
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over DropoutCode0
Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation0
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework0
Genetic Imitation Learning by Reward Extrapolation0
Contextual Conservative Q-Learning for Offline Reinforcement Learning0
Pontryagin Optimal Control via Neural NetworksCode0
On the Geometry of Reinforcement Learning in Continuous State and Action Spaces0
Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling0
Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks0
Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble0
First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation0
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed DatasetsCode0
Time-Efficient Reward Learning via Visually Assisted Cluster Ranking0
Continuous Neural Algorithmic Planners0
Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning0
On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement LearningCode0
Contextual Transformer for Offline Meta Reinforcement Learning0
Out-of-Dynamics Imitation Learning from Multimodal DemonstrationsCode0
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling0
Reward Shaping Using Convolutional Neural Network0
Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive Games0
Group Distributionally Robust Reinforcement Learning with Hierarchical Latent Variables0
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation0
Simple Emergent Action Representations from Multi-Task Policy Training0
Policy Gradient With Serial Markov Chain Reasoning0
Mind's Eye: Grounded Language Model Reasoning through Simulation0
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees0
Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees0
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States0
On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies0
A Computational Model of Learning Flexible Navigation in a Maze by Layout-Conforming Replay of Place Cells0
Masked Imitation Learning: Discovering Environment-Invariant Modalities in Multimodal Demonstrations0
Value Summation: A Novel Scoring Function for MPC-based Model-based Reinforcement Learning0
On the Reuse Bias in Off-Policy Reinforcement LearningCode0
Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction0
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization0
Improving Sample Efficiency in Evolutionary RL Using Off-Policy Ranking0
Entropy Augmented Reinforcement Learning0
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games0
A Game-Theoretic Perspective of Generalization in Reinforcement Learning0
Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts0
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL0
Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain RandomizationCode0
Show:102550
← PrevPage 8 of 14Next →

No leaderboard results yet.