SOTAVerified

MuJoCo

Papers

Showing 601650 of 677 papers

TitleStatusHype
Action Robust Reinforcement Learning and Applications in Continuous ControlCode0
Off-Policy Average Reward Actor-Critic with Deterministic Policy SearchCode0
Balancing Value Underestimation and Overestimation with Realistic Actor-CriticCode0
Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy OptimizationCode0
On Learning Intrinsic Rewards for Policy Gradient MethodsCode0
Application of linear regression method to the deep reinforcement learning in continuous action casesCode0
Constrained Intrinsic Motivation for Reinforcement LearningCode0
Mimicking Better by Matching the Approximate Action DistributionCode0
Scalable Kernel Inverse OptimizationCode0
On Rollouts in Model-Based Reinforcement LearningCode0
Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted RegressionCode0
On the Design of Safe Continual RL Methods for Control of Nonlinear SystemsCode0
On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement LearningCode0
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy ImitationCode0
On the Perturbed States for Transformed Input-robust Reinforcement LearningCode0
On the Reuse Bias in Off-Policy Reinforcement LearningCode0
Collaborative Evolutionary Reinforcement LearningCode0
Fat-to-Thin Policy Optimization: Offline RL with Sparse PoliciesCode0
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from ObservationsCode0
Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environmentCode0
ORRB -- OpenAI Remote Rendering BackendCode0
Exploring Model-based Planning with Policy NetworksCode0
Out-of-Dynamics Imitation Learning from Multimodal DemonstrationsCode0
Understanding Adversarial Attacks on Observations in Deep Reinforcement LearningCode0
Self-Imitation LearningCode0
Self-Imitation Learning for Robot Tasks with Sparse and Delayed RewardsCode0
P3O: Policy-on Policy-off Policy OptimizationCode0
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy GradientsCode0
Self Reward Design with Fine-grained InterpretabilityCode0
Explaining RL Decisions with TrajectoriesCode0
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement LearningCode0
A novel DDPG method with prioritized experience replayCode0
Client Selection for Federated Policy Optimization with Environment HeterogeneityCode0
ADDQ: Adaptive Distributional Double Q-LearningCode0
PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement LearningCode0
MOBODY: Model Based Off-Dynamics Offline Reinforcement LearningCode0
Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement LearningCode0
Unifying Variational Inference and PAC-Bayes for Supervised Learning that ScalesCode0
CGAR: Critic Guided Action Redistribution in Reinforcement LeaningCode0
CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement LearningCode0
Policy Optimization with Second-Order Advantage InformationCode0
WALL-E: An Efficient Reinforcement Learning Research FrameworkCode0
Expert Proximity as Surrogate Rewards for Single Demonstration Imitation LearningCode0
An Invariant Information Geometric Method for High-Dimensional Online OptimizationCode0
Simple Noisy Environment Augmentation for Reinforcement LearningCode0
Calibrated Model-Based Deep Reinforcement LearningCode0
Pontryagin Optimal Control via Neural NetworksCode0
Simple random search of static linear policies is competitive for reinforcement learningCode0
Bootstrapping the Expressivity with Model-based PlanningCode0
Evolutionary Stochastic Policy DistillationCode0
Show:102550
← PrevPage 13 of 14Next →

No leaderboard results yet.