SOTAVerified

MuJoCo

Papers

Showing 551–600 of 677 papers

| Title | Status | Hype |
|---|---|---|
| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Code | 1 |
| Universal Successor Features for Transfer Reinforcement Learning | | 0 |
| Fast Adaptation to New Environments via Policy-Dynamics Value Functions | | 0 |
| Inferring DQN structure for high-dimensional continuous control | | 0 |
| Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning | | 0 |
| Parareal with a Learned Coarse Model for Robotic Manipulation | | 0 |
| Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online | | 0 |
| MANGA: Method Agnostic Neural-policy Generalization and Adaptation | | 0 |
| Gradientless Descent: High-Dimensional Zeroth-Order Optimization | | 0 |
| Multi-Path Policy Optimization | | 0 |
| Asynchronous Methods for Model-Based Reinforcement Learning | Code | 0 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Code | 0 |
| Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales | Code | 0 |
| VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning | Code | 1 |
| On the Expressivity of Neural Networks for Deep Reinforcement Learning | Code | 0 |
| Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards | Code | 0 |
| Multi-step Greedy Reinforcement Learning Algorithms | | 0 |
| Learning Calibratable Policies using Programmatic Style-Consistency | Code | 0 |
| Formal Language Constraints for Markov Decision Processes | Code | 0 |
| Improving Sample Efficiency in Model-Free Reinforcement Learning from Images | Code | 1 |
| Learning from Observations Using a Single Video Demonstration and Human Feedback | | 0 |
| A Generalized Training Approach for Multiagent Learning | | 0 |
| Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation | | 0 |
| Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | | 0 |
| Deep exploration by novelty-pursuit with maximum state entropy | | 0 |
| Regulatory Focus: Promotion and Prevention Inclinations in Policy Search | | 0 |
| Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning | | 0 |
| CrossNorm: On Normalization for Off-Policy Reinforcement Learning | | 0 |
| Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients | | 0 |
| Bootstrapping the Expressivity with Model-based Planning | Code | 0 |
| Policy Tree Network | | 0 |
| Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning | | 0 |
| Learning Latent Representations for Inverse Dynamics using Generalized Experiences | | 0 |
| Safe Policy Learning for Continuous Control | | 0 |
| Multi-task Batch Reinforcement Learning with Metric Learning | | 0 |
| MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning | Code | 0 |
| Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning | | 0 |
| Biased Estimates of Advantages over Path Ensembles | | 0 |
| Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space | | 0 |
| Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning | | 0 |
| Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning | Code | 0 |
| Skill Transfer in Deep Reinforcement Learning under Morphological Heterogeneity | | 0 |
| Towards Model-based Reinforcement Learning for Industry-near Environments | Code | 0 |
| A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment | | 0 |
| Learning Policies through Quantile Regression | | 0 |
| ORRB -- OpenAI Remote Rendering Backend | Code | 0 |
| Exploring Model-based Planning with Policy Networks | Code | 0 |
| Calibrated Model-Based Deep Reinforcement Learning | Code | 0 |
| Reward Prediction Error as an Exploration Objective in Deep RL | | 0 |
| Robust Reinforcement Learning for Continuous Control with Model Misspecification | | 0 |

No leaderboard results yet.