SOTAVerified

MuJoCo

Papers

Showing 351400 of 677 papers

TitleStatusHype
Value Gradient weighted Model-Based Reinforcement LearningCode1
Hierarchical Reinforcement Learning of Locomotion Policies in Response to Approaching Objects: A Preliminary Study0
Safe adaptation in multiagent competition0
Context is Everything: Implicit Identification for Dynamics Adaptation0
AutoDIME: Automatic Design of Interesting Multi-Agent Environments0
A Recurrent Differentiable Engine for Modeling Tensegrity Robots Trainable with Low-Frequency Data0
User-Oriented Robust Reinforcement Learning0
Deconstructing the Inductive Biases of Hamiltonian Neural NetworksCode1
Lipschitz-constrained Unsupervised Skill DiscoveryCode1
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement LearningCode0
STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence0
Recursive Least Squares Advantage Actor-Critic Algorithms0
Comparing Model-free and Model-based Algorithms for Offline Reinforcement LearningCode0
SimSR: Simple Distance-based State Representation for Deep Reinforcement LearningCode1
Self Reward Design with Fine-grained InterpretabilityCode0
Multiagent Model-based Credit Assignment for Continuous Control0
CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement LearningCode0
OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical LocomotionCode1
Residual Pathway Priors for Soft Equivariance ConstraintsCode1
Offline Model-based Adaptable Policy LearningCode1
EDGE: Explaining Deep Reinforcement Learning PoliciesCode1
Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement LearningCode1
Continuous Control With Ensemble Deep Deterministic Policy GradientsCode0
Generalized Decision Transformer for Offline Hindsight Information MatchingCode1
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning0
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance0
Improving Learning from Demonstrations by Learning from Experience0
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving0
V-MAO: Generative Modeling for Multi-Arm Manipulation of Articulated Objects0
Robust Deep Reinforcement Learning for Quadcopter ControlCode1
Time Discretization-Invariant Safe Action Repetition for Policy Gradient MethodsCode0
Smooth Imitation Learning via Smooth Costs and Smooth Policies0
Conditioning Sparse Variational Gaussian Processes for Online Decision-makingCode1
Policy Search using Dynamic Mirror Descent MPC for Model Free Off Policy RL0
CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric0
Balancing Value Underestimation and Overestimation with Realistic Actor-CriticCode0
Wasserstein Unsupervised Reinforcement Learning0
On-Policy Model Errors in Reinforcement Learning0
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation0
Multi-Agent Constrained Policy OptimisationCode1
Generalized Maximum Entropy Reinforcement Learning via Reward Shaping0
Hypothesis Driven Coordinate Ascent for Reinforcement Learning0
Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts0
Auto-Encoding Inverse Reinforcement Learning0
Maximizing Ensemble Diversity in Deep Reinforcement Learning0
SPP-RL: State Planning Policy Reinforcement Learning0
OVD-Explorer: A General Information-theoretic Exploration Approach for Reinforcement Learning0
Distributional Decision Transformer for Hindsight Information Matching0
Diverse Imitation Learning via Self-OrganizingGenerative Models0
Evaluating Robustness of Cooperative MARL0
Show:102550
← PrevPage 8 of 14Next →

No leaderboard results yet.