SOTAVerified

MuJoCo

Papers

Showing 251300 of 677 papers

TitleStatusHype
Coagent Networks: Generalized and Scaled0
Learning Constraint Network from Demonstrations via Positive-Unlabeled Learning with Memory Replay0
Learning Loss Landscapes in Preference Optimization0
Generalized Maximum Entropy Reinforcement Learning via Reward Shaping0
Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks0
Balancing Constraints and Rewards with Meta-Gradient D4PG0
Deep exploration by novelty-pursuit with maximum state entropy0
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance0
Decorrelated Double Q-learning0
Adapting Double Q-Learning for Continuous Reinforcement Learning0
AgentMixer: Multi-Agent Correlated Policy Factorization0
Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations0
Learning Efficient and Effective Exploration Policies with Counterfactual Meta Policy0
Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning0
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies0
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning0
Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts0
Data Valuation for Offline Reinforcement Learning0
A Game-Theoretic Perspective of Generalization in Reinforcement Learning0
Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation0
AVG-DICE: Stationary Distribution Correction by Regression0
CrossNorm: On Normalization for Off-Policy Reinforcement Learning0
Average-Reward Reinforcement Learning with Trust Region Methods0
SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning0
Learning from Observations Using a Single Video Demonstration and Human Feedback0
Learning rigid-body simulators over implicit shapes for large-scale scenes and vision0
Cross-Domain Imitation Learning with a Dual Structure0
Cooperative Multi-Agent Deep Reinforcement Learning in Content Ranking Optimization0
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization0
Cooperative Heterogeneous Deep Reinforcement Learning0
Auto-Encoding Inverse Reinforcement Learning0
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling0
AutoDIME: Automatic Design of Interesting Multi-Agent Environments0
Active Reinforcement Learning Strategies for Offline Policy Improvement0
A Unifying Framework for Causal Imitation Learning with Hidden Confounders0
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework0
Language to Rewards for Robotic Skill Synthesis0
Continuous Neural Algorithmic Planners0
Continuous Mean-Zero Disagreement-Regularized Imitation Learning (CMZ-DRIL)0
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment0
Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method0
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization0
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience0
Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates0
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning0
Adversarial Imitation Learning via Random Search0
Imitation Learning from Video by Leveraging Proprioception0
Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning0
Continuous Control for Searching and Planning with a Learned Model0
Contextual Transformer for Offline Meta Reinforcement Learning0
Show:102550
← PrevPage 6 of 14Next →

No leaderboard results yet.