SOTAVerified

MuJoCo

Papers

Showing 501-550 of 677 papers

Title | Status | Hype
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies | | 0
Decorrelated Double Q-learning | | 0
Deep exploration by novelty-pursuit with maximum state entropy | | 0
Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks | | 0
DeepSafeMPC: Deep Learning-Based Model Predictive Control for Safe Multi-Agent Reinforcement Learning | | 0
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online | | 0
DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays | | 0
DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety | | 0
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback | | 0
Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study | | 0
DexDLO: Learning Goal-Conditioned Dexterous Policy for Dynamic Manipulation of Deformable Linear Objects | | 0
DIDA: Denoised Imitation Learning based on Domain Adaptation | | 0
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction | | 0
DisTop: Discovering a Topological representation to learn diverse and rewarding skills | | 0
Distributional Decision Transformer for Hindsight Information Matching | | 0
Distributionally Robust Statistical Verification with Imprecise Neural Networks | | 0
Diverse Imitation Learning via Self-Organizing Generative Models | | 0
dm_control: Software and Tasks for Continuous Control | | 0
DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes | | 0
Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation | | 0
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning | | 0
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization | | 0
Effects of sparse rewards of different magnitudes in the speed of learning of model-based actor critic methods | | 0
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning | | 0
Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration | | 0
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling | | 0
ELSIM: End-to-end learning of reusable skills through intrinsic motivation | | 0
Enhanced DACER Algorithm with High Diffusion Efficiency | | 0
EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning | | 0
Entropy Augmented Reinforcement Learning | | 0
Episodic Reinforcement Learning with Expanded State-reward Space | | 0
Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL | | 0
Evaluating Robustness of Cooperative MARL | | 0
Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication | | 0
Evolving Rewards to Automate Reinforcement Learning | | 0
Expected Policy Gradients | | 0
A Tractable Inference Perspective of Offline RL | | 0
Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator | | 0
Fast Adaptation to New Environments via Policy-Dynamics Value Functions | | 0
Fast Convergence of Softmax Policy Mirror Ascent | | 0
FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control | | 0
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments | | 0
Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts | | 0
Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming | | 0
Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization | | 0
First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation | | 0
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals | | 0
Formal Language Constrained Markov Decision Processes | | 0
FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility | | 0
From proprioception to long-horizon planning in novel environments: A hierarchical RL model | | 0

No leaderboard results yet.