SOTAVerified

MuJoCo

Papers

Showing 151-200 of 677 papers

Title | Status | Hype
A novel DDPG method with prioritized experience replay | Code | 0
MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning | Code | 0
Out-of-Dynamics Imitation Learning from Multimodal Demonstrations | Code | 0
Pontryagin Optimal Control via Neural Networks | Code | 0
Calibrated Model-Based Deep Reinforcement Learning | Code | 0
On Rollouts in Model-Based Reinforcement Learning | Code | 0
Off-Policy Average Reward Actor-Critic with Deterministic Policy Search | Code | 0
An Invariant Information Geometric Method for High-Dimensional Online Optimization | Code | 0
On Learning Intrinsic Rewards for Policy Gradient Methods | Code | 0
On the Design of Safe Continual RL Methods for Control of Nonlinear Systems | Code | 0
Bootstrapping the Expressivity with Model-based Planning | Code | 0
A dynamical clipping approach with task feedback for Proximal Policy Optimization | Code | 0
On the Expressivity of Neural Networks for Deep Reinforcement Learning | Code | 0
Offline Reinforcement Learning via Inverse Optimization | Code | 0
On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement Learning | Code | 0
Adaptive Exploration for Data-Efficient General Value Function Evaluations | Code | 0
Novel Policy Seeking with Constrained Optimization | Code | 0
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization | Code | 0
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks | Code | 0
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning | Code | 0
No Need for Interactions: Robust Model-Based Imitation Learning using Neural ODE | Code | 0
Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies | Code | 0
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning | Code | 0
dm_control: Software and Tasks for Continuous Control | Code | 0
MuJoCo: A physics engine for model-based control | Code | 0
NerveNet: Learning Structured Policy with Graph Neural Networks | Code | 0
On the Perturbed States for Transformed Input-robust Reinforcement Learning | Code | 0
Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Code | 0
MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning | Code | 0
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Code | 0
Lyapunov-based Safe Policy Optimization for Continuous Control | Code | 0
Bayesian Policy Gradients via Alpha Divergence Dropout Inference | Code | 0
LLMs for sensory-motor control: Combining in-context and iterative learning | Code | 0
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Code | 0
Online Reinforcement Learning in Non-Stationary Context-Driven Environments | Code | 0
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Code | 0
Directly Forecasting Belief for Reinforcement Learning with Delays | Code | 0
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online | Code | 0
Leveraging exploration in off-policy algorithms via normalizing flows | Code | 0
Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Code | 0
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios | Code | 0
Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards | Code | 0
Learning Powerful Policies by Using Consistent Dynamics Model | Code | 0
Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Code | 0
Decision Transformer under Random Frame Dropping | Code | 0
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics | Code | 0
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Code | 0
Learning non-Markovian Decision-Making from State-only Sequences | Code | 0
Learning to Play Cup-and-Ball with Noisy Camera Observations | Code | 0
A Generalized Training Approach for Multiagent Learning | Code | 0

No leaderboard results yet.