SOTAVerified

MuJoCo

Papers

Showing 401–450 of 677 papers

Title | Status | Hype
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Code | 0
Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction | - | 0
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments | - | 0
Prompting Decision Transformer for Few-Shot Policy Generalization | - | 0
CGAR: Critic Guided Action Redistribution in Reinforcement Leaning | Code | 0
Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming | - | 0
Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis | - | 0
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning | - | 0
Relative Policy-Transition Optimization for Fast Policy Transfer | - | 0
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies | - | 0
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning | - | 0
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble | - | 0
Multi-Object Grasping in the Plane | - | 0
TaSIL: Taylor Series Imitation Learning | Code | 0
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning | Code | 0
SEREN: Knowing When to Explore and When to Exploit | - | 0
Data Valuation for Offline Reinforcement Learning | - | 0
Imitation Learning from Observations under Transition Model Disparity | Code | 0
A Computational Theory of Learning Flexible Reward-Seeking Behavior with Place Cells | - | 0
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization | - | 0
Hierarchical Reinforcement Learning of Locomotion Policies in Response to Approaching Objects: A Preliminary Study | - | 0
Safe adaptation in multiagent competition | - | 0
Context is Everything: Implicit Identification for Dynamics Adaptation | - | 0
AutoDIME: Automatic Design of Interesting Multi-Agent Environments | - | 0
A Recurrent Differentiable Engine for Modeling Tensegrity Robots Trainable with Low-Frequency Data | - | 0
User-Oriented Robust Reinforcement Learning | - | 0
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning | Code | 0
STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence | - | 0
Recursive Least Squares Advantage Actor-Critic Algorithms | - | 0
Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | - | 0
Self Reward Design with Fine-grained Interpretability | Code | 0
Multiagent Model-based Credit Assignment for Continuous Control | - | 0
CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning | Code | 0
Continuous Control With Ensemble Deep Deterministic Policy Gradients | Code | 0
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning | - | 0
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance | - | 0
Improving Learning from Demonstrations by Learning from Experience | - | 0
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving | - | 0
V-MAO: Generative Modeling for Multi-Arm Manipulation of Articulated Objects | - | 0
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods | Code | 0
Smooth Imitation Learning via Smooth Costs and Smooth Policies | - | 0
Policy Search using Dynamic Mirror Descent MPC for Model Free Off Policy RL | - | 0
CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric | - | 0
Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Code | 0
On-Policy Model Errors in Reinforcement Learning | - | 0
Wasserstein Unsupervised Reinforcement Learning | - | 0
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation | - | 0
Generalized Maximum Entropy Reinforcement Learning via Reward Shaping | - | 0
Auto-Encoding Inverse Reinforcement Learning | - | 0
Distributional Decision Transformer for Hindsight Information Matching | - | 0
Page 9 of 14

No leaderboard results yet.