SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 54515475 of 15113 papers

TitleStatusHype
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations0
Robotic Offline RL from Internet Videos via Value-Function Pre-Training0
Robotic Search & Rescue via Online Multi-task Reinforcement Learning0
Robotic self-representation improves manipulation skills and transfer learning0
Robotic Table Tennis with Model-Free Reinforcement Learning0
Robotic Table Wiping via Reinforcement Learning and Whole-body Trajectory Optimization0
Robotic Tracking Control with Kernel Trick-based Reinforcement Learning0
Robot in a China Shop: Using Reinforcement Learning for Location-Specific Navigation Behaviour0
Robot Learning of Mobile Manipulation with Reachability Behavior Priors0
Robot Navigation with Reinforcement Learned Path Generation and Fine-Tuned Motion Control0
Robot path planning using deep reinforcement learning0
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination0
Robot Representation and Reasoning with Knowledge from Reinforcement Learning0
Robots and Children that Learn Together : Improving Knowledge Retention by Teaching Peer-Like Interactive Robots0
Robot See, Robot Do: Imitation Reward for Noisy Financial Environments0
Robot Sound Interpretation: Combining Sight and Sound in Learning-Based Control0
Robust Action Governor for Uncertain Piecewise Affine Systems with Non-convex Constraints and Safe Reinforcement Learning0
Robust Adversarial Attacks Detection based on Explainable Deep Reinforcement Learning For UAV Guidance and Planning0
Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula0
Robust Algorithmic Collusion0
Robust Android Malware Detection System against Adversarial Attacks using Q-Learning0
Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for MANETs0
Robust and Versatile Bipedal Jumping Control through Reinforcement Learning0
Robusta: Robust AutoML for Feature Selection via Reinforcement Learning0
Robust Auto-landing Control of an agile Regional Jet Using Fuzzy Q-learning0
Show:102550
← PrevPage 219 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified