SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
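The interaction loop described above can be sketched with tabular Q-learning, which is only one of many RL algorithms and is used here purely for illustration. Everything in the block is an assumption made for the example and does not come from any paper listed on this page: the five-state chain environment, the `step`, `train`, and `greedy_policy` helper names, and the hyperparameters.

```python
import random

N_STATES = 5          # hypothetical chain of states 0..4; state 4 is the goal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment (assumed for illustration): returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    # The only feedback signal is a reward of 1.0 on reaching the goal.
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn action values by interacting with the environment."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.choice(ACTIONS)
            else:
                # Exploit: pick a best-known action, breaking ties at random.
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning update toward reward plus discounted future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Policy that takes the highest-valued action in each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
```

In this toy setting the learned greedy policy chooses "right" (action 1) in every non-terminal state, since that is the only way to reach the reward. The discount factor `gamma` is what makes the agent value long-term reward rather than only the immediate one.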

Papers

Showing 3626 to 3650 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning | | 0 |
| Domain Randomization for Robust, Affordable and Effective Closed-loop Control of Soft Robots | | 0 |
| Decoupling Skill Learning from Robotic Control for Generalizable Object Manipulation | | 0 |
| adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems | | 0 |
| Graph Decision Transformer | | 0 |
| Evolutionary Reinforcement Learning: A Survey | | 0 |
| Learning Bipedal Walking for Humanoids with Current Feedback | Code | 3 |
| On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples | | 0 |
| Dexterous In-hand Manipulation by Guiding Exploration with Simple Sub-skill Controllers | | 0 |
| Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments | | 0 |
| Reinforcement Learning Based Self-play and State Stacking Techniques for Noisy Air Combat Environment | | 0 |
| Perspectives on the Social Impacts of Reinforcement Learning with Human Feedback | | 0 |
| Safe Reinforcement Learning via Probabilistic Logic Shields | Code | 0 |
| MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning | | 0 |
| Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning | Code | 0 |
| Sparsity-Aware Intelligent Massive Random Access Control in Open RAN: A Reinforcement Learning Based Approach | | 0 |
| Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control Tasks | Code | 0 |
| Ensemble Reinforcement Learning: A Survey | | 0 |
| Bounding the Optimal Value Function in Compositional Reinforcement Learning | Code | 0 |
| Local Environment Poisoning Attacks on Federated Reinforcement Learning | | 0 |
| CFlowNets: Continuous Control with Generative Flow Networks | Code | 0 |
| Look-Ahead AC Optimal Power Flow: A Model-Informed Reinforcement Learning Approach | | 0 |
| Double A3C: Deep Reinforcement Learning on OpenAI Gym Games | | 0 |
| Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control | | 0 |
| Neural Airport Ground Handling | Code | 1 |
Page 146 of 605

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |