SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 52765300 of 15113 papers

TitleStatusHype
Improved Sample Complexity for Stochastic Compositional Variance Reduced Gradient0
Improved Regret for Differentially Private Exploration in Linear MDP0
Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation0
Improved Reinforcement Learning through Imitation Learning Pretraining Towards Image-based Autonomous Driving0
Improved Reinforcement Learning with Curriculum0
Improved Robustness and Safety for Autonomous Vehicle Control with Adversarial Reinforcement Learning0
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs0
Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration0
Improvements on Hindsight Learning0
Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method0
Improving generalization to new environments and removing catastrophic forgetting in Reinforcement Learning by using an eco-system of agents0
Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark0
Improving a sequence-to-sequence nlp model using a reinforcement learning policy algorithm0
Improving Assistive Robotics with Deep Reinforcement Learning0
Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning0
Improving Cost Learning for JPEG Steganography by Exploiting JPEG Domain Knowledge0
Improving Deep Reinforcement Learning in Minecraft with Action Advice0
Improving Document Image Understanding with Reinforcement Finetuning0
Improving Exploration of Deep Reinforcement Learning using Planning for Policy Search0
Improving Fictitious Play Reinforcement Learning with Expanding Models0
Improving gearshift controllers for electric vehicles with reinforcement learning0
Improving Generalization in Intent Detection: GRPO with Reward-Based Curriculum Sampling0
Improving Generalization in Meta Reinforcement Learning using Learned Objectives0
Improving generalization in reinforcement learning through forked agents0
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning0
Show:102550
← PrevPage 212 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified