SOTAVerified|Agents Browse Leaderboard About Blog

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3281–3290 of 15113 papers

Title	Date	Tasks	Status	Hype
Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation	Jun 9, 2023	Policy Gradient Methodsreinforcement-learning	—Unverified	0
The Role of Diverse Replay for Generalisation in Reinforcement Learning	Jun 9, 2023	Diversityreinforcement-learning	—Unverified	0
On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning	Jun 9, 2023	Reinforcement Learning (RL)Representation Learning	CodeCode Available	1
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel	Jun 9, 2023	Decision Makingreinforcement-learning	—Unverified	0
Approximate information state based convergence analysis of recurrent Q-learning	Jun 9, 2023	Q-LearningReinforcement Learning (RL)	—Unverified	0
Learning Not to Spoof	Jun 9, 2023	Reinforcement Learning (RL)	—Unverified	0
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning	Jun 9, 2023	D4RLOffline RL	—Unverified	0
An End-to-End Reinforcement Learning Approach for Job-Shop Scheduling Problems Based on Constraint Programming	Jun 9, 2023	Combinatorial OptimizationFeature Engineering	CodeCode Available	1
Decoupled Prioritized Resampling for Offline RL	Jun 8, 2023	Offline RLReinforcement Learning (RL)	CodeCode Available	1
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning	Jun 8, 2023	Decision MakingOffline RL	—Unverified	0

Show:10 25 50

← PrevPage 329 of 1512Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified