
Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from the feedback it receives, in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
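The interaction loop described above can be sketched with tabular Q-learning on a toy problem. This is only an illustrative sketch, not a method taken from any paper listed here: the five-cell corridor environment, the `step` and `train` functions, and the values chosen for `alpha`, `gamma`, and `epsilon` are all assumptions made for the example. The discount factor `gamma` is one common way to define "long-term reward"; the description above does not specify one.

```python
import random

N_STATES = 5      # hypothetical corridor with cells 0..4; cell 4 is the goal
ACTIONS = (0, 1)  # 0 = step left, 1 = step right


def step(state, action):
    """Toy environment: return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn action values by trial and error (assumed hyperparameters)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore at random with probability epsilon, or when the
            # two action values are tied; otherwise act greedily.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.choice(ACTIONS)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Q-learning target: immediate reward plus the discounted
            # value of the best action available in the next state.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q_table = train()
# Greedy policy for the non-terminal cells 0..3.
policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
print(policy)
```

In this toy setting the only reward sits at the right end of the corridor, so the learned greedy policy should settle on stepping right in every cell. Real benchmarks replace the hand-written `step` function with a full environment and the table with a function approximator.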

Papers


Showing 4081–4090 of 15113 papers

Title	Date	Tasks	Status
Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks	Jul 26, 2023	Decision Making, LEMMA	Unverified
Active 6D Multi-Object Pose Estimation in Cluttered Scenarios with Deep Reinforcement Learning	Oct 19, 2019	Deep Reinforcement Learning, Object	Unverified
Active Alignments of Lens Systems with Reinforcement Learning	Mar 3, 2025	reinforcement-learning, Reinforcement Learning	Unverified
Active Classification of Moving Targets with Learned Control Policies	Dec 6, 2022	Classification, Reinforcement Learning (RL)	Unverified
Active Coverage for PAC Reinforcement Learning	Jun 23, 2023	reinforcement-learning, Reinforcement Learning	Unverified
Active Deep Q-learning with Demonstration	Dec 6, 2018	Q-Learning, reinforcement-learning	Unverified
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation	Apr 2, 2024	Active Learning, Bayesian Inference	Unverified
Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples	Jun 28, 2020	Active Learning, Deep Reinforcement Learning	Unverified
Active Hierarchical Imitation and Reinforcement Learning	Dec 14, 2020	Active Learning, Imitation Learning	Unverified
Active hypothesis testing in unknown environments using recurrent neural networks and model free reinforcement learning	Mar 19, 2023	Deep Reinforcement Learning, reinforcement-learning	Unverified



Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified