SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 40814090 of 15113 papers

TitleStatusHype
Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks0
Active 6D Multi-Object Pose Estimation in Cluttered Scenarios with Deep Reinforcement Learning0
Active Alignments of Lens Systems with Reinforcement Learning0
Active Classification of Moving Targets with Learned Control Policies0
Active Coverage for PAC Reinforcement Learning0
Active Deep Q-learning with Demonstration0
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation0
Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples0
Active Hierarchical Imitation and Reinforcement Learning0
Active hypothesis testing in unknown environments using recurrent neural networks and model free reinforcement learning0
Show:102550
← PrevPage 409 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified