SOTAVerified|Agents Browse Leaderboard About Blog

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2121–2130 of 15113 papers

Title	Date	Tasks	Status	Hype
Learning to combine primitive skills: A step towards versatile robotic manipulation	Aug 2, 2019	Data AugmentationImitation Learning	CodeCode Available	1
Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning	Jul 11, 2019	Natural Language Understandingreinforcement-learning	CodeCode Available	1
An Optimistic Perspective on Offline Reinforcement Learning	Jul 10, 2019	Atari GamesDiversity	CodeCode Available	1
Multi-Agent Deep Reinforcement Learning for Liquidation Strategy Analysis	Jun 24, 2019	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
Reinforcement Learning with Convex Constraints	Jun 21, 2019	Diversityreinforcement-learning	CodeCode Available	1
A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry	Jun 21, 2019	Decision MakingLifelong learning	CodeCode Available	1
Split Q Learning: Reinforcement Learning with Two-Stream Rewards	Jun 21, 2019	Decision MakingQ-Learning	CodeCode Available	1
Unsupervised Learning of Object Keypoints for Perception and Control	Jun 19, 2019	3D Action Recognitionimage-classification	CodeCode Available	1
When to Trust Your Model: Model-Based Policy Optimization	Jun 19, 2019	modelModel-based Reinforcement Learning	CodeCode Available	1
MoËT: Mixture of Expert Trees and its Application to Verifiable Reinforcement Learning	Jun 16, 2019	Game of GoImitation Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 213 of 1512Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified