Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
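
As a rough illustration of the agent-environment interaction described above, the following minimal sketch runs one episode in a toy environment and sums the rewards. Everything in it is an assumption made for illustration: the `CorridorEnv` class, the reward values, the fixed "always move right" policy, and the discount factor `gamma` are hypothetical and do not come from any paper listed on this page. It shows only the interaction loop and the cumulative (discounted) reward, not a learning algorithm.

```python
from typing import Callable, List, Tuple


class CorridorEnv:
    """Hypothetical toy environment: positions 0..length-1, goal at the right end."""

    def __init__(self, length: int = 5) -> None:
        self.length = length
        self.position = 0

    def reset(self) -> int:
        """Put the agent back at the left end and return the initial state."""
        self.position = 0
        return self.position

    def step(self, action: int) -> Tuple[int, float, bool]:
        """Apply an action (-1 = left, +1 = right); return (state, reward, done)."""
        # Clamp the new position so the agent stays inside the corridor.
        self.position = max(0, min(self.length - 1, self.position + action))
        done = self.position == self.length - 1
        # Assumed reward scheme: +1.0 at the goal, a small penalty on other steps.
        reward = 1.0 if done else -0.1
        return self.position, reward, done


def run_episode(env: CorridorEnv,
                policy: Callable[[int], int],
                max_steps: int = 50) -> List[float]:
    """Let the policy act until the episode ends; collect the rewards received."""
    state = env.reset()
    rewards: List[float] = []
    for _ in range(max_steps):
        state, reward, done = env.step(policy(state))
        rewards.append(reward)
        if done:
            break
    return rewards


def discounted_return(rewards: List[float], gamma: float = 0.99) -> float:
    """Cumulative reward with discount factor gamma (gamma = 1.0 is a plain sum)."""
    total = 0.0
    # Accumulate from the last reward backwards: G_t = r_t + gamma * G_{t+1}.
    for r in reversed(rewards):
        total = r + gamma * total
    return total


if __name__ == "__main__":
    episode_rewards = run_episode(CorridorEnv(length=5), policy=lambda state: +1)
    print(episode_rewards)
    print(discounted_return(episode_rewards, gamma=0.99))
```

An RL algorithm would replace the fixed policy with one that is adjusted over many such episodes so that the return grows; the discounted form is one common way to define "long-term reward", not the only one.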

Papers

Showing 2051–2060 of 15113 papers

Title	Date	Tasks	Status	Hype
A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems	Feb 9, 2020	Combinatorial Optimization, Decoder	Code Available	1
Soft Hindsight Experience Replay	Feb 6, 2020	Deep Reinforcement Learning, reinforcement-learning	Code Available	1
Attractive or Faithful? Popularity-Reinforced Learning for Inspired Headline Generation	Feb 6, 2020	Articles, Headline Generation	Code Available	1
Multi Type Mean Field Reinforcement Learning	Feb 6, 2020	reinforcement-learning, Reinforcement Learning	Code Available	1
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits	Feb 6, 2020	Hyperparameter Optimization, Reinforcement Learning	Code Available	1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making	Feb 5, 2020	Decision Making, reinforcement-learning	Code Available	1
Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework	Feb 5, 2020	reinforcement-learning, Reinforcement Learning	Code Available	1
Effective Diversity in Population Based Reinforcement Learning	Feb 3, 2020	Diversity, Point Processes	Code Available	1
Integrating Deep Reinforcement Learning with Model-based Path Planners for Automated Driving	Feb 2, 2020	Deep Reinforcement Learning, Navigate	Code Available	1
Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning	Jan 31, 2020	BIG-bench Machine Learning, reinforcement-learning	Code Available	1


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified