SOTAVerified|Agents Browse Leaderboard About

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1201–1210 of 15113 papers

Title	Date	Tasks	Status	Hype	Score
Extreme Q-Learning: MaxEnt RL without Entropy	Jan 5, 2023	D4RLDeep Reinforcement Learning	CodeCode Available	1	5
Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards	Aug 6, 2020	AttributeImage Captioning	CodeCode Available	1	5
Game-Theoretic Multiagent Reinforcement Learning	Nov 1, 2020	Multi-agent Reinforcement Learningreinforcement-learning	CodeCode Available	1	5
B-Pref: Benchmarking Preference-Based Reinforcement Learning	Nov 4, 2021	Benchmarkingreinforcement-learning	CodeCode Available	1	5
Embodied Synaptic Plasticity with Online Reinforcement learning	Mar 3, 2020	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1	5
Fast Template Matching and Update for Video Object Tracking and Segmentation	Apr 16, 2020	Object Trackingreinforcement-learning	CodeCode Available	1	5
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games	Oct 5, 2020	Real-Time Strategy GamesReinforcement Learning (RL)	CodeCode Available	1	5
Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on Graphs	Jul 24, 2020	Decision MakingDeep Reinforcement Learning	CodeCode Available	1	5
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning	Oct 23, 2020	Deep Reinforcement LearningModel-based Reinforcement Learning	CodeCode Available	1	5
Automating DBSCAN via Deep Reinforcement Learning	Aug 9, 2022	ClusteringComputational Efficiency	CodeCode Available	1	5

Show:10 25 50

← PrevPage 121 of 1512Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified