SOTAVerified|Agents Browse Leaderboard About Blog

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2161–2170 of 15113 papers

Title	Date	Tasks	Status	Hype
Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks	Apr 25, 2024	FairnessMulti-Armed Bandits	—Unverified	0
REBEL: Reinforcement Learning via Regressing Relative Rewards	Apr 25, 2024	continuous-controlContinuous Control	CodeCode Available	2
Offline Reinforcement Learning with Behavioral Supervisor Tuning	Apr 25, 2024	Offline RLreinforcement-learning	—Unverified	0
A fast balance optimization approach for charging enhancement of lithium-ion battery packs through deep reinforcement learning	Apr 24, 2024	Deep Reinforcement Learningenergy management	CodeCode Available	1
ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling	Apr 24, 2024	Reinforcement Learning (RL)	—Unverified	0
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning	Apr 24, 2024	Benchmarkingreinforcement-learning	—Unverified	0
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL	Apr 24, 2024	reinforcement-learningReinforcement Learning	—Unverified	0
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models	Apr 23, 2024	image-classificationImage Classification	—Unverified	0
Planning the path with Reinforcement Learning: Optimal Robot Motion Planning in RoboCup Small Size League Environments	Apr 23, 2024	Motion PlanningReinforcement Learning (RL)	CodeCode Available	0
Impedance Matching: Enabling an RL-Based Running Jump in a Quadruped Robot	Apr 23, 2024	Reinforcement Learning (RL)	—Unverified	0

Show:10 25 50

← PrevPage 217 of 1512Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified