SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1060110625 of 15113 papers

TitleStatusHype
Exact Reduction of Huge Action Spaces in General Reinforcement Learning0
Examining average and discounted reward optimality criteria in reinforcement learning0
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks0
Exchangeable Input Representations for Reinforcement Learning0
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking0
Exclusively Penalized Q-learning for Offline Reinforcement Learning0
Execute Order 66: Targeted Data Poisoning for Reinforcement Learning0
ExpanRL: Hierarchical Reinforcement Learning for Course Concept Expansion in MOOCs0
Expected Policy Gradients for Reinforcement Learning0
Expected Scalarised Returns Dominance: A New Solution Concept for Multi-Objective Decision Making0
Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning0
Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning0
Experience-driven Networking: A Deep Reinforcement Learning based Approach0
Experience enrichment based task independent reward model0
Experience Replay More When It's a Key Transition in Deep Reinforcement Learning0
Experience Replay Optimization0
Experience Replay Using Transition Sequences0
Experience Sharing Between Cooperative Reinforcement Learning Agents0
Experimental analysis of data-driven control for a building heating system0
Experimental Analysis of Reinforcement Learning Techniques for Spectrum Sharing Radar0
Experimental Evidence that Empowerment May Drive Exploration in Sparse-Reward Environments0
Experimental results : Reinforcement Learning of POMDPs using Spectral Methods0
Experimental Study on Reinforcement Learning-based Control of an Acrobot0
Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning0
Expert Level control of Ramp Metering based on Multi-task Deep Reinforcement Learning0
Show:102550
← PrevPage 425 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified