SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1082610850 of 15113 papers

TitleStatusHype
Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning0
Reinforcement Learning as Iterative and Amortised Inference0
StarCraft II Build Order Optimization using Deep Reinforcement Learning and Monte-Carlo Tree Search0
Using Reinforcement Learning to Allocate and Manage Service Function Chains in Cellular Networks0
Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning0
Potential Field Guided Actor-Critic Reinforcement Learning0
Mutual Information Based Knowledge Transfer Under State-Action Dimension MismatchCode0
Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling0
Safety-guaranteed Reinforcement Learning based on Multi-class Support Vector Machine0
Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed EnvironmentsCode0
Decorrelated Double Q-learning0
Hierarchical reinforcement learning for efficent exploration and transfer0
Exchangeable Models in Meta Reinforcement LearningCode0
Human and Multi-Agent collaboration in a human-MARL teaming framework0
Explore then Execute: Adapting without Rewards via Factorized Meta-Reinforcement Learning0
Generalizing Curricula for Reinforcement Learning0
Learning Intrinsically Motivated Options to Stimulate Policy Exploration0
Deep Reinforcement Learning for Neural Control0
A Brief Look at Generalization in Visual Meta-Reinforcement Learning0
Logical Composition in Lifelong Reinforcement Learning0
Bridging Worlds in Reinforcement Learning with Model-Advantage0
Continuous Control for Searching and Planning with a Learned Model0
Deep Reinforcement Learning for Electric Transmission Voltage Control0
Multi-Agent Reinforcement Learning in Stochastic Networked SystemsCode0
Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework0
Show:102550
← PrevPage 434 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified