SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
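The interaction loop described above can be illustrated with a minimal sketch of tabular Q-learning. Everything here is an assumption made for illustration and is not taken from any listed paper: the five-state chain environment, the `step`, `train` and `greedy_policy` helpers, and the hyperparameter values. The agent explores with uniformly random actions, receives a reward of 1 only when it reaches the final state, and learns action values from which a greedy policy is read off.

```python
import random

N_STATES = 5          # states 0..4; state 4 is the terminal goal (toy assumption)
ACTIONS = (0, 1)      # 0 = move left, 1 = move right
GAMMA = 0.9           # discount factor (illustrative value)
ALPHA = 0.5           # learning rate (illustrative value)


def step(state, action):
    """Hypothetical deterministic environment: returns (next_state, reward, done)."""
    next_state = max(state - 1, 0) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, max_steps=1000, seed=0):
    """Learn a Q-table from interaction using a purely random behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            action = rng.choice(ACTIONS)
            next_state, reward, done = step(state, action)
            # Q-learning update: bootstrap from the best action in the next state.
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Pick the highest-valued action in each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
print(policy)
```

With these settings the greedy policy should choose action 1 (move right) in every non-terminal state, since that is the shortest route to the only reward. Q-learning is used here only because it is compact; the papers listed below cover many other methods.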

Papers

Showing 5201–5225 of 15113 papers

Title | Status | Hype
Hindsight Learning for MDPs with Exogenous Inputs | Code | 0
Brick Tic-Tac-Toe: Exploring the Generalizability of AlphaZero to Novel Test Environments | Code | 0
GriddlyJS: A Web IDE for Reinforcement Learning | - | 0
Continual Meta-Reinforcement Learning for UAV-Aided Vehicular Wireless Networks | - | 0
Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework | - | 0
Optimistic PAC Reinforcement Learning: the Instance-Dependent View | - | 0
Learning Bellman Complete Representations for Offline Policy Evaluation | Code | 0
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization | Code | 1
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning | Code | 1
Online Game Level Generation from Music | Code | 0
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning | Code | 1
PAC Reinforcement Learning for Predictive State Representations | - | 0
Grounding Aleatoric Uncertainty for Unsupervised Environment Design | - | 0
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning | - | 0
Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for Planning | - | 0
Reinforcement Learning–Based Transient Response Shaping for Microgrids | - | 0
State Dropout-Based Curriculum Reinforcement Learning for Self-Driving at Unsignalized Intersections | - | 0
Deep Reinforcement Learning for Long-Term Voltage Stability Control | - | 0
Ablation Study of How Run Time Assurance Impacts the Training and Performance of Reinforcement Learning Agents | - | 0
High Performance Simulation for Scalable Multi-Agent Reinforcement Learning | - | 0
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning | Code | 1
CompoSuite: A Compositional Reinforcement Learning Benchmark | Code | 1
Storehouse: a Reinforcement Learning Environment for Optimizing Warehouse Management | Code | 1
Safe reinforcement learning for multi-energy management systems with known constraint functions | - | 0
Reinforced Lin-Kernighan-Helsgaun Algorithms for the Traveling Salesman Problems | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified