SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the long-term reward.
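As a rough illustration of the agent-environment loop described above, here is a minimal sketch using tabular Q-learning, which is only one of many RL algorithms and is not prescribed by this page. The five-state chain environment, the function names (`step`, `q_learning`, `greedy_policy`), and all hyperparameters are hypothetical choices made for the example.

```python
import random

N_STATES = 5      # hypothetical toy chain: states 0..4, state 4 is terminal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment: reward 1.0 only when the last state is reached."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def q_learning(episodes=500, alpha=0.5, gamma=0.9, max_steps=200, seed=0):
    """Learn action values from interaction, exploring with random actions."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _episode in range(episodes):
        state = 0
        for _ in range(max_steps):
            action = rng.choice(ACTIONS)  # uniformly random exploration
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Pick the highest-valued action in each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    print(greedy_policy(q_learning()))
```

In this toy setting the only reward lies at the right end of the chain, so the learned greedy policy should prefer moving right in every non-terminal state. The discount factor `gamma` is what makes earlier states worth less than states closer to the reward.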

Papers

Showing 4176–4200 of 15113 papers

Title | Status | Hype
Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics | Code | 0
Monitored Markov Decision Processes | Code | 0
On Credit Assignment in Hierarchical Reinforcement Learning | Code | 0
Mutation Testing of Deep Reinforcement Learning Based on Real Faults | Code | 0
Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning | Code | 0
Robust Offline Reinforcement learning with Heavy-Tailed Rewards | Code | 0
Towards optimized actions in critical situations of soccer games with deep reinforcement learning | Code | 0
Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing | Code | 0
Monolithic vs. hybrid controller for multi-objective Sim-to-Real learning | Code | 0
Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning | Code | 0
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning | Code | 0
Solving Offline Reinforcement Learning with Decision Tree Regression | Code | 0
Towards Practical Multi-Object Manipulation using Relational Reinforcement Learning | Code | 0
Robust optimal well control using an adaptive multi-grid reinforcement learning framework | Code | 0
Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch | Code | 0
Robust Policy Optimization in Deep Reinforcement Learning | Code | 0
Towards Robust Deep Reinforcement Learning for Traffic Signal Control: Demand Surges, Incidents and Sensor Failures | Code | 0
UNSAT Solver Synthesis via Monte Carlo Forest Search | Code | 0
Personalized Multimorbidity Management for Patients with Type 2 Diabetes Using Reinforcement Learning of Electronic Health Records | Code | 0
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness | Code | 0
Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning | Code | 0
Towards Safe Policy Improvement for Non-Stationary MDPs | Code | 0
Kernel-Based Reinforcement Learning: A Finite-Time Analysis | Code | 0
Towards Sample Efficient Agents through Algorithmic Alignment | Code | 0
MyCaffe: A Complete C# Re-Write of Caffe with Reinforcement Learning | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified