SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 85268550 of 15113 papers

TitleStatusHype
Human-Inspired Multi-Agent Navigation using Knowledge DistillationCode1
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning0
Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving IntelligenceCode1
Deep Reinforcement Learning-Aided RAN Slicing Enforcement for B5G Latency Sensitive Services0
Maximum Entropy Reinforcement Learning with Mixture Policies0
Weakly Supervised Reinforcement Learning for Autonomous Highway Driving via Virtual Safety CagesCode0
Near Optimal Policy Optimization via REPS0
Decentralized Reinforcement Learning for Multi-Target Search and Detection by a Team of Drones0
Infinite-Horizon Offline Reinforcement Learning with Linear Function Approximation: Curse of Dimensionality and Algorithm0
A Practical Guide to Multi-Objective Reinforcement Learning and PlanningCode0
Hierarchical Reinforcement Learning Framework for Stochastic Spaceflight Campaign Design0
Inclined Quadrotor Landing using Deep Reinforcement LearningCode1
Learning to Shape Rewards using a Game of Two Partners0
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving0
Lyapunov Barrier Policy OptimizationCode1
Transfer Learning for Automated Test Case Prioritization Using XCSFCode0
Accelerating Online Reinforcement Learning via Model-Based Meta-Learning0
Deep Reinforcement Learning for Band Selection in Hyperspectral Image ClassificationCode1
Autonomous Drone Racing with Deep Reinforcement Learning0
Learning Symbolic Rules for Interpretable Deep Reinforcement Learning0
Modelling Human Kinetics and Kinematics during Walking using Reinforcement Learning0
Reinforcement Learning with Algorithms from Probabilistic Structure EstimationCode0
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics ModelCode1
Simulation Studies on Deep Reinforcement Learning for Building Control with Human Interaction0
Offline Reinforcement Learning with Fisher Divergence Critic Regularization0
Show:102550
← PrevPage 342 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified