SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 60766100 of 15113 papers

TitleStatusHype
Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit0
Watch from sky: machine-learning-based multi-UAV network for predictive police surveillance0
Recursive Reasoning Graph for Multi-Agent Reinforcement Learning0
Hierarchically Structured Scheduling and Execution of Tasks in a Multi-Agent Environment0
A Multi-Document Coverage Reward for RELAXed Multi-Document SummarizationCode0
Depthwise Convolution for Multi-Agent Communication with Enhanced Mean-Field Approximation0
Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations0
Deep Reinforcement Learning based Model-free On-line Dynamic Multi-Microgrid Formation to Enhance Resilience0
Safe Reinforcement Learning for Legged Locomotion0
Target Network and Truncation Overcome The Deadly Triad in Q-Learning0
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions0
Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning0
GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning0
Bilateral Deep Reinforcement Learning Approach for Better-than-human Car Following Model0
Intrinsically-Motivated Reinforcement Learning: A Brief Introduction0
Deep Q-network using reservoir computing with multi-layered readout0
On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical EfficiencyCode0
The Best of Both Worlds: Reinforcement Learning with Logarithmic Regret and Policy Switches0
Testing Stationarity and Change Point Detection in Reinforcement LearningCode1
Quantum Reinforcement Learning via Policy Iteration0
Optimized cost function for demand response coordination of multiple EV charging stations using reinforcement learning0
Reasoning about Counterfactuals to Improve Human Inverse Reinforcement LearningCode0
Reliable validation of Reinforcement Learning Benchmarks0
Andes_gym: A Versatile Environment for Deep Reinforcement Learning in Power SystemsCode0
Integrating Contrastive Learning with Dynamic Models for Reinforcement Learning from ImagesCode0
Show:102550
← PrevPage 244 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified