SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 52265250 of 15113 papers

TitleStatusHype
Resilient Legged Local Navigation: Learning to Traverse with Compromised Perception End-to-End0
Resilient robot teams: a review integrating decentralised control, change-detection, and learning0
Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning0
Resmax: An Alternative Soft-Greedy Operator for Reinforcement Learning0
Resolving Causal Confusion in Reinforcement Learning via Robust Exploration0
Resolving Congestions in the Air Traffic Management Domain via Multiagent Reinforcement Learning Methods0
Resolving Latency and Inventory Risk in Market Making with Reinforcement Learning0
Resource Abstraction for Reinforcement Learning in Multiagent Congestion Problems0
Resource Allocation for a Wireless Coexistence Management System Based on Reinforcement Learning0
Resource Allocation in Mobility-Aware Federated Learning Networks: A Deep Reinforcement Learning Approach0
Resource Allocation in Multicore Elastic Optical Networks: A Deep Reinforcement Learning Approach0
The state-of-the-art review on resource allocation problem using artificial intelligence methods on various computing paradigms0
Resource Constrained Deep Reinforcement Learning0
Resource-Constrained Station-Keeping for Helium Balloons using Reinforcement Learning0
Resource Governance in Networked Systems via Integrated Variational Autoencoders and Reinforcement Learning0
Resource Management for Blockchain-enabled Federated Learning: A Deep Reinforcement Learning Approach0
Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning0
Resource Optimization for Tail-Based Control in Wireless Networked Control Systems0
Responding to Illegal Activities Along the Canadian Coastlines Using Reinforcement Learning0
Response to Comment on 'Perceptual Learning Incepted by Decoded fMRI Neurofeedback Without Stimulus Presentation'; How can a decoded neurofeedback method (DecNef) lead to successful reinforcement and visual perceptual learning?0
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods0
Responsive Safety in Reinforcement Learning0
Restarted Bayesian Online Change-point Detection for Non-Stationary Markov Decision Processes0
Restoring Chaos Using Deep Reinforcement Learning0
Rethink AI-based Power Grid Control: Diving Into Algorithm Design0
Show:102550
← PrevPage 210 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified