SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 74517475 of 15113 papers

TitleStatusHype
Coordinated Random Access for Industrial IoT With Correlated Traffic By Reinforcement-Learning0
Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach0
Carl-Lead: Lidar-based End-to-End Autonomous Driving with Contrastive Deep Reinforcement Learning0
Soft Actor-Critic With Integer Actions0
RAPID-RL: A Reconfigurable Architecture with Preemptive-Exits for Efficient Deep-Reinforcement Learning0
Reinforcement Learning on Encrypted Data0
Enabling risk-aware Reinforcement Learning for medical interventions through uncertainty decomposition0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning0
Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing0
Comparison and Unification of Three Regularization Methods in Batch Reinforcement Learning0
Estimation of Warfarin Dosage with Reinforcement LearningCode0
Short Quantum Circuits in Reinforcement Learning Policies for the Vehicle Routing Problem0
What Does The User Want? Information Gain for Hierarchical Dialogue Policy Optimisation0
Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning0
Balancing detectability and performance of attacks on the control channel of Markov Decision ProcessesCode0
Back to Basics: Deep Reinforcement Learning in Traffic Signal ControlCode0
DCUR: Data Curriculum for Teaching via Samples with Reinforcement LearningCode0
Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback0
Automatically Exposing Problems with Neural Dialog ModelsCode0
WaveCorr: Correlation-savvy Deep Reinforcement Learning for Portfolio ManagementCode0
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation0
Reinforcement Learning with Evolutionary Trajectory Generator: A General Approach for Quadrupedal LocomotionCode1
Towards optimized actions in critical situations of soccer games with deep reinforcement learningCode0
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain0
Continuous Homeostatic Reinforcement Learning for Self-Regulated Autonomous Agents0
Show:102550
← PrevPage 299 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified