SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 997610000 of 15113 papers

TitleStatusHype
Constrained Markov Decision Processes via Backward Value Functions0
Decision-making for Autonomous Vehicles on Highway: Deep Reinforcement Learning with Continuous Action Horizon0
Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation0
t-Soft Update of Target Network for Deep Reinforcement Learning0
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing0
Ensuring Monotonic Policy Improvement in Entropy-regularized Value-based Reinforcement Learning0
Variable Compliance Control for Robotic Peg-in-Hole Assembly: A Deep Reinforcement Learning Approach0
Dynamic Dispatching for Large-Scale Heterogeneous Fleet via Multi-agent Deep Reinforcement Learning0
Improved Memories Learning0
Learning Off-Policy with Online PlanningCode1
DSP: A Differential Spatial Prediction Scheme for Comprehensive real industrial datasets0
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep Reinforcement Learning0
Mobile Networks for Computer Go0
Social-Aware Incentive Mechanism for VehicularCrowdsensing by Deep Reinforcement LearningCode1
Model-Free Episodic Control with State Aggregation0
Biomechanic Posture Stabilisation via Iterative Training of Multi-policy Deep Reinforcement Learning Agents0
Adversarial Imitation Learning via Random Search0
Learning to Collaborate in Multi-Module Recommendation via Multi-Agent Reinforcement Learning without Communication0
A Composable Specification Language for Reinforcement Learning TasksCode1
Reinforcement Learning-based Admission Control in Delay-sensitive Service Systems0
NANCY: Neural Adaptive Network Coding methodologY for video distribution over wireless networks0
Static Neural Compiler Optimization via Deep Reinforcement Learning0
Model-free optimal control of discrete-time systems with additive and multiplicative noises0
Reinforcement Learning based dynamic weighing of Ensemble Models for Time Series Forecasting0
Optimal control towards sustainable wastewater treatment plants based on multi-agent reinforcement learningCode1
Show:102550
← PrevPage 400 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified