SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1095110975 of 15113 papers

TitleStatusHype
Adaptive Reinforcement Learning through Evolving Self-Modifying Neural Networks0
Towards Automated Safety Coverage and Testing for Autonomous Vehicles with Reinforcement Learning0
Reinforcement learning with human advice: a survey0
Q-NAV: NAV Setting Method based on Reinforcement Learning in Underwater Wireless Networks0
Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension0
Novel Policy Seeking with Constrained OptimizationCode0
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks0
Reinforcement Learning for Variable Selection in a Branch and Bound Algorithm0
Learning and Reasoning for Robot Dialog and Navigation Tasks0
Deep Reinforcement Learning for High Level Character Control0
Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise0
A reinforcement learning based decision support system in textile manufacturing process0
Batch-Augmented Multi-Agent Reinforcement Learning for Efficient Traffic Signal Optimization0
A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments0
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text0
Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning0
Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding Behaviors using Deep Reinforcement Learning0
Privileged Information Dropout in Reinforcement Learning0
Reinforcement Learning for Caching with Space-Time Popularity Dynamics0
Optimal Charging Method for Effective Li-ion Battery Life Extension Based on Reinforcement Learning0
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation0
Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency MapsCode0
Automating Turbulence Modeling by Multi-Agent Reinforcement Learning0
Learning Transferable Concepts in Deep Reinforcement Learning0
A Simple Imitation Learning Method via Contrastive Regularization0
Show:102550
← PrevPage 439 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified