SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 26262650 of 15113 papers

TitleStatusHype
Autonomous Driving in Reality with Reinforcement Learning and Image Translation0
Autonomous Control of a Particle Accelerator using Deep Reinforcement Learning0
Alternating Good-for-MDP Automata0
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration0
Cost-Sensitive Exploration in Bayesian Reinforcement Learning0
AlphaStock: A Buying-Winners-and-Selling-Losers Investment Strategy using Interpretable Deep Reinforcement Attention Networks0
Autonomous Braking and Throttle System: A Deep Reinforcement Learning Approach for Naturalistic Driving0
Adaptive model selection in photonic reservoir computing by reinforcement learning0
Autonomous Attack Mitigation for Industrial Control Systems0
Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning0
AlphaStar: An Evolutionary Computation Perspective0
A Bayesian Approach to Robust Reinforcement Learning0
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention0
Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement Learning Approach0
AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process0
Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning0
Automating Vehicles by Deep Reinforcement Learning using Task Separation with Hill Climbing0
AlphaSeq: Sequence Discovery with Deep Reinforcement Learning0
A Coarse to Fine Question Answering System based on Reinforcement Learning0
Correlation Priors for Reinforcement Learning0
Automating Turbulence Modeling by Multi-Agent Reinforcement Learning0
Automating the resolution of flight conflicts: Deep reinforcement learning in service of air traffic controllers0
AlphaRouter: Quantum Circuit Routing with Reinforcement Learning and Tree Search0
Automating Staged Rollout with Reinforcement Learning0
Adaptive Load Shedding for Grid Emergency Control via Deep Reinforcement Learning0
Show:102550
← PrevPage 106 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified