SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 48764900 of 15113 papers

TitleStatusHype
Foundations for Restraining Bolts: Reinforcement Learning with LTLf/LDLf restraining specifications0
Reinforcement Learning for Machine Learning Model Deployment: Evaluating Multi-Armed Bandits in ML Ops Environments0
Reinforcement Learning for Many-Body Ground-State Preparation Inspired by Counterdiabatic Driving0
Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism?0
Reinforcement Learning for Matrix Computations: PageRank as an Example0
Reinforcement Learning for Mean Field Game0
Reinforcement Learning for Mean Field Games, with Applications to Economics0
Distributed Reinforcement Learning for Age of Information Minimization in Real-Time IoT Systems0
Reinforcement Learning for Mitigating Intermittent Interference in Terahertz Communication Networks0
Reinforcement Learning for Mixed-Integer Problems Based on MPC0
Reinforcement Learning for Motor Control: A Comprehensive Review0
Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems0
Reinforcement Learning for Multi-Truck Vehicle Routing Problems0
Reinforcement Learning for Nested Polar Code Construction0
Reinforcement Learning for Node Selection in Branch-and-Bound0
Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system0
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism0
Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP0
Reinforcement Learning for on-line Sequence Transformation0
Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study0
Reinforcement Learning for Optimal Control of a District Cooling Energy Plant0
Reinforcement Learning for Optimal Load Distribution Sequencing in Resource-Sharing System0
Reinforcement learning for optimization of variational quantum circuit architectures0
On-line reinforcement learning for optimization of real-life energy trading strategy0
Reinforcement Learning for Optimized Beam Training in Multi-Hop Terahertz Communications0
Show:102550
← PrevPage 196 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified