SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 86018625 of 15113 papers

TitleStatusHype
Optimal Interpretability-Performance Trade-off of Classification Trees with Black-Box Reinforcement Learning0
Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning0
Optimal Management of the Peak Power Penalty for Smart Grids Using MPC-based Reinforcement Learning0
Non-iterative generation of an optimal mesh for a blade passage using deep reinforcement learning0
Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies0
Optimal Neuron Selection: NK Echo State Networks for Reinforcement Learning0
Optimal Observer Design Using Reinforcement Learning and Quadratic Neural Networks0
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling0
Optimal Operating Strategy for PV-BESS Households: Balancing Self-Consumption and Self-Sufficiency0
Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints0
Optimal PID and Antiwindup Control Design as a Reinforcement Learning Problem0
Optimal Placement of Public Electric Vehicle Charging Stations Using Deep Reinforcement Learning0
Optimal Portfolio Liquidation0
Optimal Power Allocation for Rate Splitting Communications with Deep Reinforcement Learning0
Optimal Reinforcement Learning for Gaussian Systems0
Optimal Sample Complexity of Reinforcement Learning for Mixing Discounted Markov Decision Processes0
Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning0
Optimal scheduling of island integrated energy systems considering multi-uncertainties and hydrothermal simultaneous transmission: A deep reinforcement learning approach0
Optimal Scheduling of Isolated Microgrids Using Automated Reinforcement Learning-based Multi-period Forecasting0
Optimal Sequential Decision-Making in Geosteering: A Reinforcement Learning Approach0
Optimal Tap Setting of Voltage Regulation Transformers Using Batch Reinforcement Learning0
Optimal Task Assignment and Path Planning using Conflict-Based Search with Precedence and Temporal Constraints0
Optimal Use of Experience in First Person Shooter Environments0
Optimisation of the Accelerator Control by Reinforcement Learning: A Simulation-Based Approach0
Optimising 4th-Order Runge-Kutta Methods: A Dynamic Heuristic Approach for Efficiency and Low Storage0
Show:102550
← PrevPage 345 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified