SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 79768000 of 15113 papers

TitleStatusHype
Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning0
Optimal Decision-Making in Mixed-Agent Partially Observable Stochastic Environments via Reinforcement Learning0
Optimal Demand Response Using Device Based Reinforcement Learning0
Optimal Dispatch in Emergency Service System via Reinforcement Learning0
Optimal Hierarchical Learning Path Design with Reinforcement Learning0
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs0
Optimal Interpretability-Performance Trade-off of Classification Trees with Black-Box Reinforcement Learning0
Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning0
Optimal Management of the Peak Power Penalty for Smart Grids Using MPC-based Reinforcement Learning0
Non-iterative generation of an optimal mesh for a blade passage using deep reinforcement learning0
Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies0
Optimal Neuron Selection: NK Echo State Networks for Reinforcement Learning0
Optimal Observer Design Using Reinforcement Learning and Quadratic Neural Networks0
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling0
Optimal Operating Strategy for PV-BESS Households: Balancing Self-Consumption and Self-Sufficiency0
Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints0
Optimal PID and Antiwindup Control Design as a Reinforcement Learning Problem0
Optimal Placement of Public Electric Vehicle Charging Stations Using Deep Reinforcement Learning0
Optimal Portfolio Liquidation0
Optimal Power Allocation for Rate Splitting Communications with Deep Reinforcement Learning0
Optimal Reinforcement Learning for Gaussian Systems0
Optimal Sample Complexity of Reinforcement Learning for Mixing Discounted Markov Decision Processes0
Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning0
Optimal scheduling of island integrated energy systems considering multi-uncertainties and hydrothermal simultaneous transmission: A deep reinforcement learning approach0
Optimal Scheduling of Isolated Microgrids Using Automated Reinforcement Learning-based Multi-period Forecasting0
Show:102550
← PrevPage 320 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified