SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
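The interaction loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Corridor` environment, the `train` and `greedy_policy` helpers, and all hyperparameter values are hypothetical and do not come from any paper listed on this page. Tabular Q-learning with epsilon-greedy exploration is used here as one simple, well-known way to learn a policy from rewards.

```python
import random


class Corridor:
    """Hypothetical toy environment: states 0..n-1 on a line, goal at the right end."""

    def __init__(self, n_states=5):
        self.n_states = n_states
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 0 moves left, action 1 moves right; the walls clamp the position.
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.n_states - 1)
        done = self.state == self.n_states - 1
        reward = 1.0 if done else 0.0  # reward only on reaching the goal
        return self.state, reward, done


def train(env, episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1,
          max_steps=100, seed=0):
    """Tabular Q-learning; hyperparameters are illustrative assumptions."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(env.n_states)]
    for _ in range(episodes):
        state = env.reset()
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(2)  # explore
            else:
                best = max(q[state])       # exploit, breaking ties at random
                action = rng.choice([a for a in (0, 1) if q[state][a] == best])
            next_state, reward, done = env.step(action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Best action per non-terminal state according to the learned Q-table."""
    return [max((0, 1), key=lambda a: row[a]) for row in q[:-1]]


if __name__ == "__main__":
    q_table = train(Corridor(n_states=5))
    print(greedy_policy(q_table))
```

The discount factor `gamma` is what makes the agent value long-term reward: the single reward at the goal is propagated backward through the Q-table, so earlier states come to prefer the action that leads toward it.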

Papers

Showing 7951–8000 of 15113 papers

Title | Status | Hype
Open Problem: Approximate Planning of POMDPs in the class of Memoryless Policies |  | 0
Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning |  | 0
Open Problems and Modern Solutions for Deep Reinforcement Learning |  | 0
Open Problem: Tight Online Confidence Intervals for RKHS Elements |  | 0
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning |  | 0
Operator Shifting for Model-based Policy Evaluation |  | 0
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning |  | 0
Operator Splitting Value Iteration |  | 0
Opinion shaping in social networks using reinforcement learning |  | 0
Opportunistic Episodic Reinforcement Learning |  | 0
Opportunities of Hybrid Model-based Reinforcement Learning for Cell Therapy Manufacturing Process Control |  | 0
Opportunities of Reinforcement Learning in South Africa's Just Transition |  | 0
Low-Rank MDPs with Continuous Action Spaces |  | 0
Optimal and Learning Control for Autonomous Robots |  | 0
Optimal Attacks on Reinforcement Learning Policies |  | 0
Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach |  | 0
Optimal Bidding Strategy without Exploration in Real-time Bidding |  | 0
Optimal Charging Method for Effective Li-ion Battery Life Extension Based on Reinforcement Learning |  | 0
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian |  | 0
Optimal control barrier functions for RL based safe powertrain control |  | 0
Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC |  | 0
Optimal control of eye-movements during visual search |  | 0
Optimal Control of Material Micro-Structures |  | 0
Optimal control of point-to-point navigation in turbulent time-dependent flows using Reinforcement Learning |  | 0
Optimal coordination of resources: A solution from reinforcement learning |  | 0
Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning |  | 0
Optimal Decision-Making in Mixed-Agent Partially Observable Stochastic Environments via Reinforcement Learning |  | 0
Optimal Demand Response Using Device Based Reinforcement Learning |  | 0
Optimal Dispatch in Emergency Service System via Reinforcement Learning |  | 0
Optimal Hierarchical Learning Path Design with Reinforcement Learning |  | 0
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs |  | 0
Optimal Interpretability-Performance Trade-off of Classification Trees with Black-Box Reinforcement Learning |  | 0
Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning |  | 0
Optimal Management of the Peak Power Penalty for Smart Grids Using MPC-based Reinforcement Learning |  | 0
Non-iterative generation of an optimal mesh for a blade passage using deep reinforcement learning |  | 0
Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies |  | 0
Optimal Neuron Selection: NK Echo State Networks for Reinforcement Learning |  | 0
Optimal Observer Design Using Reinforcement Learning and Quadratic Neural Networks |  | 0
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling |  | 0
Optimal Operating Strategy for PV-BESS Households: Balancing Self-Consumption and Self-Sufficiency |  | 0
Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints |  | 0
Optimal PID and Antiwindup Control Design as a Reinforcement Learning Problem |  | 0
Optimal Placement of Public Electric Vehicle Charging Stations Using Deep Reinforcement Learning |  | 0
Optimal Portfolio Liquidation |  | 0
Optimal Power Allocation for Rate Splitting Communications with Deep Reinforcement Learning |  | 0
Optimal Reinforcement Learning for Gaussian Systems |  | 0
Optimal Sample Complexity of Reinforcement Learning for Mixing Discounted Markov Decision Processes |  | 0
Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning |  | 0
Optimal scheduling of island integrated energy systems considering multi-uncertainties and hydrothermal simultaneous transmission: A deep reinforcement learning approach |  | 0
Optimal Scheduling of Isolated Microgrids Using Automated Reinforcement Learning-based Multi-period Forecasting |  | 0
Page 160 of 303

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 |  | Unverified
2 | PPO | Mean Normalized Performance | 0.58 |  | Unverified