SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal of reinforcement learning is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
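The interaction loop described above can be illustrated with a minimal sketch. It uses tabular Q-learning, one common RL algorithm chosen here only for illustration, on a hypothetical five-state chain environment. The environment, the function names (`step`, `choose_action`, `train`, `greedy_policy`), and the hyperparameter values are all assumptions made for this example and do not come from any listed paper.

```python
import random

N_STATES = 5        # hypothetical chain: states 0..4, state 4 is the terminal goal
ACTIONS = (0, 1)    # 0 = move left, 1 = move right
GAMMA = 0.9         # discount factor (assumed value)
ALPHA = 0.5         # learning rate (assumed value)
EPSILON = 0.2       # exploration probability (assumed value)


def step(state, action):
    """Toy deterministic environment: reward 1.0 only when the goal is reached."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def choose_action(q, state, rng):
    """Epsilon-greedy action selection with random tie-breaking."""
    if rng.random() < EPSILON:
        return rng.choice(ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=500, seed=0):
    """Run Q-learning and return the learned action-value table."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q, state, rng)
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted best next value.
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action for each non-terminal state under the learned values."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

In this toy setting the reward arrives only at the goal, so the learned values for "move right" grow as the agent gets closer to state 4, and the greedy policy ends up heading right from every state. Real environments are typically stochastic and far larger, which is where function approximation and the methods in the listed papers come in.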

Papers

Showing 5376-5400 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Risk-sensitive Reinforcement Learning | | 0 |
| Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path | | 0 |
| Risk-Sensitive Reinforcement Learning via Policy Gradient Search | | 0 |
| Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty | | 0 |
| Risk-Sensitive Reinforcement Learning Applied to Control under Constraints | | 0 |
| Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions | | 0 |
| Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret | | 0 |
| Risk-Sensitive Reinforcement Learning with Exponential Criteria | | 0 |
| RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations | | 0 |
| RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks | | 0 |
| RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models | | 0 |
| RL4ReAl: Reinforcement Learning for Register Allocation | | 0 |
| RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban Environments | | 0 |
| RLAD: Time Series Anomaly Detection through Reinforcement Learning and Active Learning | | 0 |
| RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents | | 0 |
| R-LAtte: Attention Module for Visual Control via Reinforcement Learning | | 0 |
| RL-Based Cargo-UAV Trajectory Planning and Cell Association for Minimum Handoffs, Disconnectivity, and Energy Consumption | | 0 |
| RL-based Control of UAS Subject to Significant Disturbance | | 0 |
| RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies | | 0 |
| RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems | | 0 |
| RLCache: Automated Cache Management Using Reinforcement Learning | | 0 |
| RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation | | 0 |
| RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning | | 0 |
| RLCFR: Minimize Counterfactual Regret by Deep Reinforcement Learning | | 0 |
| RL-Controller: a reinforcement learning framework for active structural control | | 0 |
Page 216 of 605

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |