SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 53515375 of 15113 papers

TitleStatusHype
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs0
RIS-assisted UAV Communications for IoT with Wireless Power Transfer Using Deep Reinforcement Learning0
RISCLESS: A Reinforcement Learning Strategy to Exploit Unused Cloud Resources0
Risk-Averse Bayes-Adaptive Reinforcement Learning0
Risk-Averse Learning by Temporal Difference Methods0
Risk-averse policies for natural gas futures trading using distributional reinforcement learning0
Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures0
Risk Averse Robust Adversarial Reinforcement Learning0
Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning0
Risk Aware and Multi-Objective Decision Making with Distributional Monte Carlo Tree Search0
Risk-Aware Reinforcement Learning through Optimal Transport Theory0
Risk-Aware Safe Reinforcement Learning for Control of Stochastic Linear Systems0
Risk-Aware Transfer in Reinforcement Learning using Successor Features0
Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning0
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning0
Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation0
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria0
Risk Perspective Exploration in Distributional Reinforcement Learning0
Risk-Sensitive and Robust Model-Based Reinforcement Learning and Planning0
Risk-Sensitive Bayesian Games for Multi-Agent Reinforcement Learning under Policy Uncertainty0
Risk-Sensitive Compact Decision Trees for Autonomous Execution in Presence of Simulated Market Response0
Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning0
Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy0
Risk-sensitive Markov Decision Process and Learning under General Utility Functions0
Risk Sensitive Model-Based Reinforcement Learning using Uncertainty Guided Planning0
Show:102550
← PrevPage 215 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified