SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find the optimal policy, that is, the decision-making strategy that maximizes long-term reward.
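As a minimal sketch of the interaction loop described above, the example below trains an agent with tabular Q-learning on a toy corridor. Q-learning is chosen here only for illustration; the page does not single out any algorithm. The environment (five states, goal at the right end, reward 1.0 on reaching it), the `step` function, and all hyperparameters are hypothetical assumptions.

```python
import random

# Hypothetical toy environment (an assumption for illustration):
# a corridor of states 0..4, the agent starts at 0, the goal is state 4.
N_STATES = 5
ACTIONS = (0, 1)      # 0 = move left, 1 = move right
GOAL = N_STATES - 1


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == GOAL
    return next_state, (1.0 if done else 0.0), done


def greedy_action(q_row, rng):
    """Pick a highest-valued action, breaking ties at random."""
    best = max(q_row)
    return rng.choice([a for a in ACTIONS if q_row[a] == best])


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap on episode length
            # Explore with probability epsilon, otherwise act greedily.
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = greedy_action(q[state], rng)
            next_state, reward, done = step(state, action)
            # Target: immediate reward plus discounted best future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


q_table = train()
# Greedy policy for each non-terminal state (1 means "move right").
policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(GOAL)]
print(policy)
```

The discount factor `gamma` is what makes the reward "long-term": states further from the goal receive a smaller but still positive value, so the learned greedy policy heads toward the goal from every state.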

Papers

Showing 5526-5550 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation | | 0 |
| Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training | | 0 |
| Robust Reinforcement Learning-based Autonomous Driving Agent for Simulation and Real World | | 0 |
| Robust Reinforcement Learning for Autonomous Driving | | 0 |
| Robust Reinforcement Learning for Continuous Control with Model Misspecification | | 0 |
| Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations | | 0 |
| Robust Reinforcement Learning on Graphs for Logistics optimization | | 0 |
| Robust Reinforcement Learning through Efficient Adversarial Herding | | 0 |
| Robust Reinforcement Learning under Diffusion Models for Data with Jumps | | 0 |
| Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel | | 0 |
| Robust Reinforcement Learning via Genetic Curriculum | | 0 |
| Robust Reinforcement Learning with Distributional Risk-averse formulation | | 0 |
| Robust Reinforcement Learning with Wasserstein Constraint | | 0 |
| Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning | | 0 |
| Robust Risk-Aware Option Hedging | | 0 |
| Robust Risk-Sensitive Reinforcement Learning Agents for Trading Markets | | 0 |
| Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk | | 0 |
| Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving | | 0 |
| Robust Spoken Language Understanding with RL-based Value Error Recovery | | 0 |
| Robust synchronization and policy adaptation for networked heterogeneous agents | | 0 |
| Rocket Landing Control with Random Annealing Jump Start Reinforcement Learning | | 0 |
| Role Playing Learning for Socially Concomitant Mobile Robot Navigation | | 0 |
| ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation | | 0 |
| RoMFAC: A robust mean-field actor-critic reinforcement learning against adversarial perturbations on states | | 0 |
| Room Clearance with Feudal Hierarchical Reinforcement Learning | | 0 |
Page 222 of 605

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |