SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1195112000 of 15113 papers

TitleStatusHype
Robust Offline Reinforcement Learning for Non-Markovian Decision Processes0
Robust Offline Reinforcement Learning from Low-Quality Data0
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation0
Robust off-policy Reinforcement Learning via Soft Constrained Adversary0
Robust Opponent Modeling via Adversarial Ensemble Reinforcement Learning in Asymmetric Imperfect-Information Games0
Robust Path Selection in Software-defined WANs using Deep Reinforcement Learning0
Robust Policy Learning over Multiple Uncertainty Sets0
Robust Policy Learning via Offline Skill Diffusion0
Robust Policy Switching for Antifragile Reinforcement Learning for UAV Deconfliction in Adversarial Environments0
Robust Predictable Control0
Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning0
Robust Quadruped Jumping via Deep Reinforcement Learning0
Robust Recovery Controller for a Quadrupedal Robot using Deep Reinforcement Learning0
Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation0
Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training0
Robust Reinforcement Learning-based Autonomous Driving Agent for Simulation and Real World0
Robust Reinforcement Learning for Autonomous Driving0
Robust Reinforcement Learning for Continuous Control with Model Misspecification0
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations0
Robust Reinforcement Learning on Graphs for Logistics optimization0
Robust Reinforcement Learning through Efficient Adversarial Herding0
Robust Reinforcement Learning under Diffusion Models for Data with Jumps0
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel0
Robust Reinforcement Learning via Genetic Curriculum0
Robust Reinforcement Learning with Distributional Risk-averse formulation0
Robust Reinforcement Learning with Wasserstein Constraint0
Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning0
Robust Risk-Aware Option Hedging0
Robust Risk-Sensitive Reinforcement Learning Agents for Trading Markets0
Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk0
Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving0
Robust Spoken Language Understanding with RL-based Value Error Recovery0
Robust synchronization and policy adaptation for networked heterogeneous agents0
Rocket Landing Control with Random Annealing Jump Start Reinforcement Learning0
Role Playing Learning for Socially Concomitant Mobile Robot Navigation0
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation0
RoMFAC: A robust mean-field actor-critic reinforcement learning against adversarial perturbations on states0
Room Clearance with Feudal Hierarchical Reinforcement Learning0
Route Optimization via Environment-Aware Deep Network and Reinforcement Learning0
Routing algorithms as tools for integrating social distancing with emergency evacuation0
Routing and Placement of Macros using Deep Reinforcement Learning0
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning0
RSO: A Novel Reinforced Swarm Optimization Algorithm for Feature Selection0
RTDK-BO: High Dimensional Bayesian Optimization with Reinforced Transformer Deep kernels0
Rule-Aware Reinforcement Learning for Knowledge Graph Reasoning0
Rule-Based Reinforcement Learning for Efficient Robot Navigation with Space Reduction0
Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents0
Rule Mining over Knowledge Graphs via Reinforcement Learning0
Run-and-tumble chemotaxis using reinforcement learning0
Runtime Adaptation in Wireless Sensor Nodes Using Structured Learning0
Show:102550
← PrevPage 240 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified