SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback on its actions, in the form of rewards or penalties. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
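
The interaction loop described above can be illustrated with a minimal sketch. It assumes a hypothetical two-armed bandit environment and an epsilon-greedy agent; the names `TwoArmedBandit`, `train`, and `discounted_return`, the payout probabilities, and the hyperparameters are all illustrative assumptions, not taken from any paper listed on this page.

```python
import random


def discounted_return(rewards, gamma=0.9):
    """Cumulative reward, with each later reward discounted by gamma per step."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


class TwoArmedBandit:
    """Hypothetical toy environment: arm 1 pays off more often than arm 0."""

    def __init__(self, payout_probs=(0.2, 0.8), seed=0):
        self.payout_probs = payout_probs
        self.rng = random.Random(seed)

    def step(self, action):
        # Reward of 1.0 with the chosen arm's payout probability, else 0.0.
        return 1.0 if self.rng.random() < self.payout_probs[action] else 0.0


def train(env, steps=2000, epsilon=0.1, seed=1):
    """Epsilon-greedy agent that keeps a running mean reward per action."""
    rng = random.Random(seed)
    values = [0.0, 0.0]  # estimated mean reward of each action
    counts = [0, 0]      # number of times each action was taken
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(2)  # explore: pick a random action
        else:
            action = max(range(2), key=lambda a: values[a])  # exploit
        reward = env.step(action)  # feedback from the environment
        counts[action] += 1
        # Incremental update of the running mean for the chosen action.
        values[action] += (reward - values[action]) / counts[action]
    return values


if __name__ == "__main__":
    estimates = train(TwoArmedBandit())
    print("estimated action values:", estimates)
    print("discounted return of [1, 1, 1]:", discounted_return([1, 1, 1], gamma=0.5))
```

Here the "policy" is simply the rule of picking the action with the highest estimated value, with occasional random exploration. Methods for real problems replace the value table with richer state-dependent estimates, but the act, observe reward, update cycle is the same.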

Papers

Showing 11801–11850 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| Risk-Aware Transfer in Reinforcement Learning using Successor Features | | 0 |
| Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning | | 0 |
| Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning | | 0 |
| Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation | | 0 |
| Risk-Constrained Reinforcement Learning with Percentile Risk Criteria | | 0 |
| Risk Perspective Exploration in Distributional Reinforcement Learning | | 0 |
| Risk-Sensitive and Robust Model-Based Reinforcement Learning and Planning | | 0 |
| Risk-Sensitive Bayesian Games for Multi-Agent Reinforcement Learning under Policy Uncertainty | | 0 |
| Risk-Sensitive Compact Decision Trees for Autonomous Execution in Presence of Simulated Market Response | | 0 |
| Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning | | 0 |
| Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy | | 0 |
| Risk-sensitive Markov Decision Process and Learning under General Utility Functions | | 0 |
| Risk Sensitive Model-Based Reinforcement Learning using Uncertainty Guided Planning | | 0 |
| Risk-sensitive Reinforcement Learning | | 0 |
| Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path | | 0 |
| Risk-Sensitive Reinforcement Learning via Policy Gradient Search | | 0 |
| Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty | | 0 |
| Risk-Sensitive Reinforcement Learning Applied to Control under Constraints | | 0 |
| Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions | | 0 |
| Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret | | 0 |
| Risk-Sensitive Reinforcement Learning with Exponential Criteria | | 0 |
| RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations | | 0 |
| RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks | | 0 |
| RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models | | 0 |
| RL4ReAl: Reinforcement Learning for Register Allocation | | 0 |
| RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban Environments | | 0 |
| RLAD: Time Series Anomaly Detection through Reinforcement Learning and Active Learning | | 0 |
| RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents | | 0 |
| R-LAtte: Attention Module for Visual Control via Reinforcement Learning | | 0 |
| RL-Based Cargo-UAV Trajectory Planning and Cell Association for Minimum Handoffs, Disconnectivity, and Energy Consumption | | 0 |
| RL-based Control of UAS Subject to Significant Disturbance | | 0 |
| RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies | | 0 |
| RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems | | 0 |
| RLCache: Automated Cache Management Using Reinforcement Learning | | 0 |
| RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation | | 0 |
| RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning | | 0 |
| RLCFR: Minimize Counterfactual Regret by Deep Reinforcement Learning | | 0 |
| RL-Controller: a reinforcement learning framework for active structural control | | 0 |
| RLCorrector: Reinforced Proofreading for Cell-level Microscopy Image Segmentation | | 0 |
| RL-CoSeg : A Novel Image Co-Segmentation Algorithm with Deep Reinforcement Learning | | 0 |
| RL-CycleGAN: Reinforcement Learning Aware Simulation-To-Real | | 0 |
| RL-DAUNCE: Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles | | 0 |
| RL-DistPrivacy: Privacy-Aware Distributed Deep Inference for low latency IoT systems | | 0 |
| RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning | | 0 |
| RL-GA: A Reinforcement Learning-Based Genetic Algorithm for Electromagnetic Detection Satellite Scheduling Problem | | 0 |
| R-learning in actor-critic model offers a biologically relevant mechanism for sequential decision-making | | 0 |
| RLEEGNet: Integrating Brain-Computer Interfaces with Adaptive AI for Intuitive Responsiveness and High-Accuracy Motor Imagery Classification | | 0 |
| RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning | | 0 |
| RL for Consistency Models: Faster Reward Guided Text-to-Image Generation | | 0 |
| RL-GPT: Integrating Reinforcement Learning and Code-as-policy | | 0 |
Page 237 of 303

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |