SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 65766600 of 15113 papers

TitleStatusHype
District Cooling System Control for Providing Operating Reserve based on Safe Deep Reinforcement Learning0
Disturbing Reinforcement Learning Agents with Corrupted Rewards0
DITTO: Offline Imitation Learning with World Models0
Divergence-Regularized Multi-Agent Actor-Critic0
Divergent representations of ethological visual inputs emerge from supervised, unsupervised, and reinforcement learning0
Diverse Exploration for Fast and Safe Policy Improvement0
Diverse Priors for Deep Reinforcement Learning0
Diverse Projection Ensembles for Distributional Reinforcement Learning0
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning0
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches0
Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement0
Diversity-Aware Policy Optimization for Large Language Model Reasoning0
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning0
Diversity Progress for Goal Selection in Discriminability-Motivated RL0
Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation0
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition0
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning0
Divide-and-Conquer Monte Carlo Tree Search0
Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games0
DIVINE: A Generative Adversarial Imitation Learning Framework for Knowledge Graph Reasoning0
DJ-MC: A Reinforcement-Learning Agent for Music Playlist Recommendation0
DL-DRL: A double-level deep reinforcement learning approach for large-scale task scheduling of multi-UAV0
dm_control: Software and Tasks for Continuous Control0
DNN-Opt: An RL Inspired Optimization for Analog Circuit Sizing using Deep Neural Networks0
Do Androids Dream of Electric Fences? Safety-Aware Reinforcement Learning with Latent Shielding0
Show:102550
← PrevPage 264 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified