SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1005110075 of 15113 papers

TitleStatusHype
Distributional Reinforcement Learning with Ensembles0
Distributional Reinforcement Learning with Monotonic Splines0
Distributional Reinforcement Learning with Dual Expectile-Quantile Regression0
Distributional Reinforcement Learning with Online Risk-awareness Adaption0
Distributional Reward Decomposition for Reinforcement Learning0
Distributional Robustness and Regularization in Reinforcement Learning0
DSAC: Distributional Soft Actor Critic for Risk-Sensitive Reinforcement Learning0
Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios0
Distributive Dynamic Spectrum Access through Deep Reinforcement Learning: A Reservoir Computing Based Approach0
District Cooling System Control for Providing Operating Reserve based on Safe Deep Reinforcement Learning0
Disturbing Reinforcement Learning Agents with Corrupted Rewards0
DITTO: Offline Imitation Learning with World Models0
Divergence-Regularized Multi-Agent Actor-Critic0
Divergent representations of ethological visual inputs emerge from supervised, unsupervised, and reinforcement learning0
Diverse Exploration for Fast and Safe Policy Improvement0
Diverse Priors for Deep Reinforcement Learning0
Diverse Projection Ensembles for Distributional Reinforcement Learning0
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning0
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches0
Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement0
Diversity-Aware Policy Optimization for Large Language Model Reasoning0
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning0
Diversity Progress for Goal Selection in Discriminability-Motivated RL0
Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation0
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition0
Show:102550
← PrevPage 403 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified