SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 65516600 of 15113 papers

TitleStatusHype
Distributional Decision Transformer for Hindsight Information Matching0
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning0
Distributionally Adaptive Meta Reinforcement Learning0
Distributionally-Constrained Policy Optimization via Unbalanced Optimal Transport0
Distributionally Robust Imitation Learning0
Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity0
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation0
Distributionally Robust Reinforcement Learning0
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm0
Distributional Method for Risk Averse Reinforcement Learning0
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning0
Distributional Reinforcement Learning for Efficient Exploration0
Distributional Reinforcement Learning for Risk-Sensitive Policies0
Distributional Reinforcement Learning for mmWave Communications with Intelligent Reflectors on a UAV0
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes0
Distributional reinforcement learning with linear function approximation0
Distributional Reinforcement Learning with Ensembles0
Distributional Reinforcement Learning with Monotonic Splines0
Distributional Reinforcement Learning with Dual Expectile-Quantile Regression0
Distributional Reinforcement Learning with Online Risk-awareness Adaption0
Distributional Reward Decomposition for Reinforcement Learning0
Distributional Robustness and Regularization in Reinforcement Learning0
DSAC: Distributional Soft Actor Critic for Risk-Sensitive Reinforcement Learning0
Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios0
Distributive Dynamic Spectrum Access through Deep Reinforcement Learning: A Reservoir Computing Based Approach0
District Cooling System Control for Providing Operating Reserve based on Safe Deep Reinforcement Learning0
Disturbing Reinforcement Learning Agents with Corrupted Rewards0
DITTO: Offline Imitation Learning with World Models0
Divergence-Regularized Multi-Agent Actor-Critic0
Divergent representations of ethological visual inputs emerge from supervised, unsupervised, and reinforcement learning0
Diverse Exploration for Fast and Safe Policy Improvement0
Diverse Priors for Deep Reinforcement Learning0
Diverse Projection Ensembles for Distributional Reinforcement Learning0
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning0
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches0
Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement0
Diversity-Aware Policy Optimization for Large Language Model Reasoning0
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning0
Diversity Progress for Goal Selection in Discriminability-Motivated RL0
Diversity-Promoting Deep Reinforcement Learning for Interactive Recommendation0
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition0
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning0
Divide-and-Conquer Monte Carlo Tree Search0
Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games0
DIVINE: A Generative Adversarial Imitation Learning Framework for Knowledge Graph Reasoning0
DJ-MC: A Reinforcement-Learning Agent for Music Playlist Recommendation0
DL-DRL: A double-level deep reinforcement learning approach for large-scale task scheduling of multi-UAV0
dm_control: Software and Tasks for Continuous Control0
DNN-Opt: An RL Inspired Optimization for Analog Circuit Sizing using Deep Neural Networks0
Do Androids Dream of Electric Fences? Safety-Aware Reinforcement Learning with Latent Shielding0
Show:102550
← PrevPage 132 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified