SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1007610100 of 15113 papers

TitleStatusHype
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning0
Divide-and-Conquer Monte Carlo Tree Search0
Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games0
DIVINE: A Generative Adversarial Imitation Learning Framework for Knowledge Graph Reasoning0
DJ-MC: A Reinforcement-Learning Agent for Music Playlist Recommendation0
DL-DRL: A double-level deep reinforcement learning approach for large-scale task scheduling of multi-UAV0
dm_control: Software and Tasks for Continuous Control0
DNN-Opt: An RL Inspired Optimization for Analog Circuit Sizing using Deep Neural Networks0
Do Androids Dream of Electric Fences? Safety-Aware Reinforcement Learning with Latent Shielding0
Do Artificial Reinforcement-Learning Agents Matter Morally?0
Do as I can, not as I get0
Do Autonomous Agents Benefit from Hearing?0
DOB-Net: Actively Rejecting Unknown Excessive Time-Varying Disturbances0
Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI0
Do Deep Reinforcement Learning Algorithms really Learn to Navigate?0
Does Explicit Prediction Matter in Deep Reinforcement Learning-Based Energy Management?0
How Does an Approximate Model Help in Reinforcement Learning?0
Does Sparsity Help in Learning Misspecified Linear Bandits?0
Domain Adaptation for Deep Reinforcement Learning in Visually Distinct Games0
Domain Adaptation for Offline Reinforcement Learning with Limited Samples0
Domain Adaptation for Reinforcement Learning on the Atari0
Domain Adaptation of Reinforcement Learning Agents based on Network Service Proximity0
DOMAIN ADAPTATION VIA DISTRIBUTION AND REPRESENTATION MATCHING: A CASE STUDY ON TRAINING DATA SELECTION VIA REINFORCEMENT LEARNING0
Domain Adapting Deep Reinforcement Learning for Real-world Speech Emotion Recognition0
Domain Adaptive Fake News Detection via Reinforcement Learning0
Show:102550
← PrevPage 404 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified