SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
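
The agent-environment loop described above can be illustrated with a minimal sketch. It assumes a hypothetical five-state chain environment and uses tabular Q-learning with epsilon-greedy exploration as one possible learning rule; the environment, the function names (`step`, `train`, `greedy_policy`), and the hyperparameters are illustrative choices and do not come from any paper listed on this page.

```python
import random

# Hypothetical toy environment: a chain of states 0..4, where state 4 is the goal.
N_STATES = 5
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Environment dynamics: return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # reward only on reaching the goal
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning; all hyperparameters are illustrative assumptions."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: explore with probability epsilon, else exploit.
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                best = max(q[state])
                # Break ties randomly so the untrained agent still moves around.
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action in each non-terminal state according to the learned values."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
print(policy)
```

In this sketch the reward arrives only at the goal state, so the agent has to propagate that signal backward through its value estimates. After training, the greedy policy should move right in every non-terminal state, which is the reward-maximizing strategy for this toy chain.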

Papers

Showing 10101-10125 of 15113 papers

Title | Status | Hype
Domain Adversarial Reinforcement Learning | | 0
Domain Adversarial Reinforcement Learning for Partial Domain Adaptation | | 0
Domain Generalization for Robust Model-Based Offline Reinforcement Learning | | 0
Domain-Independent Optimistic Initialization for Reinforcement Learning | | 0
Domain Knowledge-Based Automated Analog Circuit Design with Deep Reinforcement Learning | | 0
Domain Knowledge Integration By Gradient Matching For Sample-Efficient Reinforcement Learning | | 0
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning | | 0
Domain Randomization for Robust, Affordable and Effective Closed-loop Control of Soft Robots | | 0
Domain Randomization via Entropy Maximization | | 0
Dominion: A New Frontier for AI Research | | 0
Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition | | 0
Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning | | 0
Don't do it: Safer Reinforcement Learning With Rule-based Guidance | | 0
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL | | 0
Don't Forget Your Teacher: A Corrective Reinforcement Learning Framework | | 0
Don't Get Yourself into Trouble! Risk-aware Decision-Making for Autonomous Vehicles | | 0
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning | | 0
Don't Until the Final Verb Wait: Reinforcement Learning for Simultaneous Machine Translation | | 0
DOOM: A Novel Adversarial-DRL-Based Op-Code Level Metamorphic Malware Obfuscator for the Enhancement of IDS | | 0
DOP: Deep Optimistic Planning with Approximate Value Function Evaluation | | 0
Do recent advancements in model-based deep reinforcement learning really improve data efficiency? | | 0
Importance of using appropriate baselines for evaluation of data-efficiency in deep reinforcement learning for Atari | | 0
Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation | | 0
Double A3C: Deep Reinforcement Learning on OpenAI Gym Games | | 0
Double Deep Q Networks for Sensor Management in Space Situational Awareness | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | | Unverified