SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1075110775 of 15113 papers

TitleStatusHype
Control-Aware Representations for Model-based Reinforcement Learning0
Deep Reinforcement Learning Control for Radar Detection and Tracking in Congested Spectral Environments0
Batch-Constrained Reinforcement Learning for Dynamic Distribution Network Reconfiguration0
Environment Shaping in Reinforcement Learning using State Abstraction0
Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty0
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping0
Online Multi-agent Reinforcement Learning for Decentralized Inverter-based Volt-VAR Control0
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning0
Show me the Way: Intrinsic Motivation from Demonstrations0
On the Relationship Between Active Inference and Control as Inference0
Near-Optimal Reinforcement Learning with Self-Play0
Provably Efficient Causal Reinforcement Learning with Confounded Observational Data0
Sample-Efficient Reinforcement Learning of Undercomplete POMDPs0
QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning0
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret0
Constrained Combinatorial Optimization with Reinforcement Learning0
Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning with Application to Autonomous Driving0
dm_control: Software and Tasks for Continuous Control0
Ecological Reinforcement Learning0
Accelerated Deep Reinforcement Learning Based Load Shedding for Emergency Voltage Control0
Automated Optical Multi-layer Design via Deep Reinforcement LearningCode0
Reinforcement Learning for Mean Field Games with Strategic Complementarities0
Gradient-EM Bayesian Meta-learning0
Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis0
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning0
Show:102550
← PrevPage 431 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified