SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 33763400 of 15113 papers

TitleStatusHype
Augmented Lagrangian-Based Safe Reinforcement Learning Approach for Distribution System Volt/VAR Control0
DEEP ADVERSARIAL FORWARD MODEL0
Deep Adversarial Reinforcement Learning for Object Disentangling0
DeepAGREL: Biologically plausible deep learning via direct reinforcement0
Deep Anomaly Detection and Search via Reinforcement Learning0
Deep Apprenticeship Learning for Playing Games0
Deep-Attack over the Deep Reinforcement Learning0
COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning0
Adaptive Control of Differentially Private Linear Quadratic Systems0
AISYN: AI-driven Reinforcement Learning-Based Logic Synthesis Framework0
Deep Bellman Hedging0
Deep Binary Reinforcement Learning for Scalable Verification0
Cover Tree Bayesian Reinforcement Learning0
DeepCAS: A Deep Reinforcement Learning Algorithm for Control-Aware Scheduling0
Deep Coherent Exploration For Continuous Control0
Accelerating the Learning of TAMER with Counterfactual Explanations0
Deep Reinforcement Learning in Computer Vision: A Comprehensive Survey0
Augmenting Automated Game Testing with Deep Reinforcement Learning0
DeepCQ+: Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for Highly Dynamic Networks0
DeepCrawl: Deep Reinforcement Learning for Turn-based Strategy Games0
Deep Curiosity Loops in Social Environments0
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability0
Deep Reinforcement Learning in mmW-NOMA: Joint Power Allocation and Hybrid Beamforming0
Deep differentiable reinforcement learning and optimal trading0
Deep Reinforcement Learning Models Predict Visual Responses in the Brain: A Preliminary Result0
Show:102550
← PrevPage 136 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified