SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 36013625 of 15113 papers

TitleStatusHype
Deep Reinforcement Learning for Data-Driven Adaptive Scanning in Ptychography0
Deep Reinforcement Learning for Day-to-day Dynamic Tolling in Tradable Credit Schemes0
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey0
Automatic Text Summarization Using Reinforcement Learning with Embedding Features0
Deep Reinforcement Learning for DER Cyber-Attack Mitigation0
Deep Reinforcement Learning for Detecting Malicious Websites0
Cost-Sensitive Exploration in Bayesian Reinforcement Learning0
Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks0
Deep Reinforcement Learning for Distributed Uncoordinated Cognitive Radios Resource Allocation0
Deep Reinforcement Learning for Distributed and Uncoordinated Cognitive Radios Resource Allocation0
A State Representation for Diminishing Rewards0
Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data0
Deep Reinforcement Learning for Dynamic Spectrum Sensing and Aggregation in Multi-Channel Wireless Networks0
Deep Reinforcement Learning for Dynamic Spectrum Sharing of LTE and NR0
CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning0
Cost-Effective Two-Stage Network Slicing for Edge-Cloud Orchestrated Vehicular Networks0
A State Representation Dueling Network for Deep Reinforcement Learning0
Deep Reinforcement Learning for Dynamic Urban Transportation Problems0
Agent-Agnostic Human-in-the-Loop Reinforcement Learning0
Deep Reinforcement Learning for Electric Vehicle Routing Problem with Time Windows0
Deep Reinforcement Learning for Entity Alignment0
Deterministic Value-Policy Gradients0
Deep Reinforcement Learning for Equal Risk Pricing and Hedging under Dynamic Expectile Risk Measures0
Adaptive Load Shedding for Grid Emergency Control via Deep Reinforcement Learning0
A State Augmentation based approach to Reinforcement Learning from Human Preferences0
Show:102550
← PrevPage 145 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified