SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 96519675 of 15113 papers

TitleStatusHype
Deep Reinforcement Learning for Multi-Truck Vehicle Routing Problems with Multi-Leg Demand Routes0
Deep Reinforcement Learning for Multi-user Massive MIMO with Channel Aging0
Deep Reinforcement Learning for Navigation in AAA Video Games0
Deep Reinforcement Learning for Neural Control0
Deep Reinforcement Learning for NLP0
Deep Reinforcement Learning for On-line Dialogue State Tracking0
DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems0
Deep Reinforcement Learning for Online Control of Stochastic Partial Differential Equations0
Deep Reinforcement Learning for Online Routing of Unmanned Aerial Vehicles with Wireless Power Transfer0
Deep Reinforcement Learning for Online Error Detection in Cyber-Physical Systems0
Deep reinforcement learning for optical systems: A case study of mode-locked lasers0
Deep Reinforcement Learning for Optimal Control of Space Heating0
Deep Reinforcement Learning for Optimal Critical Care Pain Management with Morphine using Dueling Double-Deep Q Networks0
Deep Reinforcement Learning for Optimal Investment and Saving Strategy Selection in Heterogeneous Profiles: Intelligent Agents working towards retirement0
Deep Reinforcement Learning for Optimal Power Flow with Renewables Using Graph Information0
Deep reinforcement learning for optimal well control in subsurface systems with uncertain geology0
Deep Reinforcement Learning for Optimizing RIS-Assisted HD-FD Wireless Systems0
Deep Reinforcement Learning for Option Replication and Hedging0
Deep Reinforcement Learning for Organ Localization in CT0
Deep Reinforcement Learning for Orienteering Problems Based on Decomposition0
Deep Reinforcement Learning for Page-wise Recommendations0
Deep Reinforcement Learning for Personalized Search Story Recommendation0
Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module0
Deep Reinforcement Learning for Power Control in Next-Generation WiFi Network Systems0
Domain-adapted Learning and Imitation: DRL for Power Arbitrage0
Show:102550
← PrevPage 387 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified