SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 71517175 of 15113 papers

TitleStatusHype
Improving Generalization of Deep Reinforcement Learning-based TSP Solvers0
Scalable Multi-Agent Reinforcement Learning for Residential Load Scheduling under Data Governance0
Hierarchical Potential-based Reward Shaping from Task SpecificationsCode0
Adaptive control of a mechatronic system using constrained residual reinforcement learning0
Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing ProblemCode1
Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning0
Decentralized Cooperative Lane Changing at Freeway Weaving Areas Using Multi-Agent Deep Reinforcement Learning0
Deep reinforcement learning for guidewire navigation in coronary artery phantom0
CARL: A Benchmark for Contextual and Adaptive Reinforcement LearningCode1
DeepEdge: A Deep Reinforcement Learning based Task Orchestrator for Edge Computing0
A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing0
A study of first-passage time minimization via Q-learning in heated gridworlds0
Dropout Q-Functions for Doubly Efficient Reinforcement LearningCode1
OTTR: Off-Road Trajectory Tracking using Reinforcement Learning0
NaRLE: Natural Language Models using Reinforcement Learning with Emotion Feedback0
Mining for Potent Inhibitors through Artificial Intelligence and Physics: A Unified Methodology for Ligand Based and Structure Based Drug Design0
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL0
Multi-Agent Path Planning Using Deep Reinforcement Learning0
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-EnsembleCode1
Reinforcement Learning for Admission Control in Wireless Virtual Network Embedding0
Large Batch Experience ReplayCode1
Behaviour-conditioned policies for cooperative reinforcement learning tasks0
A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes0
Learning to Assist Agents by Observing Them0
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation0
Show:102550
← PrevPage 287 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified