SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 82768300 of 15113 papers

TitleStatusHype
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise RolloutsCode1
Reward prediction for representation learning and reward shaping0
Deep Graph Convolutional Reinforcement Learning for Financial Portfolio Management -- DeepPocket0
A Reinforcement Learning-based Economic Model Predictive Control Framework for Autonomous Operation of Chemical Reactors0
Meta-Learning-Based Deep Reinforcement Learning for Multiobjective Optimization ProblemsCode1
Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning0
Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance0
Solving Sokoban with forward-backward reinforcement learning0
Survey on Multi-Agent Q-Learning frameworks for resource management in wireless sensor network0
UVIP: Model-Free Approach to Evaluate Reinforcement Learning AlgorithmsCode0
Learning Algorithms for Regenerative Stopping Problems with Applications to Shipping Consolidation in Logistics0
Reinforcement Learning for Scalable Logic Optimization with Graph Neural Networks0
On the Linear convergence of Natural Policy Gradient Algorithm0
On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning0
Data-Efficient Reinforcement Learning for Malaria Control0
Deep Reinforcement Learning for Adaptive Exploration of Unknown EnvironmentsCode1
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference0
Learning swimming escape patterns for larval fish under energy constraints0
Hierarchical Reinforcement Learning for Air-to-Air Combat0
Robotic Surgery With Lean Reinforcement LearningCode0
RL-IoT: Reinforcement Learning to Interact with IoT DevicesCode1
Reinforcement Learning for Ridesharing: An Extended Survey0
Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning0
CARL-DTN: Context Adaptive Reinforcement Learning based Routing Algorithm in Delay Tolerant Network0
InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem0
Show:102550
← PrevPage 332 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified