SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 90269050 of 15113 papers

TitleStatusHype
Application of deep reinforcement learning for Indian stock trading automation0
Learning and Information in Stochastic Networks and Queues0
Gym-ANM: Open-source software to leverage reinforcement learning for power system management in research and education0
Adaptive ABAC Policy Learning: A Reinforcement Learning Approach0
Behavior-based Neuroevolutionary Training in Reinforcement LearningCode0
Generic Itemset Mining Based on Reinforcement LearningCode0
Mean Field Games Flock! The Reinforcement Learning Way0
RL-GRIT: Reinforcement Learning for Grammar Inference0
RAIDER: Reinforcement-aided Spear Phishing Detector0
Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting0
Model-Based Offline Planning with Trajectory PruningCode0
DRAS-CQSim: A Reinforcement Learning based Framework for HPC Cluster Scheduling0
Regret Minimization Experience Replay in Off-Policy Reinforcement LearningCode0
Ordering-Based Causal Discovery with Reinforcement Learning0
Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning0
A Heuristically Assisted Deep Reinforcement Learning Approach for Network Slice Placement0
Efficient PAC Reinforcement Learning in Regular Decision Processes0
Feature-Based Interpretable Reinforcement Learning based on State-Transition Models0
Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning0
Intelligence and Unambitiousness Using Algorithmic Information Theory0
Online Algorithms and Policies Using Adaptive and Machine Learning Approaches0
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning0
Reinforcement Learning Based Safe Decision Making for Highway Autonomous Driving0
Principled Exploration via Optimistic Bootstrapping and Backward InductionCode0
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective0
Show:102550
← PrevPage 362 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified