Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1121–1130 of 15113 papers

Title	Date	Tasks	Status	Hype
Preventive Energy Management for Distribution Systems Under Uncertain Events: A Deep Reinforcement Learning Approach	Dec 26, 2024	Deep Reinforcement Learningenergy management	—Unverified	0
Provably Efficient Exploration in Reward Machines with Low Regret	Dec 26, 2024	Efficient ExplorationReinforcement Learning (RL)	—Unverified	0
Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning	Dec 26, 2024	Decision MakingDeep Reinforcement Learning	—Unverified	0
A Reinforcement Learning-Based Task Mapping Method to Improve the Reliability of Clustered Manycores	Dec 26, 2024	Q-LearningReinforcement Learning (RL)	—Unverified	0
xSRL: Safety-Aware Explainable Reinforcement Learning -- Safety as a Product of Explainability	Dec 26, 2024	Autonomous VehiclesReinforcement Learning (RL)	CodeCode Available	0
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs	Dec 25, 2024	Reinforcement Learning (RL)	CodeCode Available	5
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL	Dec 25, 2024	Offline RLReinforcement Learning (RL)	CodeCode Available	0
Quantum framework for Reinforcement Learning: Integrating Markov decision process, quantum arithmetic, and trajectory search	Dec 24, 2024	Computational EfficiencyDecision Making	—Unverified	0
Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization	Dec 24, 2024	Offline RLReinforcement Learning (RL)	—Unverified	0
Reinforcement Learning for Motor Control: A Comprehensive Review	Dec 23, 2024	reinforcement-learningReinforcement Learning	—Unverified	0

Show:10 25 50

← PrevPage 113 of 1512Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified