Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal of reinforcement learning is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
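To make the interaction loop concrete, here is a minimal sketch in Python. Everything in it is an illustrative assumption rather than part of any listed paper: the five-state corridor environment, the reward values, the discount factor `gamma`, and the names `step`, `rollout`, `always_right`, and `always_left` are all hypothetical. It only shows how a policy's long-term reward can be measured by running it in an environment; it does not learn a policy.

```python
from typing import Callable, Tuple

# Hypothetical toy environment: a corridor with states 0..4 and a goal at state 4.
N_STATES = 5
GOAL = N_STATES - 1


def step(state: int, action: int) -> Tuple[int, float, bool]:
    """Apply an action (-1 = left, +1 = right) and return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), GOAL)  # walls at both ends
    done = next_state == GOAL
    reward = 1.0 if done else -0.1  # assumed: small penalty per step, reward at the goal
    return next_state, reward, done


def rollout(policy: Callable[[int], int], gamma: float = 0.9, max_steps: int = 20) -> float:
    """Run one episode from state 0 and return the discounted cumulative reward."""
    state, total, discount = 0, 0.0, 1.0
    for _ in range(max_steps):
        action = policy(state)                    # agent chooses an action
        state, reward, done = step(state, action)  # environment responds with feedback
        total += discount * reward
        discount *= gamma
        if done:
            break
    return total


def always_right(state: int) -> int:
    """Fixed policy that always moves toward the goal."""
    return 1


def always_left(state: int) -> int:
    """Fixed policy that always moves away from the goal."""
    return -1


if __name__ == "__main__":
    print("always_right return:", rollout(always_right))
    print("always_left return:", rollout(always_left))
```

In this toy setting the policy that heads for the goal collects a higher discounted return than the one that walks into the wall, which is the sense in which one policy is "better" than another. An RL algorithm would search over policies instead of comparing two fixed ones.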

Papers

Showing 1091–1100 of 15113 papers

Title	Date	Tasks	Status	Hype
Digital Twin Aided Channel Estimation: Zone-Specific Subspace Prediction and Calibration	Jan 6, 2025	Reinforcement Learning (RL)	Code Available	0
Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes	Jan 6, 2025	Reinforcement Learning (RL)	Unverified	0
Interpretable Recognition of Fused Magnesium Furnace Working Conditions with Deep Convolutional Stochastic Configuration Networks	Jan 6, 2025	Reinforcement Learning (RL)	Unverified	0
Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies	Jan 6, 2025	Decision Making, Deep Reinforcement Learning	Code Available	1
Sim-to-Real Transfer for Mobile Robots with Reinforcement Learning: from NVIDIA Isaac Sim to Gazebo and Real ROS 2 Robots	Jan 6, 2025	Deep Reinforcement Learning, Reinforcement Learning (RL)	Code Available	2
AMM: Adaptive Modularized Reinforcement Model for Multi-city Traffic Signal Control	Jan 5, 2025	Domain Adaptation, Meta-Learning	Unverified	0
Representation Convergence: Mutual Distillation is Secretly a Form of Regularization	Jan 5, 2025	Deep Reinforcement Learning, Form	Code Available	0
A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model	Jan 5, 2025	Reinforcement Learning (RL)	Unverified	0
SR-Reward: Taking The Path More Traveled	Jan 4, 2025	D4RL, Imitation Learning	Unverified	0
On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures	Jan 3, 2025	Offline RL, Reinforcement Learning (RL)	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified