Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3351–3375 of 15113 papers

Title	Date	Tasks	Status	Score
Active Object Localization with Deep Reinforcement Learning	Nov 18, 2015	Active Object LocalizationDeep Reinforcement Learning	CodeCode Available	5
Handling Delay in Real-Time Reinforcement Learning	Mar 30, 2025	MuJoCoreinforcement-learning	CodeCode Available	5
Hierarchical Decentralized Deep Reinforcement Learning Architecture for a Simulated Four-Legged Agent	Sep 21, 2022	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	5
LEACH-RLC: Enhancing IoT Data Transmission with Optimized Clustering and Reinforcement Learning	Jan 28, 2024	Clusteringreinforcement-learning	CodeCode Available	5
Gym-Ignition: Reproducible Robotic Simulations for Reinforcement Learning	Nov 5, 2019	OpenAI Gymreinforcement-learning	CodeCode Available	5
Deconfounding Actor-Critic Network with Policy Adaptation for Dynamic Treatment Regimes	May 19, 2022	Reinforcement Learning (RL)	CodeCode Available	5
Compositional Learning of Visually-Grounded Concepts Using Reinforcement	Sep 8, 2023	Deep Reinforcement LearningNavigate	CodeCode Available	5
Deconfounding Reinforcement Learning in Observational Settings	Dec 26, 2018	OpenAI Gymreinforcement-learning	CodeCode Available	5
gym-gazebo2, a toolkit for reinforcement learning using ROS 2 and Gazebo	Mar 14, 2019	BenchmarkingOpenAI Gym	CodeCode Available	5
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning	Apr 6, 2024	D4RLOffline RL	CodeCode Available	5
Guiding Evolutionary Strategies by Differentiable Robot Simulators	Oct 1, 2021	reinforcement-learningReinforcement Learning (RL)	CodeCode Available	5
Actively Learning Costly Reward Functions for Reinforcement Learning	Nov 23, 2022	Active LearningDeep Reinforcement Learning	CodeCode Available	5
Guided Exploration in Reinforcement Learning via Monte Carlo Critic Optimization	Jun 25, 2022	continuous-controlContinuous Control	CodeCode Available	5
Guided Dialogue Policy Learning without Adversarial Learning in the Loop	Nov 1, 2020	reinforcement-learningReinforcement Learning	CodeCode Available	5
Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics	Jan 24, 2019	reinforcement-learningReinforcement Learning	CodeCode Available	5
Decoupling regularization from the action space	Jun 10, 2024	Reinforcement Learning (RL)	CodeCode Available	5
Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents	May 22, 2018	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	5
Composable Deep Reinforcement Learning for Robotic Manipulation	Mar 19, 2018	Deep Reinforcement LearningQ-Learning	CodeCode Available	5
Guided Dialog Policy Learning without Adversarial Learning in the Loop	Apr 7, 2020	Reinforcement LearningReinforcement Learning (RL)	CodeCode Available	5
Neural Logic Reinforcement Learning	Apr 24, 2019	Deep Reinforcement LearningInductive logic programming	CodeCode Available	5
Guided Policy Optimization under Partial Observability	May 21, 2025	continuous-controlContinuous Control	CodeCode Available	5
Complex Model Transformations by Reinforcement Learning with Uncertain Human Guidance	Jun 25, 2025	Reinforcement Learning (RL)	CodeCode Available	5
A Reinforcement Learning Approach for Performance-aware Reduction in Power Consumption of Data Center Compute Nodes	Aug 15, 2023	ManagementReinforcement Learning (RL)	CodeCode Available	5
Guided Deep Reinforcement Learning for Swarm Systems	Sep 18, 2017	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	5
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout	Sep 24, 2023	Hierarchical Reinforcement Learningreinforcement-learning	CodeCode Available	5

Show:10 25 50

← PrevPage 135 of 605Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified