Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3061–3070 of 15113 papers

Title	Date	Tasks	Status
Reinforced Self-Training (ReST) for Language Modeling	Aug 17, 2023	Language ModelingLanguage Modelling	—Unverified
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games	Aug 17, 2023	Multi-agent Reinforcement LearningQ-Learning	—Unverified
ReProHRL: Towards Multi-Goal Navigation in the Real World using Hierarchical Agents	Aug 17, 2023	reinforcement-learningReinforcement Learning	—Unverified
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making	Aug 17, 2023	Decision MakingImitation Learning	—Unverified
Partially Observable Multi-Agent Reinforcement Learning with Information Sharing	Aug 16, 2023	Computational EfficiencyMulti-agent Reinforcement Learning	—Unverified
Planning to Learn: A Novel Algorithm for Active Learning during Model-Based Planning	Aug 15, 2023	Active Learningcounterfactual	CodeCode Available
A Reinforcement Learning Approach for Performance-aware Reduction in Power Consumption of Data Center Compute Nodes	Aug 15, 2023	ManagementReinforcement Learning (RL)	CodeCode Available
Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World	Aug 15, 2023	Offline RLreinforcement-learning	—Unverified
On-demand Cold Start Frequency Reduction with Off-Policy Reinforcement Learning in Serverless Computing	Aug 15, 2023	Cloud ComputingCPU	—Unverified
ACRE: Actor-Critic with Reward-Preserving Exploration	Aug 14, 2023	continuous-controlContinuous Control	CodeCode Available

Show:10 25 50

← PrevPage 307 of 1512Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified