Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the long-term reward.
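The interaction loop described above can be illustrated with a minimal sketch of tabular Q-learning, one common RL algorithm. Everything in it is an assumption made for illustration and does not come from any paper listed on this page: the five-state chain environment, the `step`, `train`, and `greedy_policy` helpers, and the hyperparameter values.

```python
import random

N_STATES = 5          # states 0..4; reaching state 4 ends the episode
ACTIONS = (-1, +1)    # index 0 moves left, index 1 moves right


def step(state, action):
    """Hypothetical toy environment: returns (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    # The only reward is 1.0 for reaching the terminal state.
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a table of action values q[state][action_index] by Q-learning."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: explore at random sometimes, and on ties.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])
            # Target is the reward plus the discounted best future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best action (-1 or +1) for each non-terminal state."""
    return [ACTIONS[0 if row[0] > row[1] else 1] for row in q[:-1]]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

In this toy setting the learned greedy policy moves right in every non-terminal state, which is the strategy that maximizes the discounted long-term reward. The discount factor `gamma` is what turns the single terminal reward into a long-term objective: states closer to the goal end up with higher values.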

Papers


Showing 3381–3390 of 15113 papers

Title	Date	Tasks	Status	Hype
Adaptive action supervision in reinforcement learning from real-world multi-agent demonstrations	May 22, 2023	Dynamic Time Warping, reinforcement-learning	Unverified	0
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation	May 22, 2023	Imitation Learning, Motion Planning	Code Available	2
Towards Optimal Energy Management Strategy for Hybrid Electric Vehicle with Reinforcement Learning	May 21, 2023	energy management, Management	Unverified	0
BertRLFuzzer: A BERT and Reinforcement Learning Based Fuzzer	May 21, 2023	16k, reinforcement-learning	Code Available	0
SneakyPrompt: Jailbreaking Text-to-image Generative Models	May 20, 2023	Reinforcement Learning (RL), Semantic Similarity	Code Available	1
Model-based adaptation for sample efficient transfer in reinforcement learning control of parameter-varying systems	May 20, 2023	Model Predictive Control, reinforcement-learning	Unverified	0
Understanding the World to Solve Social Dilemmas Using Multi-Agent Reinforcement Learning	May 19, 2023	Multi-agent Reinforcement Learning, reinforcement-learning	Unverified	0
Learning Diverse Risk Preferences in Population-based Self-play	May 19, 2023	Diversity, reinforcement-learning	Code Available	1
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond	May 18, 2023	Q-Learning, Reinforcement Learning (RL)	Unverified	0
Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL	May 18, 2023	Reinforcement Learning (RL)	Unverified	0


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified