
Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
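The interaction loop described above can be illustrated with a minimal sketch. It assumes a hypothetical five-state chain environment and uses tabular Q-learning, which is only one of many RL algorithms and is not taken from any paper listed on this page. The names `step`, `train`, and `greedy_policy`, and all hyperparameter values, are illustrative choices.

```python
import random

N_STATES = 5      # hypothetical toy task: states 0..4, state 4 is terminal
ACTIONS = (0, 1)  # 0 = step left, 1 = step right


def step(state, action):
    """Toy environment transition: returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # reward only on reaching the goal state
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore
            else:
                best = max(q[state])          # exploit, breaking ties randomly
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action in each non-terminal state under the learned values."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
print(policy)
```

In this toy setting the learned greedy policy should pick action 1 (step right) in every non-terminal state, since that is the shortest route to the only reward. The discount factor `gamma` is what makes the agent prefer reaching the reward sooner rather than later.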

Papers


Showing 1411–1420 of 15113 papers

Title	Date	Tasks	Status	Hype
Conservative Offline Distributional Reinforcement Learning	Jul 12, 2021	D4RL, Distributional Reinforcement Learning	Code Available	1
Explore and Control with Adversarial Surprise	Jul 12, 2021	reinforcement-learning, Reinforcement Learning	Code Available	1
Out-of-Distribution Dynamics Detection: RL-Relevant Benchmarks and Results	Jul 11, 2021	Reinforcement Learning (RL), Time Series	Code Available	1
Backprop-Free Reinforcement Learning with Active Neural Generative Coding	Jul 10, 2021	Q-Learning, reinforcement-learning	Code Available	1
BayesSimIG: Scalable Parameter Inference for Adaptive Domain Randomization with IsaacGym	Jul 9, 2021	GPU, Reinforcement Learning (RL)	Code Available	1
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers	Jul 8, 2021	Reinforcement Learning (RL)	Code Available	1
Offline Meta-Reinforcement Learning with Online Self-Supervision	Jul 8, 2021	Meta Reinforcement Learning, Offline RL	Code Available	1
Distributed Online Service Coordination Using Deep Reinforcement Learning	Jul 7, 2021	Deep Reinforcement Learning, reinforcement-learning	Code Available	1
THE SJTU SYSTEM FOR DCASE2021 CHALLENGE TASK 6: AUDIO CAPTIONING BASED ON ENCODER PRE-TRAINING AND REINFORCEMENT LEARNING	Jul 6, 2021	Audio captioning, Audio Tagging	Code Available	1
Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning	Jul 6, 2021	Deep Reinforcement Learning, MuJoCo	Code Available	1



Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified