SOTAVerified|Agents Browse Leaderboard About

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1121–1130 of 15113 papers

Title	Date	Tasks	Status	Hype
Efficient Risk-Averse Reinforcement Learning	May 10, 2022	Autonomous Drivingreinforcement-learning	CodeCode Available	1
Gamma and Vega Hedging Using Deep Distributional Reinforcement Learning	May 10, 2022	Distributional Reinforcement LearningPosition	CodeCode Available	1
DxFormer: A Decoupled Automatic Diagnostic System Based on Decoder-Encoder Transformer with Dense Symptom Representations	May 8, 2022	DecoderDiagnostic	CodeCode Available	1
Learning to Brachiate via Simplified Model Imitation	May 8, 2022	Humanoid Controlmodel	CodeCode Available	1
Multivariate Prediction Intervals for Random Forests	May 4, 2022	PredictionPrediction Intervals	CodeCode Available	1
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning	May 2, 2022	Data AugmentationQ-Learning	CodeCode Available	1
Large Neighborhood Search based on Neural Construction Heuristics	May 2, 2022	reinforcement-learningReinforcement Learning (RL)	CodeCode Available	1
TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning	Apr 30, 2022	reinforcement-learningReinforcement Learning	CodeCode Available	1
Accelerating Robot Learning of Contact-Rich Manipulations: A Curriculum Learning Study	Apr 27, 2022	Contact-rich ManipulationReinforcement Learning (RL)	CodeCode Available	1
RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning	Apr 26, 2022	Offline RLreinforcement-learning	CodeCode Available	1

Show:10 25 50

← PrevPage 113 of 1512Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PPG	Mean Normalized Performance	0.76	—	Unverified
2	PPO	Mean Normalized Performance	0.58	—	Unverified