SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 15511560 of 15113 papers

TitleStatusHype
A Deep Reinforced Model for Abstractive SummarizationCode1
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement LearningCode1
Gated Hierarchical Attention for Image CaptioningCode1
Differentiable Trust Region Layers for Deep Reinforcement LearningCode1
Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement LearningCode1
Barrier Certified Safety Learning Control: When Sum-of-Square Programming Meets Reinforcement LearningCode1
A Cooperative Multi-Agent Reinforcement Learning Framework for Resource Balancing in Complex Logistics NetworkCode1
Batch Exploration with Examples for Scalable Robotic Reinforcement LearningCode1
Gaussian RAM: Lightweight Image Classification via Stochastic Retina-Inspired Glimpse and Reinforcement LearningCode1
Generalizable Visual Reinforcement Learning with Segment Anything ModelCode1
Show:102550
← PrevPage 156 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified