SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 33413350 of 15113 papers

TitleStatusHype
Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling0
Decision SpikeFormer: Spike-Driven Transformer for Decision Making0
Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels0
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning0
Decision Transformers for RIS-Assisted Systems with Diffusion Model-Based Channel Acquisition0
Decoding Molecular Graph Embeddings with Reinforcement Learning0
Decoding Polar Codes with Reinforcement Learning0
Decoding surface codes with deep reinforcement learning and probabilistic policy reuse0
Attention Routing: track-assignment detailed routing using attention-based reinforcement learning0
CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks0
Show:102550
← PrevPage 335 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified