SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 33263350 of 15113 papers

TitleStatusHype
Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information0
Deception in Social Learning: A Multi-Agent Reinforcement Learning Perspective0
Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots0
Deceptive Reinforcement Learning for Privacy-Preserving Planning0
Deceptive Reinforcement Learning in Model-Free Domains0
Deep Reinforcement Learning for Smart Home Energy Management0
CQM: Curriculum Reinforcement Learning with a Quantized World Model0
Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning0
Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems0
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making0
Decision-making at Unsignalized Intersection for Autonomous Vehicles: Left-turn Maneuver with Deep Reinforcement Learning0
Decision-making for Autonomous Vehicles on Highway: Deep Reinforcement Learning with Continuous Action Horizon0
Decision Making in Non-Stationary Environments with Policy-Augmented Monte Carlo Tree Search0
Decision-making Strategy on Highway for Autonomous Vehicles using Deep Reinforcement Learning0
Attention-driven Robotic Manipulation0
Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling0
Decision SpikeFormer: Spike-Driven Transformer for Decision Making0
Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels0
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning0
Decision Transformers for RIS-Assisted Systems with Diffusion Model-Based Channel Acquisition0
Decoding Molecular Graph Embeddings with Reinforcement Learning0
Decoding Polar Codes with Reinforcement Learning0
Decoding surface codes with deep reinforcement learning and probabilistic policy reuse0
Attention Routing: track-assignment detailed routing using attention-based reinforcement learning0
CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks0
Show:102550
← PrevPage 134 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified