SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 60016025 of 15113 papers

TitleStatusHype
Simultaneous Translation with Flexible Policy via Restricted Imitation Learning0
Solving Collaborative Dec-POMDPs with Deep Reinforcement Learning Heuristics0
Single-Agent vs. Multi-Agent Techniques for Concurrent Reinforcement Learning of Negotiation Dialogue Policies0
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial0
Single Cell Training on Architecture Search for Image Denoising0
Single-Loop Federated Actor-Critic across Heterogeneous Environments0
Single photon in hierarchical architecture for physical reinforcement learning: Photon intelligence0
Single-Shot Pruning for Offline Reinforcement Learning0
Data-Incremental Continual Offline Reinforcement Learning0
Single-Trajectory Distributionally Robust Reinforcement Learning0
STEEL: Singularity-aware Reinforcement Learning0
Singular Perturbation-based Reinforcement Learning of Two-Point Boundary Optimal Control Systems0
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks0
Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning0
Sketch-Based Linear Value Function Approximation0
Sketch-to-Skill: Bootstrapping Robot Learning with Human Drawn Trajectory Sketches0
Skill-based Meta-Reinforcement Learning0
Skill-based Model-based Reinforcement Learning0
Skill-Critic: Refining Learned Skills for Hierarchical Reinforcement Learning0
Skill Discovery in Continuous Reinforcement Learning Domains using Skill Chaining0
Skill Discovery of Coordination in Multi-agent Reinforcement Learning0
Skilled Experience Catalogue: A Skill-Balancing Mechanism for Non-Player Characters using Reinforcement Learning0
Skill-Enhanced Reinforcement Learning Acceleration from Demonstrations0
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration0
Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning0
Show:102550
← PrevPage 241 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified