SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 57265750 of 15113 papers

TitleStatusHype
SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning0
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering0
SAPO-RL: Sequential Actuator Placement Optimization for Fuselage Assembly via Reinforcement Learning0
SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning0
SAT-MARL: Specification Aware Training in Multi-Agent Reinforcement Learning0
SatNet: A Benchmark for Satellite Scheduling Optimization0
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation0
Say What I Want: Towards the Dark Side of Neural Dialogue Models0
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation0
^2-exploration for Reinforcement Learning0
Scaffolding Reflection in Reinforcement Learning Framework for Confinement Escape Problem0
Scalable and Incremental Learning of Gaussian Mixture Models0
Scalable and Sample Efficient Distributed Policy Gradient Algorithms in Multi-Agent Networked Systems0
Scalable Bayesian Inverse Reinforcement Learning by Auto-Encoding Reward0
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients0
Scalable Communication for Multi-Agent Reinforcement Learning via Transformer-Based Email Mechanism0
Scalable, Decentralized Multi-Agent Reinforcement Learning Methods Inspired by Stigmergy and Ant Colonies0
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games0
Scalable Deep Reinforcement Learning for Routing and Spectrum Access in Physical Layer0
Scalable Deep Reinforcement Learning for Ride-Hailing0
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot0
Scalable Evolution Strategies Pipeline for Solving the Vehicle Routing Problem0
Scalable Fragment-Based 3D Molecular Design with Reinforcement Learning0
Scalable Grid-Aware Dynamic Matching using Deep Reinforcement Learning0
Scalable Joint Learning of Wireless Multiple-Access Policies and their Signaling0
Show:102550
← PrevPage 230 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified