SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
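
As a rough illustration of the interaction loop described above, here is a minimal sketch using tabular Q-learning on a hypothetical five-state chain. The environment, the hyperparameters, and every function name (`step`, `train`, `greedy_policy`, `discounted_return`) are assumptions made for illustration; none of them come from the papers listed on this page.

```python
import random

N_STATES = 5        # hypothetical chain: states 0..4, state 4 is terminal
ACTIONS = (-1, 1)   # move left or move right
GAMMA = 0.9         # discount factor (assumed value)


def step(state, action):
    """Toy environment: returns (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def discounted_return(rewards, gamma=GAMMA):
    """Cumulative reward: r_0 + gamma * r_1 + gamma^2 * r_2 + ..."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def train(episodes=500, alpha=0.5, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))      # explore
            else:
                best = max(q[state])                 # exploit, random tie-break
                a = rng.choice([i for i, v in enumerate(q[state]) if v == best])
            next_state, reward, done = step(state, ACTIONS[a])
            # Bootstrapped target: reward plus discounted best next value.
            target = reward + (0.0 if done else GAMMA * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best action in each non-terminal state under the learned values."""
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda i: q[s][i])]
            for s in range(N_STATES - 1)]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

In this toy setting the only reward sits at the right end of the chain, so the learned policy should move right from every state; real benchmarks differ in state space, reward structure, and function approximation.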

Papers

Showing 12151-12200 of 15113 papers

Title | Status | Hype
Sample-Efficient Reinforcement Learning of Koopman eNMPC | - | 0
Sample-efficient reinforcement learning using deep Gaussian processes | - | 0
Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation | - | 0
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation | - | 0
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion | - | 0
Sample Efficient Reinforcement Learning with REINFORCE | - | 0
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost | - | 0
Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty | - | 0
Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions | - | 0
Sample Efficient Social Navigation Using Inverse Reinforcement Learning | - | 0
Sampling from Energy-based Policies using Diffusion | - | 0
Sampling Strategies for GAN Synthetic Data | - | 0
Sampling Through the Lens of Sequential Decision Making | - | 0
SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning | - | 0
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering | - | 0
SAPO-RL: Sequential Actuator Placement Optimization for Fuselage Assembly via Reinforcement Learning | - | 0
SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning | - | 0
SAT-MARL: Specification Aware Training in Multi-Agent Reinforcement Learning | - | 0
SatNet: A Benchmark for Satellite Scheduling Optimization | - | 0
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation | - | 0
Say What I Want: Towards the Dark Side of Neural Dialogue Models | - | 0
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation | - | 0
^2-exploration for Reinforcement Learning | - | 0
Scaffolding Reflection in Reinforcement Learning Framework for Confinement Escape Problem | - | 0
Scalable and Incremental Learning of Gaussian Mixture Models | - | 0
Scalable and Sample Efficient Distributed Policy Gradient Algorithms in Multi-Agent Networked Systems | - | 0
Scalable Bayesian Inverse Reinforcement Learning by Auto-Encoding Reward | - | 0
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients | - | 0
Scalable Communication for Multi-Agent Reinforcement Learning via Transformer-Based Email Mechanism | - | 0
Scalable, Decentralized Multi-Agent Reinforcement Learning Methods Inspired by Stigmergy and Ant Colonies | - | 0
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games | - | 0
Scalable Deep Reinforcement Learning for Routing and Spectrum Access in Physical Layer | - | 0
Scalable Deep Reinforcement Learning for Ride-Hailing | - | 0
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot | - | 0
Scalable Evolution Strategies Pipeline for Solving the Vehicle Routing Problem | - | 0
Scalable Fragment-Based 3D Molecular Design with Reinforcement Learning | - | 0
Scalable Grid-Aware Dynamic Matching using Deep Reinforcement Learning | - | 0
Scalable Joint Learning of Wireless Multiple-Access Policies and their Signaling | - | 0
Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic | - | 0
Scalable Multi-Agent Offline Reinforcement Learning and the Role of Information | - | 0
Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward | - | 0
Scalable multi-agent reinforcement learning for distributed control of residential energy flexibility | - | 0
Scalable Multi-Agent Reinforcement Learning with General Utilities | - | 0
Scalable Multi-agent Reinforcement Learning for Factory-wide Dynamic Scheduling | - | 0
Scalable Multi-Task Imitation Learning with Autonomous Improvement | - | 0
Scalable Online Disease Diagnosis via Multi-Model-Fused Actor-Critic Reinforcement Learning | - | 0
Scalable photonic reinforcement learning by time-division multiplexing of laser chaos | - | 0
Scalable Planning and Learning Framework Development for Swarm-to-Swarm Engagement Problems | - | 0
Scalable Reinforcement-Learning-Based Neural Architecture Search for Cancer Deep Learning Research | - | 0
Scalable Reinforcement Learning-based Neural Architecture Search | - | 0
Page 244 of 303

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified