SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1210112150 of 15113 papers

TitleStatusHype
Safety-Guided Deep Reinforcement Learning via Online Gaussian Process Estimation0
Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies0
Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning0
Black-Box Safety Validation of Autonomous Systems: A Multi-Fidelity Reinforcement Learning Approach0
Safety Verification of Model Based Reinforcement Learning Controllers0
SaFormer: A Conditional Sequence Modeling Approach to Offline Safe Reinforcement Learning0
SA-IGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes0
SAINT-ACC: Safety-Aware Intelligent Adaptive Cruise Control for Autonomous Vehicles Using Deep Reinforcement Learning0
Saliency-based Sequential Image Attention with Multiset Prediction0
SaLinA: Sequential Learning of Agents0
SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance0
Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functions0
Sample-based Distributional Policy Gradient0
Sample Complexity Bounds for Two Timescale Value-based Reinforcement Learning Algorithms0
Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning0
Sample Complexity of Estimating the Policy Gradient for Nearly Deterministic Dynamical Systems0
Sample Complexity of Kernel-Based Q-Learning0
Sample Complexity of Multi-task Reinforcement Learning0
Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds0
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes0
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points0
Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles0
Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning0
Sample Efficiency in Sparse Reinforcement Learning: Or Your Money Back0
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management0
Sample-efficient Adversarial Imitation Learning from Observation0
Sample-Efficient and Safe Deep Reinforcement Learning via Reset Deep Ensemble Agents0
Curriculum Reinforcement Learning for Complex Reward Functions0
Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces0
Sample-efficient Deep Reinforcement Learning for Dialog Control0
Sample Efficient Deep Reinforcement Learning via Local Planning0
Sample-efficient Deep Reinforcement Learning with Imaginary Rollouts for Human-Robot Interaction0
Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems0
Sample Efficient Feature Selection for Factored MDPs0
Physics-informed Imitative Reinforcement Learning for Real-world Driving0
Sample-Efficient Learning of Nonprehensile Manipulation Policies via Physics-Based Informed State Distributions0
Sample-Efficient Multi-Agent Reinforcement Learning with Demonstrations for Flocking Control0
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks0
Sample-efficient policy learning in multi-agent Reinforcement Learning via meta-learning0
Sample efficient Quality Diversity for neural continuous control0
Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors0
Sample Efficient Reinforcement Learning by Automatically Learning to Compose Subtasks0
Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model0
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency0
Sample-efficient Reinforcement Learning in Robotic Table Tennis0
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity0
Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information0
Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting0
Sample-Efficient Reinforcement Learning of Undercomplete POMDPs0
Sample-Efficient Reinforcement Learning of Partially Observable Markov Games0
Show:102550
← PrevPage 243 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified