SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 57015725 of 15113 papers

TitleStatusHype
Sample-efficient policy learning in multi-agent Reinforcement Learning via meta-learning0
Sample efficient Quality Diversity for neural continuous control0
Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors0
Sample Efficient Reinforcement Learning by Automatically Learning to Compose Subtasks0
Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model0
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency0
Sample-efficient Reinforcement Learning in Robotic Table Tennis0
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity0
Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information0
Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting0
Sample-Efficient Reinforcement Learning of Undercomplete POMDPs0
Sample-Efficient Reinforcement Learning of Partially Observable Markov Games0
Sample-Efficient Reinforcement Learning of Koopman eNMPC0
Sample-efficient reinforcement learning using deep Gaussian processes0
Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation0
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation0
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion0
Sample Efficient Reinforcement Learning with REINFORCE0
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost0
Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty0
Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions0
Sample Efficient Social Navigation Using Inverse Reinforcement Learning0
Sampling from Energy-based Policies using Diffusion0
Sampling Strategies for GAN Synthetic Data0
Sampling Through the Lens of Sequential Decision Making0
Show:102550
← PrevPage 229 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified