SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 73517375 of 15113 papers

TitleStatusHype
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation0
Zero-Shot Reinforcement Learning on Graphs for Autonomous Exploration Under Uncertainty0
Zero-Shot Reinforcement Learning with Deep Attention Convolutional Neural Networks0
Zero-Shot Reward Specification via Grounded Natural Language0
Sim-to-Real Transfer of Robot Learning with Variable Length Inputs0
Zero-shot Text Classification via Reinforced Self-training0
Zero-Shot Transfer with Deictic Object-Oriented Representation in Reinforcement Learning0
Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots0
Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach0
Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer0
Zeroth-Order Optimization is Secretly Single-Step Policy Optimization0
Zeroth-Order Supervised Policy Improvement0
Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning0
Zooming for Efficient Model-Free Reinforcement Learning in Metric Spaces0
Reinforcement Learning in Low-Rank MDPs with Density Features0
Low-Resource Machine Translation based on Asynchronous Dynamic Programming0
Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling0
Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement Learning0
LPaintB: Learning to Paint from Self-Supervision0
LPMARL: Linear Programming based Implicit Task Assigment for Hiearchical Multi-Agent Reinforcement Learning0
LSD-Net: Look, Step and Detect for Joint Navigation and Multi-View Recognition with Deep Reinforcement Learning0
LSTD with Random Projections0
LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement0
Lyapunov-Based Reinforcement Learning for Decentralized Multi-Agent Control0
Lyapunov-Based Reinforcement Learning State Estimator0
Show:102550
← PrevPage 295 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified