SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 55765600 of 15113 papers

TitleStatusHype
Inverse Rational Control with Partially Observable Continuous Nonlinear Dynamics0
Inverse Reinforcement Learning: A Control Lyapunov Approach0
Inverse Reinforcement Learning Based Stochastic Driver Behavior Learning0
Inverse reinforcement learning conditioned on brain scan0
Inverse reinforcement learning for autonomous navigation via differentiable semantic mapping and planning0
Inverse Reinforcement Learning for Marketing0
Necessary and Sufficient Conditions for Inverse Reinforcement Learning of Bayesian Stopping Time Problems0
Inverse Reinforcement Learning for Strategy Identification0
Inverse Reinforcement Learning for Text Summarization0
Inverse Reinforcement Learning from a Gradient-based Learner0
Graph Inverse Reinforcement Learning from Diverse Videos0
Inverse Reinforcement Learning from Summary Data0
Inverse Reinforcement Learning in Large State Spaces via Function Approximation0
Inverse Reinforcement Learning in Swarm Systems0
Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees0
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities0
WFA-IRL: Inverse Reinforcement Learning of Autonomous Behaviors Encoded as Weighted Finite Automata0
Inverse Reinforcement Learning through Structured Classification0
Inverse Reinforcement Learning Under Noisy Observations0
Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling0
Inverse Reinforcement Learning via Deep Gaussian Process0
Inverse Reinforcement Learning via Matching of Optimality Profiles0
Inverse Reinforcement Learning with Conditional Choice Probabilities0
Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics0
Inverse Reinforcement Learning with Explicit Policy Estimates0
Show:102550
← PrevPage 224 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified