SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 39013925 of 15113 papers

TitleStatusHype
Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement LearningCode0
Flight Controller Synthesis Via Deep Reinforcement LearningCode0
Certification of Iterative Predictions in Bayesian Neural NetworksCode0
Semantic RL with Action Grammars: Data-Efficient Learning of Hierarchical Task AbstractionsCode0
Safe Exploration Method for Reinforcement Learning under Existence of DisturbanceCode0
Flexible Option LearningCode0
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement LearningCode0
Centralized Model and Exploration Policy for Multi-Agent RLCode0
CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement LearningCode0
Safe Model-based Reinforcement Learning with Stability GuaranteesCode0
Flappy Hummingbird: An Open Source Dynamic Simulation of Flapping Wing Robots and AnimalsCode0
A Novel Update Mechanism for Q-Networks Based On Extreme Learning MachinesCode0
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with DemonstrationsCode0
Fleet Control using Coregionalized Gaussian Process Policy IterationCode0
Foresee then Evaluate: Decomposing Value Estimation with Latent Future PredictionCode0
Safe Policy Optimization with Local Generalized Linear Function ApproximationsCode0
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation ProblemCode0
Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approachCode0
Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement LearningCode0
A novel policy for pre-trained Deep Reinforcement Learning for Speech Emotion RecognitionCode0
Financial Trading as a Game: A Deep Reinforcement Learning ApproachCode0
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement LearningCode0
Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement LearningCode0
FeUdal Networks for Hierarchical Reinforcement LearningCode0
Feudal Graph Reinforcement LearningCode0
Show:102550
← PrevPage 157 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified