SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 55015525 of 15113 papers

TitleStatusHype
Interactive Reinforcement Learning for Feature Selection with Decision Tree in the Loop0
Interactive Reinforcement Learning for Object Grounding via Self-Talking0
Interactive Reinforcement Learning for Table Balancing Robot0
Interactive Reinforcement Learning with Dynamic Reuse of Prior Knowledge from Human/Agent's Demonstration0
Interactive Search Based on Deep Reinforcement Learning0
Interactive Spoken Content Retrieval by Deep Reinforcement Learning0
Interactive Teaching Algorithms for Inverse Reinforcement Learning0
Interactive Video Corpus Moment Retrieval using Reinforcement Learning0
Intercepting Unauthorized Aerial Robots in Controlled Airspace Using Reinforcement Learning0
Interleaved Reasoning for Large Language Models via Reinforcement Learning0
Internal Model from Observations for Reward Shaping0
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning0
Interpretability via Model Extraction0
Interpretable and Effective Reinforcement Learning for Attacking against Graph-based Rumor Detection0
Interpretable and Efficient Data-driven Discovery and Control of Distributed Systems0
Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction0
Interpretable Control by Reinforcement Learning0
Interpretable Deep Reinforcement Learning for Green Security Games with Real-Time Information0
Interpretable Disease Prediction based on Reinforcement Path Reasoning over Knowledge Graphs0
Interpretable Dynamics Models for Data-Efficient Reinforcement Learning0
Interpretable end-to-end Neurosymbolic Reinforcement Learning agents0
Interpretable Hidden Markov Model-Based Deep Reinforcement Learning Hierarchical Framework for Predictive Maintenance of Turbofan Engines0
Interpretable Meta-Reinforcement Learning with Actor-Critic Method0
Interpretable Model-based Hierarchical Reinforcement Learning using Inductive Logic Programming0
Interpretable Multi-Objective Reinforcement Learning through Policy Orchestration0
Show:102550
← PrevPage 221 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified