SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 62266250 of 15113 papers

TitleStatusHype
Leveraging human Domain Knowledge to model an empirical Reward function for a Reinforcement Learning problem0
Leveraging human knowledge in tabular reinforcement learning: A study of human subjects0
Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains0
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning0
Leveraging Offline Data in Online Reinforcement Learning0
Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments0
Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning Frameworks0
PerfRL: A Small Language Model Framework for Efficient Code Optimization0
Leveraging Reinforcement Learning for evaluating Robustness of KNN Search Algorithms0
Leveraging Reinforcement Learning in Red Teaming for Advanced Ransomware Attack Simulations0
Leveraging Reinforcement Learning Techniques for Effective Policy Adoption and Validation0
Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations0
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning0
Leveraging the Variance of Return Sequences for Exploration Policy0
Leveraging Topological Maps in Deep Reinforcement Learning for Multi-Object Navigation0
LgTS: Dynamic Task Sampling using LLM-generated sub-goals for Reinforcement Learning Agents0
Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems0
Lifelong Learning for Fog Load Balancing: A Transfer Learning Approach0
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines0
Lifelong Robotic Reinforcement Learning by Retaining Experiences0
Lifted Model Checking for Relational MDPs0
Lifting the veil on hyper-parameters for value-based deep reinforcement learning0
Policy Gradient Methods for Distortion Risk Measures0
lilGym: Natural Language Visual Reasoning with Reinforcement Learning0
LiMIIRL: Lightweight Multiple-Intent Inverse Reinforcement Learning0
Show:102550
← PrevPage 250 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified