SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 51015150 of 15113 papers

TitleStatusHype
Holistic Deep-Reinforcement-Learning-based Training of Autonomous Navigation Systems0
HoME: a Household Multimodal Environment0
Homotopy Based Reinforcement Learning with Maximum Entropy for Autonomous Air Combat0
Hope For The Best But Prepare For The Worst: Cautious Adaptation In RL Agents0
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare0
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform0
Horizon-Free Regret for Linear Markov Decision Processes0
Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes0
Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies0
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs0
Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning0
How an Electrical Engineer Became an Artificial Intelligence Researcher, a Multiphase Active Contours Analysis0
How Can Creativity Occur in Multi-Agent Systems?0
How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study0
How does AI play football? An analysis of RL and real-world football strategies0
How does the structure embedded in learning policy affect learning quadruped locomotion?0
How Does Return Distribution in Distributional Reinforcement Learning Help Optimization?0
How do Offline Measures for Exploration in Reinforcement Learning behave?0
How hard is my MDP?" The distribution-norm to the rescue"0
How many weights are enough : can tensor factorization learn efficient policies ?0
How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning0
How Much Do Unstated Problem Constraints Limit Deep Robotic Reinforcement Learning?0
How the level sampling process impacts zero-shot generalisation in deep reinforcement learning0
How to Combine Tree-Search Methods in Reinforcement Learning0
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies0
How to Enable Uncertainty Estimation in Proximal Policy Optimization0
How to Learn from Risk: Explicit Risk-Utility Reinforcement Learning for Efficient and Safe Driving Strategies0
How to Leverage Unlabeled Data in Offline Reinforcement Learning0
How to Organize your Deep Reinforcement Learning Agents: The Importance of Communication Topology0
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation0
Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks0
How To Train Your HERON0
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned0
How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 1: A Paradigmatic Theory0
How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 2: Method and Applications0
How You Act Tells a Lot: Privacy-Leakage Attack on Deep Reinforcement Learning0
HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback0
Human-Agent Cooperation in Bridge Bidding0
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning0
Human AI interaction loop training: New approach for interactive reinforcement learning0
Human and Multi-Agent collaboration in a human-MARL teaming framework0
Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning0
Human-centered collaborative robots with deep reinforcement learning0
Human-centered mechanism design with Democratic AI0
Human-centric Dialog Training via Offline Reinforcement Learning0
Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment0
Human Engagement Providing Evaluative and Informative Advice for Interactive Reinforcement Learning0
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text0
Human-Interactive Subgoal Supervision for Efficient Inverse Reinforcement Learning0
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems0
Show:102550
← PrevPage 103 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified