SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1185111900 of 15113 papers

TitleStatusHype
RL-GRIT: Reinforcement Learning for Grammar Inference0
RL-Guided MPC for Autonomous Greenhouse Control0
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models0
RLINK: Deep Reinforcement Learning for User Identity Linkage0
RLInspect: An Interactive Visual Approach to Assess Reinforcement Learning Algorithm0
RLIRank: Learning to Rank with Reinforcement Learning for Dynamic Search0
LIMIS: Locally Interpretable Modeling using Instance-wise Subsampling0
RL-MD: A Novel Reinforcement Learning Approach for DNA Motif Discovery0
RL-MILP Solver: A Reinforcement Learning Approach for Solving Mixed-Integer Linear Programs with Graph Neural Networks0
RL + Model-based Control: Using On-demand Optimal Control to Learn Versatile Legged Locomotion0
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems0
RLocator: Reinforcement Learning for Bug Localization0
RLOC: Neurobiologically Inspired Hierarchical Reinforcement Learning Algorithm for Continuous Control of Nonlinear Dynamical Systems0
RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning0
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN0
RL-PINNs: Reinforcement Learning-Driven Adaptive Sampling for Efficient Training of PINNs0
RL-QN: A Reinforcement Learning Framework for Optimal Control of Queueing Systems0
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression0
RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception0
RL-Selector: Reinforcement Learning-Guided Data Selection via Redundancy Assessment0
RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation0
RLTP: Reinforcement Learning to Pace for Delayed Impression Modeling in Preloaded Ads0
RL with KL penalties is better viewed as Bayesian inference0
RLZero: Direct Policy Inference from Language Without In-Domain Supervision0
Efficient Reinforcement Learning Development with RLzoo0
RMIX: Learning Risk-Sensitive Policies forCooperative Reinforcement Learning Agents0
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents0
RMIX: Risk-Sensitive Multi-Agent Reinforcement Learning0
ROAD: Responsibility-Oriented Reward Design for Reinforcement Learning in Autonomous Driving0
Roadside Units Assisted Localized Automated Vehicle Maneuvering: An Offline Reinforcement Learning Approach0
ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR0
Robo-Advising: Enhancing Investment with Inverse Optimization and Deep Reinforcement Learning0
Robo-advising: Learning Investors' Risk Preferences via Portfolio Choices0
RoboAssembly: Learning Generalizable Furniture Assembly Policy in a Novel Multi-robot Contact-rich Simulation Environment0
Robot Deformable Object Manipulation via NMPC-generated Demonstrations in Deep Reinforcement Learning0
Robot gains Social Intelligence through Multimodal Deep Reinforcement Learning0
Robotic Arm Control and Task Training through Deep Reinforcement Learning0
Robotic Grasp Manipulation Using Evolutionary Computing and Deep Reinforcement Learning0
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations0
Robotic Offline RL from Internet Videos via Value-Function Pre-Training0
Robotic Search & Rescue via Online Multi-task Reinforcement Learning0
Robotic self-representation improves manipulation skills and transfer learning0
Robotic Table Tennis with Model-Free Reinforcement Learning0
Robotic Table Wiping via Reinforcement Learning and Whole-body Trajectory Optimization0
Robotic Tracking Control with Kernel Trick-based Reinforcement Learning0
Robot in a China Shop: Using Reinforcement Learning for Location-Specific Navigation Behaviour0
Robot Learning of Mobile Manipulation with Reachability Behavior Priors0
Robot Navigation with Reinforcement Learned Path Generation and Fine-Tuned Motion Control0
Robot path planning using deep reinforcement learning0
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination0
Show:102550
← PrevPage 238 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified