SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the long-term reward.
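
The agent-environment loop described above can be illustrated with a minimal sketch. It uses tabular Q-learning with epsilon-greedy exploration, which is only one of many RL methods and is chosen here for brevity. The corridor environment, the function names (step, train, greedy_policy) and all hyperparameter values are assumptions made up for this illustration; none of them come from this page or from any listed paper.

```python
import random

# Hypothetical toy environment (an assumption for illustration):
# a corridor of 5 cells. The agent starts in cell 0; reaching the
# last cell yields reward +1 and ends the episode.
N_STATES = 5
ACTIONS = (-1, +1)  # move left, move right


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                a = rng.randrange(2)  # explore: random action
            else:
                # exploit: best-known action, ties broken at random
                best = max(q[state])
                a = rng.choice([i for i in range(2) if q[state][i] == best])
            next_state, reward, done = step(state, ACTIONS[a])
            # Bootstrapped target: immediate reward plus the discounted
            # value of the best action in the next state (0 if terminal).
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Pick the highest-valued action in every non-terminal state."""
    return [ACTIONS[max(range(2), key=lambda i: q[s][i])]
            for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
print(policy)
```

The discount factor gamma is what makes the learned values reflect long-term rather than immediate reward: only the final move is rewarded directly, yet the earlier moves toward the goal also acquire value through the bootstrapped target.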

Papers

Showing 8801-8850 of 15113 papers

Title | Status | Hype
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising |  | 0
Real-Time Integrated Dispatching and Idle Fleet Steering with Deep Reinforcement Learning for A Meal Delivery Platform |  | 0
Real-time Local Feature with Global Visual Information Enhancement |  | 0
Real-Time Measurement-Driven Reinforcement Learning Control Approach for Uncertain Nonlinear Systems |  | 0
Real-Time Model Calibration with Deep Reinforcement Learning |  | 0
Real-Time Network-Level Traffic Signal Control: An Explicit Multiagent Coordination Method |  | 0
Real-Time Optimal Design of Experiment for Parameter Identification of Li-Ion Cell Electrochemical Model |  | 0
Real-time Policy Distillation in Deep Reinforcement Learning |  | 0
Real-time scheduling of renewable power systems through planning-based reinforcement learning |  | 0
Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings |  | 0
The Smart Buildings Control Suite: A Diverse Open Source Benchmark to Evaluate and Scale HVAC Control Policies for Sustainability |  | 0
Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning |  | 0
Real-World Human-Robot Collaborative Reinforcement Learning |  | 0
Real-World Implementation of Reinforcement Learning Based Energy Coordination for a Cluster of Households |  | 0
Real World Offline Reinforcement Learning with Realistic Data Source |  | 0
Real-world Ride-hailing Vehicle Repositioning using Deep Reinforcement Learning |  | 0
Real-world Video Adaptation with Reinforcement Learning |  | 0
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network |  | 0
Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning |  | 0
Reasoning Beyond Limits: Advances and Open Problems for LLMs |  | 0
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL |  | 0
Reasoning with Exploration: An Entropy Perspective |  | 0
Reasoning With Hierarchical Symbols: Reclaiming Symbolic Policies For Visual Reinforcement Learning |  | 0
Reason-SVG: Hybrid Reward RL for Aha-Moments in Vector Graphics Generation |  | 0
Rebalanced Multimodal Learning with Data-aware Unimodal Sampling |  | 0
REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback |  | 0
REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous Manipulation |  | 0
Recall Traces: Backtracking Models for Efficient Reinforcement Learning |  | 0
Receding Horizon Differential Dynamic Programming |  | 0
Receding Horizon Inverse Reinforcement Learning |  | 0
Recent Advances in Reinforcement Learning in Finance |  | 0
Recent Advances of Deep Robotic Affordance Learning: A Reinforcement Learning Perspective |  | 0
Recent Progress in Energy Management of Connected Hybrid Electric Vehicles Using Reinforcement Learning |  | 0
Reinforcement Learning as a Robotics-Inspired Framework for Insect Navigation: From Spatial Representations to Neural Implementation |  | 0
Recognition Method of Important Words in Korean Text based on Reinforcement Learning |  | 0
Recommendation Fairness: From Static to Dynamic |  | 0
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning |  | 0
Recommendation System-based Upper Confidence Bound for Online Advertising |  | 0
Recommending the optimal policy by learning to act from temporal data |  | 0
Re-conceptualising the Language Game Paradigm in the Framework of Multi-Agent Reinforcement Learning |  | 0
RECONNAISSANCE FOR REINFORCEMENT LEARNING WITH SAFETY CONSTRAINTS |  | 0
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning |  | 0
ReCoRe: Regularized Contrastive Representation Learning of World Model |  | 0
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning |  | 0
Rectifying Reinforcement Learning for Reward Matching |  | 0
Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition |  | 0
Recurrent Attention Models for Depth-Based Person Identification |  | 0
Recurrent Control Nets for Deep Reinforcement Learning |  | 0
Recurrent Reinforcement Learning: A Hybrid Approach |  | 0
Recurrent Value Functions |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 |  | Unverified
2 | PPO | Mean Normalized Performance | 0.58 |  | Unverified