SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 46514675 of 15113 papers

TitleStatusHype
Reinforcement Learning for Solving the Vehicle Routing ProblemCode0
Multiagent Reinforcement Learning based Energy Beamforming ControlCode0
Predictive World Models from Real-World Partial ObservationsCode0
Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models0
Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks0
Reinforced Multi-task Approach for Multi-hop Question Generation0
Reinforced Pedestrian Attribute Recognition with Group Optimization Reward0
Reinforced Self-Training (ReST) for Language Modeling0
Reinforced Training Data Selection for Domain Adaptation0
Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API0
Reinforced Video Captioning with Entailment Rewards0
Reinforced Workload Distribution Fairness0
Reinforcement and Imitation Learning via Interactive No-Regret Learning0
Reinforcement-based frugal learning for satellite image change detection0
Reinforcement Evolutionary Learning Method for self-learning0
Reinforcement Explanation Learning0
Reinforcement Leaning for Infinite-Dimensional Systems0
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound0
Reinforcement Learning: a Comparison of UCB Versus Alternative Adaptive Policies0
Reinforcement Learning Agent Design and Optimization with Bandwidth Allocation Model0
Reinforcement Learning Agents for Ubisoft's Roller Champions0
Reinforcement Learning Agent Training with Goals for Real World Tasks0
Reinforcement Learning Algorithm for Traffic Steering in Heterogeneous Network0
Reinforcement Learning Algorithms: An Overview and Classification0
Reinforcement Learning Algorithm Selection0
Show:102550
← PrevPage 187 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified