SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 87518775 of 15113 papers

TitleStatusHype
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning0
R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery0
RAAM: The Benefits of Robustness in Approximating Aggregated MDPs in Reinforcement Learning0
RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning0
RACE: A Reinforcement Learning Framework for Improved Adaptive Control of NoC Channel Buffers0
Racing Towards Reinforcement Learning based control of an Autonomous Formula SAE Car0
RADARS: Memory Efficient Reinforcement Learning Aided Differentiable Neural Architecture Search0
Radiology Report Generation via Multi-objective Preference Optimization0
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning0
RAIDER: Reinforcement-aided Spear Phishing Detector0
Raijū: Reinforcement Learning-Guided Post-Exploitation for Automating Security Assessment of Network Systems0
RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning0
Railway Operation Rescheduling System via Dynamic Simulation and Reinforcement Learning0
Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits0
Random Copolymer inverse design system orienting on Accurate discovering of Antimicrobial peptide-mimetic copolymers0
Random Ensemble Reinforcement Learning for Traffic Signal Control0
Randomized Policy Learning for Continuous State and Action MDPs0
Random Latent Exploration for Deep Reinforcement Learning0
Random Network Distillation as a Diversity Metric for Both Image and Text Generation0
RangL: A Reinforcement Learning Competition Platform0
Ranking Items in Large-Scale Item Search Engines with Reinforcement Learning0
Ranking sentences from product description & bullets for better search0
Rapid Learning of Spatial Representations for Goal-Directed Navigation Based on a Novel Model of Hippocampal Place Fields0
Rapid Locomotion via Reinforcement Learning0
Rapidly Personalizing Mobile Health Treatment Policies with Limited Data0
Show:102550
← PrevPage 351 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified