SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1065110675 of 15113 papers

TitleStatusHype
TauRieL: Targeting Traveling Salesman Problem with a deep reinforcement learning inspired architecture0
Taxable Stock Trading with Deep Reinforcement Learning0
Taylor Expansion of Discount Factors0
Taylor Expansion Policy Optimization0
TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning0
T-Cell Receptor Optimization with Reinforcement Learning and Mutation Policies for Precesion Immunotherapy0
Teacher-Critical Training Strategies for Image Captioning0
Teacher-student curriculum learning for reinforcement learning0
Teaching a Robot to Walk Using Reinforcement Learning0
Teaching GANs to Sketch in Vector Format0
Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning0
Teaching robots to perceive time -- A reinforcement learning approach (Extended version)0
TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning0
Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning0
Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum0
Techniques for Automated Machine Learning0
Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning0
Temporal Abstraction in Reinforcement Learning with the Successor Representation0
Temporal-adaptive Hierarchical Reinforcement Learning0
Temporal Complementarity-Guided Reinforcement Learning for Image-to-Video Person Re-Identification0
Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation0
Temporal-Difference Learning to Assist Human Decision Making during the Control of an Artificial Limb0
Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning0
Temporal Difference Learning with Experience Replay0
Temporal Difference Models: Model-Free Deep RL for Model-Based Control0
Show:102550
← PrevPage 427 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified