SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy: a decision-making strategy that maximizes long-term reward.
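To make "cumulative reward" concrete, here is a minimal sketch of one common way to score an episode: the discounted return, in which a reward received t steps in the future is weighted by gamma**t. The discount factor `gamma` and the function name `discounted_return` are illustrative assumptions and are not taken from this page; other formulations (for example, undiscounted or average reward) also exist.

```python
def discounted_return(rewards, gamma=0.99):
    """Return sum(gamma**t * r_t) over the rewards of one episode.

    `gamma` (the discount factor) is an assumed hyperparameter in [0, 1].
    """
    total = 0.0
    # Walk backwards so each step folds in the discounted future return:
    # G_t = r_t + gamma * G_{t+1}
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


if __name__ == "__main__":
    # Three steps, reward 1.0 each, gamma = 0.5: 1 + 0.5 + 0.25
    print(discounted_return([1.0, 1.0, 1.0], gamma=0.5))  # prints 1.75
```

A policy that is optimal in this sense is one whose expected discounted return is at least as high as that of any other policy from every starting state.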

Papers

Showing 10651–10700 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| TauRieL: Targeting Traveling Salesman Problem with a deep reinforcement learning inspired architecture | | 0 |
| Taxable Stock Trading with Deep Reinforcement Learning | | 0 |
| Taylor Expansion of Discount Factors | | 0 |
| Taylor Expansion Policy Optimization | | 0 |
| TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning | | 0 |
| T-Cell Receptor Optimization with Reinforcement Learning and Mutation Policies for Precesion Immunotherapy | | 0 |
| Teacher-Critical Training Strategies for Image Captioning | | 0 |
| Teacher-student curriculum learning for reinforcement learning | | 0 |
| Teaching a Robot to Walk Using Reinforcement Learning | | 0 |
| Teaching GANs to Sketch in Vector Format | | 0 |
| Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning | | 0 |
| Teaching robots to perceive time -- A reinforcement learning approach (Extended version) | | 0 |
| TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning | | 0 |
| Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning | | 0 |
| Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum | | 0 |
| Techniques for Automated Machine Learning | | 0 |
| Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning | | 0 |
| Temporal Abstraction in Reinforcement Learning with the Successor Representation | | 0 |
| Temporal-adaptive Hierarchical Reinforcement Learning | | 0 |
| Temporal Complementarity-Guided Reinforcement Learning for Image-to-Video Person Re-Identification | | 0 |
| Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation | | 0 |
| Temporal-Difference Learning to Assist Human Decision Making during the Control of an Artificial Limb | | 0 |
| Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning | | 0 |
| Temporal Difference Learning with Experience Replay | | 0 |
| Temporal Difference Models: Model-Free Deep RL for Model-Based Control | | 0 |
| Temporal Difference Weighted Ensemble For Reinforcement Learning | | 0 |
| Temporal-Differential Learning in Continuous Environments | | 0 |
| Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning | | 0 |
| Temporal-Logic-Based Intermittent, Optimal, and Safe Continuous-Time Learning for Trajectory Tracking | | 0 |
| Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks | | 0 |
| Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions | | 0 |
| Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning | | 0 |
| TemporalPaD: a reinforcement-learning framework for temporal feature representation and dimension reduction | | 0 |
| Temporal-related Convolutional-Restricted-Boltzmann-Machine capable of learning relational order via reinforcement learning procedure? | | 0 |
| Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning | | 0 |
| SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning | | 0 |
| Tensor-based Cooperative Control for Large Scale Multi-intersection Traffic Signal Using Deep Reinforcement Learning and Imitation Learning | | 0 |
| TensorRL-QAS: Reinforcement learning with tensor networks for scalable quantum architecture search | | 0 |
| Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Learning | | 0 |
| Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning | | 0 |
| Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning | | 0 |
| Test-Cost Sensitive Methods for Identifying Nearby Points | | 0 |
| Testing match-3 video games with Deep Reinforcement Learning | | 0 |
| Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning | | 0 |
| TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning | | 0 |
| Text as Environment: A Deep Reinforcement Learning Text Readability Assessment Model | | 0 |
| Text-Based Interactive Recommendation via Constraint-Augmented Reinforcement Learning | | 0 |
| Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Approaches | | 0 |
| TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis | | 0 |
| Text Generation with Efficient (Soft) Q-Learning | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |