SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1067610700 of 15113 papers

TitleStatusHype
Temporal Difference Weighted Ensemble For Reinforcement Learning0
Temporal-Differential Learning in Continuous Environments0
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning0
Temporal-Logic-Based Intermittent, Optimal, and Safe Continuous-Time Learning for Trajectory Tracking0
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks0
Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions0
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning0
TemporalPaD: a reinforcement-learning framework for temporal feature representation and dimension reduction0
Temporal-related Convolutional-Restricted-Boltzmann-Machine capable of learning relational order via reinforcement learning procedure?0
Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning0
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning0
Tensor-based Cooperative Control for Large Scale Multi-intersection Traffic Signal Using Deep Reinforcement Learning and Imitation Learning0
TensorRL-QAS: Reinforcement learning with tensor networks for scalable quantum architecture search0
Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Learning0
Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning0
Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning0
Test-Cost Sensitive Methods for Identifying Nearby Points0
Testing match-3 video games with Deep Reinforcement Learning0
Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning0
TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning0
Text as Environment: A Deep Reinforcement Learning Text Readability Assessment Model0
Text-Based Interactive Recommendation via Constraint-Augmented Reinforcement Learning0
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Approaches0
TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis0
Text Generation with Efficient (Soft) Q-Learning0
Show:102550
← PrevPage 428 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified