SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1085110875 of 15113 papers

TitleStatusHype
To bootstrap or to rollout? An optimal and adaptive interpolation0
To Combine or Not To Combine? A Rainbow Deep Reinforcement Learning Agent for Dialog Policies0
Toddler-Guidance Learning: Impacts of Critical Period on Multimodal AI Agents0
Together We Rise: Optimizing Real-Time Multi-Robot Task Allocation using Coordinated Heterogeneous Plays0
Toggling a Genetic Switch Using Reinforcement Learning0
Token-Efficient RL for LLM Reasoning0
Token-Mol 1.0: Tokenized drug design with large language model0
Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems0
TOMA: Topological Map Abstraction for Reinforcement Learning0
Toolpath design for additive manufacturing using deep reinforcement learning0
Topic-Preserving Synthetic News Generation: An Adversarial Deep Reinforcement Learning Approach0
To Risk or Not to Risk: Learning with Risk Quantification for IoT Task Offloading in UAVs0
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning0
Total stochastic gradient algorithms and applications in reinforcement learning0
To the Noise and Back: Diffusion for Shared Autonomy0
ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving0
Tournament selection in zeroth-level classifier systems based on average reward reinforcement learning0
Toward a Reinforcement-Learning-Based System for Adjusting Medication to Minimize Speech Disfluency0
Toward Compositional Generalization in Object-Oriented World Modeling0
Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping0
Toward Dependency Dynamics in Multi-Agent Reinforcement Learning for Traffic Signal Control0
Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models0
Toward Enhanced Reinforcement Learning-Based Resource Management via Digital Twin: Opportunities, Applications, and Challenges0
Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control0
Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees0
Show:102550
← PrevPage 435 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified