SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 66266650 of 15113 papers

TitleStatusHype
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning0
Total stochastic gradient algorithms and applications in reinforcement learning0
To the Noise and Back: Diffusion for Shared Autonomy0
ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving0
Tournament selection in zeroth-level classifier systems based on average reward reinforcement learning0
Toward a Reinforcement-Learning-Based System for Adjusting Medication to Minimize Speech Disfluency0
Toward Compositional Generalization in Object-Oriented World Modeling0
Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping0
Toward Dependency Dynamics in Multi-Agent Reinforcement Learning for Traffic Signal Control0
Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models0
Toward Enhanced Reinforcement Learning-Based Resource Management via Digital Twin: Opportunities, Applications, and Challenges0
Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control0
Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees0
Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making0
Toward Pareto Efficient Fairness-Utility Trade-off inRecommendation through Reinforcement Learning0
Toward Real-Time Decentralized Reinforcement Learning using Finite Support Basis Functions0
Toward Reliable Designs of Data-Driven Reinforcement Learning Tracking Control for Euler-Lagrange Systems0
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning0
Towards a Better Understanding of Representation Dynamics under TD-learning0
Towards a Deep Reinforcement Learning Approach for Tower Line Wars0
Towards a Formal Theory of the Need for Competence via Computational Intrinsic Motivation0
Towards a Fully Autonomous UAV Controller for Moving Platform Detection and Landing0
Towards a General Framework for ML-based Self-tuning Databases0
Towards AI-controlled FES-restoration of arm movements: Controlling for progressive muscular fatigue with Gaussian state-space models0
Towards AI-controlled FES-restoration of arm movements: neuromechanics-based reinforcement learning for 3-D reaching0
Show:102550
← PrevPage 266 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified