SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the long-term reward.
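The interaction loop described above can be sketched with tabular Q-learning, one of many RL methods and not one this page singles out. Everything in the sketch is an illustrative assumption: the five-state chain environment, the names `step`, `train`, and `greedy_policy`, and the hyperparameter values are all hypothetical.

```python
import random

N_STATES = 5      # states 0..4; state 4 is terminal (hypothetical toy setup)
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment: reward 1.0 only on reaching the last state."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn action values by interacting with the toy environment."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: explore occasionally (and on ties), else act greedily
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.choice(ACTIONS)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Q-learning update: move toward reward plus discounted best next value
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Pick the higher-valued action in each non-terminal state."""
    return [0 if q[s][0] > q[s][1] else 1 for s in range(N_STATES - 1)]
```

On this toy chain, `greedy_policy(train())` should choose action 1 (move right) in every non-terminal state, since that is the only route to the reward. Realistic problems typically need function approximation and far more careful exploration than this sketch uses.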

Papers

Showing 11001-11025 of 15113 papers

Title | Status | Hype
Adaptive Dialog Policy Learning with Hindsight and User Modeling | - | 0
Reinforcement Learning with Feedback Graphs | - | 0
Safe Reinforcement Learning through Meta-learned Instincts | - | 0
Robotic Arm Control and Task Training through Deep Reinforcement Learning | - | 0
Reinforcement Learning for UAV Autonomous Navigation, Mapping and Target Detection | - | 0
Gifting in multi-agent reinforcement learning | Code | 0
A Survey on Dialog Management: Recent Advances and Challenges | - | 0
Generalized Planning With Deep Reinforcement Learning | - | 0
Discrete-to-Deep Supervised Policy Learning | Code | 0
Formal Policy Synthesis for Continuous-Space Systems via Reinforcement Learning | - | 0
Generalized Reinforcement Meta Learning for Few-Shot Optimization | - | 0
Hierarchical Decomposition of Nonlinear Dynamics and Control for System Identification and Policy Distillation | - | 0
Reward Constrained Interactive Recommendation with Natural Language Feedback | - | 0
Multiagent Value Iteration Algorithms in Dynamic Programming and Reinforcement Learning | - | 0
Setting up experimental Bell test with reinforcement learning | - | 0
Noise Pollution in Hospital Readmission Prediction: Long Document Classification with Reinforcement Learning | - | 0
Multi-agent Reinforcement Learning for Decentralized Stable Matching | - | 0
Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach | - | 0
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey | - | 0
Enhancing Text-based Reinforcement Learning Agents with Commonsense Knowledge | - | 0
Learning the Arrow of Time for Problems in Reinforcement Learning | - | 0
AMRL: Aggregated Memory For Reinforcement Learning | - | 0
Learning Heuristics for Quantified Boolean Formulas through Reinforcement Learning | - | 0
Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning | - | 0
Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified