SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent learns from interaction: each action it takes yields feedback in the form of a reward or a penalty. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
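
The agent, environment, reward, and policy described above can be sketched with tabular Q-learning on a toy problem. This is only an illustration under stated assumptions: the five-cell corridor environment, the `step` and `train` helpers, and the hyperparameters (`alpha`, `gamma`, `epsilon`) are hypothetical and are not taken from any paper listed on this page.

```python
import random

N_STATES = 5          # hypothetical corridor with cells 0..4; cell 4 is the goal
ACTIONS = (0, 1)      # 0 = step left, 1 = step right
GOAL = N_STATES - 1


def step(state, action):
    """Toy environment (an assumption): returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == GOAL
    return next_state, (1.0 if done else 0.0), done


def greedy_action(q, state, rng):
    """Pick an action with the highest Q-value, breaking ties at random."""
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, otherwise act greedily.
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = greedy_action(q, state, rng)
            next_state, reward, done = step(state, action)
            # Bootstrapped target: reward plus discounted best next value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q_table = train()
# Greedy policy for every non-goal cell, read off the learned Q-table.
policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(GOAL)]
print(policy)
```

In this sketch the only reward is at the goal cell, so the learned greedy policy should choose "right" (action 1) in every non-goal cell. The discount factor `gamma` is what makes cells closer to the goal more valuable, which is one simple way to express "long-term reward".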

Papers

Showing 10451–10475 of 15113 papers

Title | Status | Hype
Control of a Nature-inspired Scorpion using Reinforcement Learning | - | 0
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems | - | 0
Deep Reinforcement Learning for Contact-Rich Skills Using Compliant Movement Primitives | - | 0
How does the structure embedded in learning policy affect learning quadruped locomotion? | - | 0
Reinforcement Learning with Feedback-modulated TD-STDP | - | 0
Real-world Video Adaptation with Reinforcement Learning | - | 0
Sample Efficiency in Sparse Reinforcement Learning: Or Your Money Back | - | 0
Meta Reinforcement Learning-Based Lane Change Strategy for Autonomous Vehicles | - | 0
Query Focused Multi-document Summarisation of Biomedical Texts | Code | 0
The Advantage Regret-Matching Actor-Critic | - | 0
Query Focused Multi-document Summarisation of Biomedical Texts: Macquarie University and the Australian National University at BioASQ8b | Code | 0
Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI | - | 0
Controlling Level of Unconsciousness by Titrating Propofol with Deep Reinforcement Learning | - | 0
AutoFS: Automated Feature Selection via Diversity-aware Interactive Reinforcement Learning | Code | 0
Constrained Markov Decision Processes via Backward Value Functions | - | 0
Decision-making for Autonomous Vehicles on Highway: Deep Reinforcement Learning with Continuous Action Horizon | - | 0
Identifying Critical States by the Action-Based Variance of Expected Return | - | 0
Synthetic Sample Selection via Reinforcement Learning | - | 0
Selective Particle Attention: Visual Feature-Based Attention in Deep Reinforcement Learning | - | 0
t-Soft Update of Target Network for Deep Reinforcement Learning | - | 0
Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation | - | 0
Ensuring Monotonic Policy Improvement in Entropy-regularized Value-based Reinforcement Learning | - | 0
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing | - | 0
Dynamic Dispatching for Large-Scale Heterogeneous Fleet via Multi-agent Deep Reinforcement Learning | - | 0
Improved Memories Learning | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified