SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 25512575 of 15113 papers

TitleStatusHype
A Methodology for the Development of RL-Based Adaptive Traffic Signal Controllers0
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning0
Autotuning PID control using Actor-Critic Deep Reinforcement Learning0
Auto-tuning Distributed Stream Processing Systems using Reinforcement Learning0
A method for the online construction of the set of states of a Markov Decision Process using Answer Set Programming0
Autotelic Reinforcement Learning: Exploring Intrinsic Motivations for Skill Acquisition in Open-Ended Environments0
A Method for Fast Autonomy Transfer in Reinforcement Learning0
Adaptive perturbation adversarial training: based on reinforcement learning0
Conversational Question Answering with Reformulations over Knowledge Graph0
Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications0
A Meta-Reinforcement Learning Approach to Process Control0
Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming0
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards0
Adaptive patch foraging in deep reinforcement learning agents0
Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER0
Convergent and Efficient Deep Q Learning Algorithm0
Autonomous Warehouse Robot using Deep Q-Learning0
A Comparative Analysis of Deep Reinforcement Learning-enabled Freeway Decision-making for Automated Vehicles0
myGym: Modular Toolkit for Visuomotor Robotic Tasks0
Convergent NMPC-based Reinforcement Learning Using Deep Expected Sarsa and Nonlinear Temporal Difference Learning0
Autonomous Voltage Control for Grid Operation Using Deep Reinforcement Learning0
Autonomous Vehicle Fleet Coordination With Deep Reinforcement Learning0
Task-Agnostic Learning to Accomplish New Tasks0
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review0
Autonomous UAV Navigation: A DDPG-based Deep Reinforcement Learning Approach0
Show:102550
← PrevPage 103 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified