SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal of reinforcement learning is to find an optimal policy, that is, a decision-making strategy that maximizes the long-term reward.
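The interaction loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the page does not name any particular algorithm, so tabular Q-learning is swapped in as one common choice. The five-state corridor environment, the function names (`step`, `train`, `greedy_policy`), and the hyperparameter values are all hypothetical.

```python
import random

N_STATES = 5      # hypothetical corridor: states 0..4, state 4 is the rewarding terminal state
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment (an assumption for illustration): returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration and random tie-breaking."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap on episode length
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore
            else:
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])  # exploit
            next_state, reward, done = step(state, action)
            # Feedback from the environment: reward plus discounted estimate of future reward.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Read off the learned decision-making strategy for each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
```

In this toy setting the only reward sits at the right end of the corridor, so the learned policy should prefer action 1 (move right) in every non-terminal state. The discount factor `gamma` is what makes the agent value reward that arrives several steps later.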

Papers

Showing 10351–10375 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Reinforcement Learning-based N-ary Cross-Sentence Relation Extraction | | 0 |
| Lineage Evolution Reinforcement Learning | | 0 |
| Complementary Meta-Reinforcement Learning for Fault-Adaptive Control | | 0 |
| Graph neural induction of value iteration | | 0 |
| Inverse Rational Control with Partially Observable Continuous Nonlinear Dynamics | | 0 |
| Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games | Code | 0 |
| Motion Planning by Reinforcement Learning for an Unmanned Aerial Vehicle in Virtual Open Space with Static Obstacles | | 0 |
| Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey | | 0 |
| ReLeaSER: A Reinforcement Learning Strategy for Optimizing Utilization Of Ephemeral Cloud Resources | | 0 |
| Probabilistic Machine Learning for Healthcare | | 0 |
| Robust Reinforcement Learning-based Autonomous Driving Agent for Simulation and Real World | | 0 |
| What is the Reward for Handwriting? -- Handwriting Generation by Imitation Learning | | 0 |
| Demand Responsive Dynamic Pricing Framework for Prosumer Dominated Microgrids using Multiagent Reinforcement Learning | | 0 |
| A Multi-Agent Deep Reinforcement Learning Approach for a Distributed Energy Marketplace in Smart Grids | | 0 |
| Is Q-Learning Provably Efficient? An Extended Analysis | | 0 |
| A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to District Demand Side Management through CityLearn | Code | 0 |
| Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management | | 0 |
| Deep Reinforcement Learning for On-line Dialogue State Tracking | | 0 |
| SUMBT+LaRL: Effective Multi-domain End-to-end Neural Task-oriented Dialog System | | 0 |
| Reinforcement Learning Approaches in Social Robotics | | 0 |
| Mobile Cellular-Connected UAVs: Reinforcement Learning for Sky Limits | | 0 |
| Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems | Code | 0 |
| Deep Reinforcement Learning Methods for Structure-Guided Processing Path Optimization | Code | 0 |
| Dynamic Horizon Value Estimation for Model-based Reinforcement Learning | | 0 |
| Human Engagement Providing Evaluative and Informative Advice for Interactive Reinforcement Learning | | 0 |
Page 415 of 605

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |