SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 67766800 of 15113 papers

TitleStatusHype
Towards Safe, Explainable, and Regulated Autonomous Driving0
Triples-to-Text Generation with Reinforcement Learning Based Graph-augmented Neural Networks0
HeterPS: Distributed Deep Learning With Reinforcement Learning Based Scheduling in Heterogeneous Environments0
Explainable Biomedical Recommendations via Reinforcement Learning Reasoning on Knowledge Graphs0
An Improved Reinforcement Learning Model Based on Sentiment Analysis0
Learn Quasi-stationary Distributions of Finite State Markov Chain0
Machine Learning for Mechanical Ventilation Control (Extended Abstract)0
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning0
Reinforcement Learning with Adaptive Curriculum Dynamics Randomization for Fault-Tolerant Robot Control0
Generalized Decision Transformer for Offline Hindsight Information MatchingCode1
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning0
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines0
Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI TeammatesCode0
Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement LearningCode0
Self-Learning Tuning for Post-Silicon Validation0
SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition0
Post-processing Networks: A Method for Optimizing Pipeline Task-oriented Dialogue Systems using Reinforcement Learning0
Empathetic Persuasion: Reinforcing Empathy and Persuasiveness in Dialogue Systems0
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems0
A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization0
MAD for Robust Reinforcement Learning in Machine Translation0
Probing the Robustness of Trained Metrics for Conversational Dialogue Systems0
Deep Reinforcement Learning for Entity Alignment0
Improving Learning from Demonstrations by Learning from Experience0
Compressive Features in Offline Reinforcement Learning for Recommender Systems0
Show:102550
← PrevPage 272 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified