SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 67516800 of 15113 papers

TitleStatusHype
Reinforcement Learning based Path Exploration for Sequential Explainable Recommendation0
On the (In)Tractability of Reinforcement Learning for LTL Objectives0
Reversible Action Design for Combinatorial Optimization with ReinforcementLearning0
Reinforcement Learning for Volt-Var Control: A Novel Two-stage Progressive Training Strategy0
Symbol-Based Over-the-Air Digital Predistortion Using Reinforcement Learning0
Semantic-Aware Collaborative Deep Reinforcement Learning Over Wireless Cellular Networks0
Inducing Functions through Reinforcement Learning without Task Specification0
Fixed Points in Cyber Space: Rethinking Optimal Evasion Attacks in the Age of AI-NIDS0
Independent Learning in Stochastic Games0
Generating GPU Compiler Heuristics using Reinforcement Learning0
Efficient Bayesian Inverse Reinforcement Learning via Conditional Kernel Density Estimation0
Real-World Dexterous Object Manipulation based Deep Reinforcement LearningCode0
Multi-agent Bayesian Deep Reinforcement Learning for Microgrid Energy Management under Communication Failures0
Reinforcement Learning for Few-Shot Text Generation AdaptationCode0
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor RectificationCode1
UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning0
Off-Policy Correction For Multi-Agent Reinforcement LearningCode0
An application of reinforcement learning to residential energy storage under real-time pricing0
A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning0
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven ExplorationCode1
A Hybrid Neuro-Symbolic approach for Text-Based Games using Inductive Logic Programming0
Reinforcement Learning with General LTL Objectives is Intractable0
Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning0
Vulcan: Solving the Steiner Tree Problem with Graph Neural Networks and Deep Reinforcement Learning0
Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation0
Towards Safe, Explainable, and Regulated Autonomous Driving0
Triples-to-Text Generation with Reinforcement Learning Based Graph-augmented Neural Networks0
HeterPS: Distributed Deep Learning With Reinforcement Learning Based Scheduling in Heterogeneous Environments0
Explainable Biomedical Recommendations via Reinforcement Learning Reasoning on Knowledge Graphs0
An Improved Reinforcement Learning Model Based on Sentiment Analysis0
Learn Quasi-stationary Distributions of Finite State Markov Chain0
Machine Learning for Mechanical Ventilation Control (Extended Abstract)0
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning0
Reinforcement Learning with Adaptive Curriculum Dynamics Randomization for Fault-Tolerant Robot Control0
Generalized Decision Transformer for Offline Hindsight Information MatchingCode1
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning0
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines0
Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI TeammatesCode0
Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement LearningCode0
Self-Learning Tuning for Post-Silicon Validation0
SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition0
Post-processing Networks: A Method for Optimizing Pipeline Task-oriented Dialogue Systems using Reinforcement Learning0
Empathetic Persuasion: Reinforcing Empathy and Persuasiveness in Dialogue Systems0
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems0
A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization0
MAD for Robust Reinforcement Learning in Machine Translation0
Probing the Robustness of Trained Metrics for Conversational Dialogue Systems0
Deep Reinforcement Learning for Entity Alignment0
Improving Learning from Demonstrations by Learning from Experience0
Compressive Features in Offline Reinforcement Learning for Recommender Systems0
Show:102550
← PrevPage 136 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified