SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1010110150 of 15113 papers

TitleStatusHype
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial0
Motion Prediction on Self-driving Cars: A Review0
Sample-efficient Reinforcement Learning in Robotic Table Tennis0
The Value Equivalence Principle for Model-Based Reinforcement Learning0
Playing optical tweezers with deep reinforcement learning: in virtual, physical and augmented environments0
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping0
LBGP: Learning Based Goal Planning for Autonomous Following in Front0
A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities0
Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks0
Generative Inverse Deep Reinforcement Learning for Online Recommendation0
XCSF for Automatic Test Case PrioritizationCode0
Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods0
Online Observer-Based Inverse Reinforcement Learning0
Control with adaptive Q-learningCode0
Distributional Reinforcement Learning for mmWave Communications with Intelligent Reflectors on a UAV0
Deep Reinforcement Learning Based Dynamic Route Planning for Minimizing Travel Time0
Differentiable Physics Models for Real-world Offline Model-based Reinforcement Learning0
Incorporating Rivalry in Reinforcement Learning for a Competitive Game0
Causal Campbell-Goodhart's law and Reinforcement LearningCode0
Cooperative Heterogeneous Deep Reinforcement Learning0
A Variant of the Wang-Foster-Kakade Lower Bound for the Discounted Setting0
Exact Asymptotics for Linear Quadratic Adaptive ControlCode0
Fast Reinforcement Learning with Incremental Gaussian Mixture Models0
Depth Self-Optimized Learning Toward Data ScienceCode0
Information-theoretic Task Selection for Meta-Reinforcement Learning0
Instance based Generalization in Reinforcement LearningCode0
Interpreting Graph Drawing with Multi-Agent Reinforcement Learning0
Reinforcement Learning of Structured Control for Linear Systems with Unknown State Matrix0
NEARL: Non-Explicit Action Reinforcement Learning for Robotic Control0
Multi-Agent Reinforcement Learning for Visibility-based Persistent MonitoringCode0
Sample-efficient reinforcement learning using deep Gaussian processes0
Shaping Rewards for Reinforcement Learning with Imperfect Demonstrations using Generative Models0
Reinforcement Learning with Efficient Active Feature Acquisition0
Production-based Cognitive Models as a Test Suite for Reinforcement Learning Algorithms0
Reinforcement Learning with Imbalanced Dataset for Data-to-Text Medical Report Generation0
Task-Completion Dialogue Policy Learning via Monte Carlo Tree Search with Dueling Network0
Guided Dialogue Policy Learning without Adversarial Learning in the LoopCode0
Few-Shot Multi-Hop Relation Reasoning over Knowledge Bases0
Topic-Preserving Synthetic News Generation: An Adversarial Deep Reinforcement Learning Approach0
Personalized Multimorbidity Management for Patients with Type 2 Diabetes Using Reinforcement Learning of Electronic Health RecordsCode0
Abstract Value Iteration for Hierarchical Reinforcement Learning0
Reinforcement Learning of Causal Variables Using Mediation Analysis0
Machine versus Human Attention in Deep Reinforcement Learning Tasks0
How do Offline Measures for Exploration in Reinforcement Learning behave?0
Learning Personalized Discretionary Lane-Change Initiation for Fully Autonomous Driving Based on Reinforcement Learning0
Learning to Represent Action Values as a Hypergraph on the Action Vertices0
Learning to Unknot0
Designing Interpretable Approximations to Deep Reinforcement Learning0
DeepFoldit -- A Deep Reinforcement Learning Neural Network Folding Proteins0
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment0
Show:102550
← PrevPage 203 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified