SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 90019050 of 15113 papers

TitleStatusHype
Stochastic Approximation of Gaussian Free Energy for Risk-Sensitive Reinforcement Learning0
Reinforcement learning for instance segmentation with high-level priors0
Multi-Agent Deep Reinforcement Learning using Attentive Graph Neural Architectures for Real-Time Strategy Games0
Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning0
Revisiting Design Choices in Offline Model Based Reinforcement Learning0
On Instrumental Variable Regression for Deep Offline Policy EvaluationCode0
RLIRank: Learning to Rank with Reinforcement Learning for Dynamic Search0
On the use of feature-maps and parameter control for improved quality-diversity meta-evolution0
Meta Reinforcement Learning for Fast Adaptation of Hierarchical Policies0
Reinforcement Learning based Disease Progression Model for Alzheimer’s Disease0
Objective-aware Traffic Simulation via Inverse Reinforcement Learning0
Towards a Sample Efficient Reinforcement Learning Pipeline for Vision Based Robotics0
Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning0
A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning0
Deep Reinforcement Learning for Optimal Stopping with Application in Financial EngineeringCode0
Learn Fine-grained Adaptive Loss for Multiple Anatomical Landmark Detection in Medical Images0
Robo-Advising: Enhancing Investment with Inverse Optimization and Deep Reinforcement Learning0
Online Adaptive Optimal Control Algorithm Based on Synchronous Integral Reinforcement Learning With Explorations0
Reinforcement Learning Assisted Oxygen Therapy for COVID-19 Patients Under Intensive Care0
Online Multimodal Transportation Planning using Deep Reinforcement Learning0
Reinforcement Learning for Adaptive Video Compressive Sensing0
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization0
Meta-Reinforcement Learning by Tracking Task Non-stationarityCode0
Sublinear Least-Squares Value Iteration via Locality Sensitive Hashing0
PoBRL: Optimizing Multi-Document Summarization by Blending Reinforcement Learning Policies0
Application of deep reinforcement learning for Indian stock trading automation0
Learning and Information in Stochastic Networks and Queues0
Gym-ANM: Open-source software to leverage reinforcement learning for power system management in research and education0
Adaptive ABAC Policy Learning: A Reinforcement Learning Approach0
Behavior-based Neuroevolutionary Training in Reinforcement LearningCode0
Generic Itemset Mining Based on Reinforcement LearningCode0
Mean Field Games Flock! The Reinforcement Learning Way0
RL-GRIT: Reinforcement Learning for Grammar Inference0
RAIDER: Reinforcement-aided Spear Phishing Detector0
Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting0
Model-Based Offline Planning with Trajectory PruningCode0
DRAS-CQSim: A Reinforcement Learning based Framework for HPC Cluster Scheduling0
Regret Minimization Experience Replay in Off-Policy Reinforcement LearningCode0
Ordering-Based Causal Discovery with Reinforcement Learning0
Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning0
A Heuristically Assisted Deep Reinforcement Learning Approach for Network Slice Placement0
Efficient PAC Reinforcement Learning in Regular Decision Processes0
Feature-Based Interpretable Reinforcement Learning based on State-Transition Models0
Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning0
Intelligence and Unambitiousness Using Algorithmic Information Theory0
Online Algorithms and Policies Using Adaptive and Machine Learning Approaches0
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning0
Reinforcement Learning Based Safe Decision Making for Highway Autonomous Driving0
Principled Exploration via Optimistic Bootstrapping and Backward InductionCode0
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective0
Show:102550
← PrevPage 181 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified