SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes long-term reward.
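The interaction loop described above can be sketched with tabular Q-learning on a toy problem. This is a minimal illustration only, not a method taken from any paper listed here: the five-state chain environment, the `step`, `train`, and `greedy_policy` helpers, and all hyperparameter values are assumptions made for the example.

```python
import random

# Hypothetical toy environment: states 0..4 on a line, state 4 is terminal.
N_STATES = 5
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # reward only on reaching the goal
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn action values by epsilon-greedy tabular Q-learning."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.choice(ACTIONS)
            else:
                # Exploit: pick a best-valued action, breaking ties randomly.
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the highest-valued action for each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
```

Calling `greedy_policy(train())` returns the learned action for each non-terminal state. In this toy setup the agent should learn to move right everywhere, since that is the only way to reach the rewarded terminal state. The discount factor `gamma` is what makes the objective cumulative: reward received later is worth less than reward received sooner.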

Papers

Showing 8201-8250 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning | | 0 |
| Objective-aware Traffic Simulation via Inverse Reinforcement Learning | | 0 |
| Towards a Sample Efficient Reinforcement Learning Pipeline for Vision Based Robotics | | 0 |
| Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection | Code | 1 |
| A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning | | 0 |
| Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering | Code | 0 |
| Enforcing Policy Feasibility Constraints through Differentiable Projection for Energy Optimization | Code | 1 |
| Learn Fine-grained Adaptive Loss for Multiple Anatomical Landmark Detection in Medical Images | | 0 |
| Improved Exploring Starts by Kernel Density Estimation-Based State-Space Coverage Acceleration in Reinforcement Learning | Code | 1 |
| Online Adaptive Optimal Control Algorithm Based on Synchronous Integral Reinforcement Learning With Explorations | | 0 |
| Robo-Advising: Enhancing Investment with Inverse Optimization and Deep Reinforcement Learning | | 0 |
| Reinforcement Learning Assisted Oxygen Therapy for COVID-19 Patients Under Intensive Care | | 0 |
| Application of deep reinforcement learning for Indian stock trading automation | | 0 |
| Learning and Information in Stochastic Networks and Queues | | 0 |
| Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition | Code | 1 |
| Gym-ANM: Open-source software to leverage reinforcement learning for power system management in research and education | | 0 |
| Adaptive ABAC Policy Learning: A Reinforcement Learning Approach | | 0 |
| Online Multimodal Transportation Planning using Deep Reinforcement Learning | | 0 |
| PoBRL: Optimizing Multi-Document Summarization by Blending Reinforcement Learning Policies | | 0 |
| Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization | | 0 |
| Sublinear Least-Squares Value Iteration via Locality Sensitive Hashing | | 0 |
| Reinforcement Learning for Adaptive Video Compressive Sensing | | 0 |
| Meta-Reinforcement Learning by Tracking Task Non-stationarity | Code | 0 |
| RL-GRIT: Reinforcement Learning for Grammar Inference | | 0 |
| Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning | Code | 1 |
| RAIDER: Reinforcement-aided Spear Phishing Detector | | 0 |
| Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting | | 0 |
| Mean Field Games Flock! The Reinforcement Learning Way | | 0 |
| Behavior-based Neuroevolutionary Training in Reinforcement Learning | Code | 0 |
| Generic Itemset Mining Based on Reinforcement Learning | Code | 0 |
| DRAS-CQSim: A Reinforcement Learning based Framework for HPC Cluster Scheduling | | 0 |
| Model-Based Offline Planning with Trajectory Pruning | Code | 0 |
| Regret Minimization Experience Replay in Off-Policy Reinforcement Learning | Code | 0 |
| Ordering-Based Causal Discovery with Reinforcement Learning | | 0 |
| Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning | | 0 |
| Feature-Based Interpretable Reinforcement Learning based on State-Transition Models | | 0 |
| Efficient PAC Reinforcement Learning in Regular Decision Processes | | 0 |
| A Heuristically Assisted Deep Reinforcement Learning Approach for Network Slice Placement | | 0 |
| Reinforcement Learning Based Safe Decision Making for Highway Autonomous Driving | | 0 |
| Online Algorithms and Policies Using Adaptive and Machine Learning Approaches | | 0 |
| Principled Exploration via Optimistic Bootstrapping and Backward Induction | Code | 0 |
| SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning | | 0 |
| Intelligence and Unambitiousness Using Algorithmic Information Theory | | 0 |
| Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning | | 0 |
| A Survey on Reinforcement Learning-Aided Caching in Mobile Edge Networks | | 0 |
| Deep Multi-agent Reinforcement Learning for Highway On-Ramp Merging in Mixed Traffic | Code | 1 |
| Interpretable performance analysis towards offline reinforcement learning: A dataset perspective | | 0 |
| Acting upon Imagination: when to trust imagined trajectories in model based reinforcement learning | | 0 |
| An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet | Code | 1 |
| Adversarial Reinforcement Learning in Dynamic Channel Access and Power Control | | 0 |
Page 165 of 303

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |