SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 71517200 of 15113 papers

TitleStatusHype
Improving Generalization of Deep Reinforcement Learning-based TSP Solvers0
Scalable Multi-Agent Reinforcement Learning for Residential Load Scheduling under Data Governance0
Hierarchical Potential-based Reward Shaping from Task SpecificationsCode0
Adaptive control of a mechatronic system using constrained residual reinforcement learning0
Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing ProblemCode1
Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning0
Decentralized Cooperative Lane Changing at Freeway Weaving Areas Using Multi-Agent Deep Reinforcement Learning0
Deep reinforcement learning for guidewire navigation in coronary artery phantom0
CARL: A Benchmark for Contextual and Adaptive Reinforcement LearningCode1
DeepEdge: A Deep Reinforcement Learning based Task Orchestrator for Edge Computing0
A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing0
A study of first-passage time minimization via Q-learning in heated gridworlds0
Dropout Q-Functions for Doubly Efficient Reinforcement LearningCode1
OTTR: Off-Road Trajectory Tracking using Reinforcement Learning0
NaRLE: Natural Language Models using Reinforcement Learning with Emotion Feedback0
Mining for Potent Inhibitors through Artificial Intelligence and Physics: A Unified Methodology for Ligand Based and Structure Based Drug Design0
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL0
Multi-Agent Path Planning Using Deep Reinforcement Learning0
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-EnsembleCode1
Reinforcement Learning for Admission Control in Wireless Virtual Network Embedding0
Large Batch Experience ReplayCode1
Behaviour-conditioned policies for cooperative reinforcement learning tasks0
A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes0
Learning to Assist Agents by Observing Them0
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation0
Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley ValuesCode1
Automating Privilege Escalation with Deep Reinforcement Learning0
DRL-Clusters: Buffer Management with Clustering based Deep Reinforcement Learning0
Meta-Reinforcement Learning via Buffering Graph Signatures for Live Video Streaming EventsCode0
Decentralized Safe Reinforcement Learning for Voltage Control0
A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances0
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations0
Mapping Language to Programs using Multiple Reward Components with Inverse Reinforcement LearningCode0
Seeking Visual Discomfort: Curiosity-driven Representations for Reinforcement Learning0
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning0
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement LearningCode0
Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Learning0
Offline Reinforcement Learning with Reverse Model-based ImaginationCode1
Safety aware model-based reinforcement learning for optimal control of a class of output-feedback nonlinear systems0
Motion Planning for Autonomous Vehicles in the Presence of Uncertainty Using Reinforcement Learning0
Multi-lane Cruising Using Hierarchical Planning and Reinforcement Learning0
A Cramér Distance perspective on Quantile Regression based Distributional Reinforcement LearningCode0
Cellular traffic offloading via Opportunistic Networking with Reinforcement Learning0
Guiding Evolutionary Strategies by Differentiable Robot SimulatorsCode0
DNN-Opt: An RL Inspired Optimization for Analog Circuit Sizing using Deep Neural Networks0
Divergence-Regularized Multi-Agent Actor-Critic0
Decentralized Graph-Based Multi-Agent Reinforcement Learning Using Reward Machines0
Neural Network Verification in Control0
MOLUCINATE: A Generative Model for Molecules in 3D SpaceCode1
Trajectory Planning with Deep Reinforcement Learning in High-Level Action Spaces0
Show:102550
← PrevPage 144 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified