SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 96019650 of 15113 papers

TitleStatusHype
A Methodology for the Development of RL-Based Adaptive Traffic Signal Controllers0
Solving optimal stopping problems with Deep Q-Learning0
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning0
Safe Learning and Optimization Techniques: Towards a Survey of the State of the Art0
Feature Selection Using Reinforcement Learning0
BF++: a language for general-purpose program synthesisCode0
Theory of Mind for Deep Reinforcement Learning in HanabiCode0
Prior Preference Learning from Experts:Designing a Reward with Active Inference0
Model-based Policy Search for Partially Measurable Systems0
Adversarial Machine Learning for Flooding Attacks on 5G Radio Access Network Slicing0
Flocking and Collision Avoidance for a Dynamic Squad of Fixed-Wing UAVs Using Deep Reinforcement Learning0
Deep Reinforcement Learning Optimizes Graphene Nanopores for Efficient Desalination0
Dynamic Bicycle Dispatching of Dockless Public Bicycle-sharing Systems using Multi-objective Reinforcement Learning0
Meta-Reinforcement Learning for Adaptive Motor Control in Changing Robot Dynamics and Environments0
Spatial Assembly: Generative Architecture With Reinforcement Learning, Self Play and Tree Search0
Regularized Policies are Reward Robust0
Stable deep reinforcement learning method by predicting uncertainty in rewards as a subtask0
Model-Based Reinforcement Learning for Approximate Optimal Control with Temporal Logic Specifications0
Cooperative and Competitive Biases for Multi-Agent Reinforcement Learning0
Natural Language Specification of Reinforcement Learning Policies through Differentiable Decision Trees0
HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned MessagingCode0
Deep Reinforcement Learning with Embedded LQR Controllers0
CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition0
A Safe Hierarchical Planning Framework for Complex Driving Scenarios based on Reinforcement Learning0
Local Navigation and Docking of an Autonomous Robot Mower using Reinforcement Learning and Computer Vision0
Deep Reinforcement Learning for Haptic Shared Control in Unknown Tasks0
Affordance-based Reinforcement Learning for Urban Driving0
Empirical Evaluation of Supervision Signals for Style Transfer Models0
Stochastic Learning Approach to Binary Optimization for Optimal Design of Experiments0
Robusta: Robust AutoML for Feature Selection via Reinforcement Learning0
Reinforcement learning based recommender systems: A survey0
Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time SystemsCode0
Learning and Fast Adaptation for Grid Emergency Control via Deep Meta Reinforcement Learning0
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement LearningCode0
Automated Synthesis of Steady-State Continuous Processes using Reinforcement Learning0
Linear Representation Meta-Reinforcement Learning for Instant Adaptation0
Queue-Learning: A Reinforcement Learning Approach for Providing Quality of Service0
Solving Common-Payoff Games with Approximate Policy IterationCode0
Independent Policy Gradient Methods for Competitive Reinforcement Learning0
First-Order Problem Solving through Neural MCTS based Reinforcement Learning0
Deep Interactive Bayesian Reinforcement Learning via Meta-Learning0
Action Priors for Large Action Spaces in RoboticsCode0
Identifying Decision Points for Safe and Interpretable Reinforcement Learning in Hypotension Treatment0
Deep Reinforcement Learning with Function Properties in Mean Reversion StrategiesCode0
Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for MANETs0
Safe Coupled Deep Q-Learning for Recommendation Systems0
qRRT: Quality-Biased Incremental RRT for Optimal Motion Planning in Non-Holonomic Systems0
CoachNet: An Adversarial Sampling Approach for Reinforcement Learning0
Coding for Distributed Multi-Agent Reinforcement Learning0
An Adaptive Multi-Agent Physical Layer Security Framework for Cognitive Cyber-Physical Systems0
Show:102550
← PrevPage 193 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified