SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
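The interaction loop described above can be illustrated with a minimal sketch. It assumes a hypothetical five-state corridor environment (`Corridor`) and uses tabular Q-learning with epsilon-greedy exploration as one possible learning rule. The environment, the reward values, and the hyperparameters are all illustrative assumptions, not taken from any paper listed on this page.

```python
import random


class Corridor:
    """Hypothetical toy environment: states 0..n-1, start at 0, goal at n-1."""

    def __init__(self, n=5):
        self.n = n
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 1 moves right, action 0 moves left; walls clamp the position.
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.n - 1)
        done = self.state == self.n - 1
        # +1 on reaching the goal, a small step cost otherwise (the "punishment").
        reward = 1.0 if done else -0.01
        return self.state, reward, done


def choose_action(q_row, epsilon, rng):
    # Explore with probability epsilon, or when both actions look equally good.
    if rng.random() < epsilon or q_row[0] == q_row[1]:
        return rng.randrange(2)
    return 0 if q_row[0] > q_row[1] else 1


def train(episodes=500, alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Q-learning; returns the learned table q[state][action]."""
    rng = random.Random(seed)
    env = Corridor()
    q = [[0.0, 0.0] for _ in range(env.n)]
    for _ in range(episodes):
        s = env.reset()
        for _ in range(100):  # cap on episode length
            a = choose_action(q[s], epsilon, rng)
            s2, r, done = env.step(a)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s = s2
            if done:
                break
    return q


def greedy_policy(q):
    # The policy picks the action with the highest estimated value in each state.
    return [max((0, 1), key=lambda a: q[s][a]) for s in range(len(q))]


def run_greedy(q, max_steps=100):
    """Follow the greedy policy once and return the cumulative reward."""
    env = Corridor()
    policy = greedy_policy(q)
    s, total = env.reset(), 0.0
    for _ in range(max_steps):
        s, r, done = env.step(policy[s])
        total += r
        if done:
            break
    return total


if __name__ == "__main__":
    q_table = train()
    print("greedy policy per state:", greedy_policy(q_table))
    print("cumulative reward:", run_greedy(q_table))
```

In this sketch the discount factor `gamma` is what makes the agent value the long-term reward at the end of the corridor over the immediate step cost. Real benchmarks use far larger state spaces and function approximation instead of a table.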

Papers

Showing 9901-9950 of 15113 papers

Title | Status | Hype
Predictive Synthesis of Quantum Materials by Probabilistic Reinforcement Learning | - | 0
Multi-Agent Reinforcement Learning in Cournot Games | - | 0
Variance-Reduced Off-Policy Memory-Efficient Policy Search | - | 0
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning | Code | 0
Efficient Competitive Self-Play Policy Optimization | - | 0
Guided Policy Search Based Control of a High Dimensional Advanced Manufacturing Process | - | 0
Extended Radial Basis Function Controller for Reinforcement Learning | - | 0
Deep Learning Interference Cancellation in Wireless Networks | - | 0
Reinforcement Learning for Optimal Primary Frequency Control: A Lyapunov Approach | Code | 1
Semantic-preserving Reinforcement Learning Attack Against Graph Neural Networks for Malware Detection | Code | 1
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning | Code | 0
Embodied Visual Navigation with Automatic Curriculum Learning in Real Environments | - | 0
RLCFR: Minimize Counterfactual Regret by Deep Reinforcement Learning | - | 0
TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments | Code | 0
A framework for reinforcement learning with autocorrelated actions | Code | 0
COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning | - | 0
Importance Weighted Policy Learning and Adaptation | - | 0
Deep Reinforcement Learning for Option Replication and Hedging | - | 0
AoI Minimization in Status Update Control with Energy Harvesting Sensors | - | 0
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control | Code | 1
Solving Challenging Dexterous Manipulation Tasks With Trajectory Optimisation and Reinforcement Learning | Code | 1
Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control | - | 0
QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning | - | 0
Phasic Policy Gradient | Code | 1
Reinforcement Learning in Non-Stationary Discrete-Time Linear-Quadratic Mean-Field Games | - | 0
Bayesian Inverse Reinforcement Learning for Collective Animal Movement | Code | 0
Induction and Exploitation of Subgoal Automata for Reinforcement Learning | - | 0
Deep Active Inference for Partially Observable MDPs | Code | 1
Evolutionary Reinforcement Learning via Cooperative Coevolutionary Negatively Correlated Search | - | 0
Energy Expenditure Estimation Through Daily Activity Recognition Using a Smart-phone | - | 0
Graph neural networks-based Scheduler for Production planning problems using Reinforcement Learning | - | 0
Detecting and adapting to crisis pattern with context based Deep Reinforcement Learning | - | 0
Deep Learning and Reinforcement Learning for Autonomous Unmanned Aerial Systems: Roadmap for Theory to Deployment | - | 0
Driving Tasks Transfer in Deep Reinforcement Learning for Decision-making of Autonomous Vehicles | - | 0
Active Learning of Causal Structures with Deep Reinforcement Learning | - | 0
Robust Spoken Language Understanding with RL-based Value Error Recovery | - | 0
PAC Reinforcement Learning Algorithm for General-Sum Markov Games | - | 0
A Hybrid PAC Reinforcement Learning Algorithm | - | 0
Visualizing the Loss Landscape of Actor Critic Methods with Applications in Inventory Optimization | - | 0
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning | Code | 1
DRLE: Decentralized Reinforcement Learning at the Edge for Traffic Light Control in the IoV | Code | 2
Optimality-based Analysis of XCSF Compaction in Discrete Reinforcement Learning | Code | 0
Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling | - | 0
Sample-Efficient Automated Deep Reinforcement Learning | Code | 1
TAP-Net: Transport-and-Pack using Reinforcement Learning | - | 0
Adaptive Reinforcement Learning Model for Simulation of Urban Mobility during Crises | - | 0
A reinforcement learning approach to hybrid control design | - | 0
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics | - | 0
PlotThread: Creating Expressive Storyline Visualizations using Reinforcement Learning | - | 0
Solving the single-track train scheduling problem via Deep Reinforcement Learning | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified