SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1040110450 of 15113 papers

TitleStatusHype
Decoding Polar Codes with Reinforcement Learning0
Autonomous Learning of Features for Control: Experiments with Embodied and Situated Agents0
Efficient Transformers: A Survey0
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement LearningCode0
Variance-Reduced Off-Policy Memory-Efficient Policy Search0
Reinforcement Learning for Dynamic Resource Optimization in 5G Radio Access Network Slicing0
Multi-Agent Reinforcement Learning in Cournot Games0
Predictive Synthesis of Quantum Materials by Probabilistic Reinforcement Learning0
Efficient Competitive Self-Play Policy Optimization0
Extended Radial Basis Function Controller for Reinforcement Learning0
Guided Policy Search Based Control of a High Dimensional Advanced Manufacturing Process0
Deep Learning Interference Cancellation in Wireless Networks0
Embodied Visual Navigation with Automatic Curriculum Learning in Real Environments0
Physically Embedded Planning Problems: New Challenges for Reinforcement LearningCode0
TripleTree: A Versatile Interpretable Representation of Black Box Agents and their EnvironmentsCode0
RLCFR: Minimize Counterfactual Regret by Deep Reinforcement Learning0
COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning0
Importance Weighted Policy Learning and Adaptation0
A framework for reinforcement learning with autocorrelated actionsCode0
AoI Minimization in Status Update Control with Energy Harvesting Sensors0
Deep Reinforcement Learning for Option Replication and Hedging0
Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control0
QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning0
Reinforcement Learning in Non-Stationary Discrete-Time Linear-Quadratic Mean-Field Games0
Graph neural networks-based Scheduler for Production planning problems using Reinforcement Learning0
Evolutionary Reinforcement Learning via Cooperative Coevolutionary Negatively Correlated Search0
Energy Expenditure Estimation Through Daily Activity Recognition Using a Smart-phone0
Bayesian Inverse Reinforcement Learning for Collective Animal MovementCode0
Induction and Exploitation of Subgoal Automata for Reinforcement Learning0
Detecting and adapting to crisis pattern with context based Deep Reinforcement Learning0
Deep Learning and Reinforcement Learning for Autonomous Unmanned Aerial Systems: Roadmap for Theory to Deployment0
Active Learning of Causal Structures with Deep Reinforcement Learning0
Driving Tasks Transfer in Deep Reinforcement Learning for Decision-making of Autonomous Vehicles0
Robust Spoken Language Understanding with RL-based Value Error Recovery0
PAC Reinforcement Learning Algorithm for General-Sum Markov Games0
A Hybrid PAC Reinforcement Learning Algorithm0
Visualizing the Loss Landscape of Actor Critic Methods with Applications in Inventory Optimization0
Optimality-based Analysis of XCSF Compaction in Discrete Reinforcement LearningCode0
TAP-Net: Transport-and-Pack using Reinforcement Learning0
Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling0
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics0
Adaptive Reinforcement Learning Model for Simulation of Urban Mobility during Crises0
A reinforcement learning approach to hybrid control design0
PlotThread: Creating Expressive Storyline Visualizations using Reinforcement Learning0
Solving the single-track train scheduling problem via Deep Reinforcement Learning0
Reinforcement Learning-based Black-Box Evasion Attacks to Link Prediction in Dynamic Graphs0
Ranking Policy DecisionsCode0
Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL0
Beyond variance reduction: Understanding the true impact of baselines on policy optimization0
Data-driven Outer-Loop Control Using Deep Reinforcement Learning for Trajectory Tracking0
Show:102550
← PrevPage 209 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified