SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 48264850 of 15113 papers

TitleStatusHype
Soft Action Priors: Towards Robust Policy Transfer0
Locally Constrained Representations in Reinforcement Learning0
Macro-Action-Based Multi-Agent/Robot Deep Reinforcement Learning under Partial Observability0
Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning0
Graph Value Iteration0
Deep Q-Network for AI Soccer0
IRS Assisted NOMA Aided Mobile Edge Computing with Queue Stability: Heterogeneous Multi-Agent Reinforcement Learning0
A Spiking Neural Network Learning Markov Chain0
A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline RegretCode0
A Deep Reinforcement Learning-Based Charging Scheduling Approach with Augmented Lagrangian for Electric Vehicle0
Transferring Knowledge for Reinforcement Learning in Contact-Rich Manipulation0
Meta-Reinforcement Learning for Adaptive Control of Second Order Systems0
MAN: Multi-Action Networks LearningCode1
Safe reinforcement learning control for continuous-time nonlinear systems without a backup controller0
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning0
Measuring Interventional Robustness in Reinforcement LearningCode0
MSVIPER: Improved Policy Distillation for Reinforcement-Learning-Based Robot Navigation0
BOME! Bilevel Optimization Made Easy: A Simple First-Order ApproachCode1
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning0
Enforcing the consensus between Trajectory Optimization and Policy Learning for precise robot control0
A Transferable and Automatic Tuning of Deep Reinforcement Learning for Cost Effective Phishing Detection0
Active Predicting Coding: Brain-Inspired Reinforcement Learning for Sparse Reward Robotic Control Problems0
Learning to Walk by Steering: Perceptive Quadrupedal Locomotion in Dynamic EnvironmentsCode1
"Guess what I'm doing": Extending legibility to sequential decision tasks0
Latent Plans for Task-Agnostic Offline Reinforcement LearningCode1
Show:102550
← PrevPage 194 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified