SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 95269550 of 15113 papers

TitleStatusHype
Unlocking Pixels for Reinforcement Learning via Implicit Attention0
Neurogenetic Programming Framework for Explainable Reinforcement LearningCode0
Generate and Revise: Reinforcement Learning in Neural Poetry0
Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning0
Introduction to Machine Learning for the Sciences0
Learning Optimal Strategies for Temporal Tasks in Stochastic Games0
An Analysis of Frame-skipping in Reinforcement Learning0
Sparsely ensembled convolutional neural network classifiers via reinforcement learningCode0
Multi-Agent Deep Reinforcement Learning for Request Dispatching in Distributed-Controller Software-Defined Networking0
A bandit approach to curriculum generation for automatic speech recognition0
Improving Model and Search for Computer Go0
MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management0
A Hybrid Approach for Reinforcement Learning Using Virtual Policy Gradient for Balancing an Inverted Pendulum0
Finite Sample Analysis of Minimax Offline Reinforcement Learning: Completeness, Fast Rates and First-Order Efficiency0
Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning0
Deceptive Reinforcement Learning for Privacy-Preserving Planning0
Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for Automated Driving using Distributional Reinforcement Learning0
Deep reinforcement learning for smart calibration of radio telescopesCode0
Revisiting Prioritized Experience Replay: A Value PerspectiveCode0
Provably Efficient Algorithms for Multi-Objective Competitive RL0
Persistent Rule-based Interactive Reinforcement Learning0
Hybrid Adversarial Imitation Learning0
A review of motion planning algorithms for intelligent robotics0
Deep reinforcement learning-based image classification achieves perfect testing set accuracy for MRI brain tumors with a training set of only 30 images0
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned0
Show:102550
← PrevPage 382 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified