SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 49264950 of 15113 papers

TitleStatusHype
A Scalable Reinforcement Learning Approach for Attack Allocation in Swarm to Swarm Engagement Problems0
A Scalable Reinforcement Learning-based System Using On-Chain Data for Cryptocurrency Portfolio Management0
A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis0
A Secure Learning Control Strategy via Dynamic Camouflaging for Unknown Dynamical Systems under Attacks0
A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot Interaction0
A storage expansion planning framework using reinforcement learning and simulation-based optimization0
ASHA: Assistive Teleoperation via Human-in-the-Loop Reinforcement Learning0
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play0
A Complete Characterization of Linear Estimators for Offline Policy Evaluation0
A Short Note on Soft-max and Policy Gradients in Bandits Problems0
A Short Note on the Relationship of Information Gain and Eluder Dimension0
A Short Survey On Memory Based Reinforcement Learning0
A Short Survey on Probabilistic Reinforcement Learning0
A short variational proof of equivalence between policy gradients and soft Q learning0
A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony0
A Signaling Game Approach to Databases Querying and Interaction0
A Simple Imitation Learning Method via Contrastive Regularization0
A Simple Reinforcement Learning Mechanism for Resource Allocation in LTE-A Networks with Markov Decision Process and Q-Learning0
A Simple Reward-free Approach to Constrained Reinforcement Learning0
A Simple Sparse Denoising Layer for Robust Deep Learning0
A Distance-based Anomaly Detection Framework for Deep Reinforcement Learning0
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning0
Novelty Detection in Reinforcement Learning with World Models0
A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences0
Ask1: Development and Reinforcement Learning-Based Control of a Custom Quadruped Robot0
Show:102550
← PrevPage 198 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified