SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
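As a rough illustration of the loop described above, the following sketch trains a tabular Q-learning agent on a tiny chain environment. The environment, the `step` and `train` helpers, and all hyperparameters are assumptions made up for this example; they do not come from any paper listed on this page. Q-learning is used only as one familiar way to learn a policy from reward feedback.

```python
import random


def discounted_return(rewards, gamma):
    """Cumulative reward, with each later reward discounted by gamma."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


N_STATES = 5       # hypothetical chain: states 0..4, state 4 is the goal
ACTIONS = (0, 1)   # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment: reward 1.0 only when the goal state is reached."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap on episode length
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore
            else:
                best = max(q[state])  # exploit, breaking ties at random
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted best next value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


q_table = train()
# Greedy policy for the non-terminal states: the action with the highest Q-value.
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
```

In this toy setting the greedy policy should end up choosing "right" in every non-terminal state, since that is the shortest path to the only reward. `discounted_return` shows how a sequence of rewards collapses into the single long-term quantity the agent is trying to maximize.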

Papers

Showing 9751-9800 of 15113 papers

Title | Status | Hype
Model-based reinforcement learning for protein backbone design | | 0
Model-Based Reinforcement Learning for Control of Strongly-Disturbed Unsteady Aerodynamic Flows | | 0
Model Based Reinforcement Learning for Atari | | 0
Model-Based Reinforcement Learning for Sepsis Treatment | | 0
Model-Based Reinforcement Learning for Type 1 Diabetes Blood Glucose Control | | 0
Whole-Chain Recommendations | | 0
Model-based Reinforcement Learning from Signal Temporal Logic Specifications | | 0
Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games | | 0
Model-Based Reinforcement Learning via Imagination with Derived Memory | | 0
Model-Based Reinforcement Learning via Meta-Policy Optimization | | 0
Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration | | 0
Model-based Reinforcement Learning with Ensembled Model-value Expansion | | 0
Model-Based Reinforcement Learning with Multinomial Logistic Function Approximation | | 0
Model-based Reinforcement Learning with a Hamiltonian Canonical ODE Network | | 0
Model-Based Reinforcement Learning with SINDy | | 0
Model-Based Reinforcement Learning with Value-Targeted Regression | | 0
Model Based Residual Policy Learning with Applications to Antenna Control | | 0
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds | | 0
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles | | 0
Model-based Trajectory Stitching for Improved Offline Reinforcement Learning | | 0
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning | | 0
Model Checking for Reinforcement Learning in Autonomous Driving: One Can Do More Than You Think! | | 0
Model Embedding Model-Based Reinforcement Learning | | 0
Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation | | 0
Model Ensemble-Based Intrinsic Reward for Sparse Reward Reinforcement Learning | | 0
Model Extraction Attacks Against Reinforcement Learning Based Controllers | | 0
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games | | 0
Model-Free Approach to Fair Solar PV Curtailment Using Reinforcement Learning | | 0
Model-Free Control for Distributed Stream Data Processing using Deep Reinforcement Learning | | 0
Model-free Control of Chaos with Continuous Deep Q-learning | | 0
Model-Free Deep Reinforcement Learning in Software-Defined Networks | | 0
Model-Free Episodic Control with State Aggregation | | 0
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2 | | 0
Model-Free Imitation Learning with Policy Optimization | | 0
Model-free Learning Control of Nonlinear Stochastic Systems with Stability Guarantee | | 0
Model-Free Learning of Safe yet Effective Controllers | | 0
Model-Free Linear Quadratic Control via Reduction to Expert Prediction | | 0
Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning | | 0
Model-Free μ Synthesis via Adversarial Reinforcement Learning | | 0
Model-free Nearly Optimal Control of Constrained-Input Nonlinear Systems Based on Synchronous Reinforcement Learning | | 0
Model-free optimal controller for discrete-time Markovian jump linear systems: A Q-learning approach | | 0
Model-free optimal control of discrete-time systems with additive and multiplicative noises | | 0
Model-Free Optimal Control of Linear Multi-Agent Systems via Decomposition and Hierarchical Approximation | | 0
Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs | | 0
Model Free Reinforcement Learning Algorithm for Stationary Mean field Equilibrium for Multiple Types of Agents | | 0
Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey | | 0
Model-free Reinforcement Learning for Stochastic Stackelberg Security Games | | 0
Model-free Reinforcement Learning for Branching Markov Decision Processes | | 0
Model-Free Reinforcement Learning for Symbolic Automata-encoded Objectives | | 0
Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care | | 0
Page 196 of 303

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | | Unverified