SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 69266950 of 15113 papers

TitleStatusHype
Context Meta-Reinforcement Learning via NeuromodulationCode0
On Joint Learning for Solving Placement and Routing in Chip DesignCode1
Reinforced Workload Distribution Fairness0
Mixed Cooperative-Competitive Communication Using Multi-Agent Reinforcement Learning0
Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System0
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL0
Attacking Video Recognition Models with Bullet-Screen CommentsCode1
Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning0
Adaptive Discretization in Online Reinforcement Learning0
Data Informed Residual Reinforcement Learning for High-Dimensional Robotic Tracking Control0
Open Problem: Tight Online Confidence Intervals for RKHS Elements0
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision ProcessesCode0
Efficient Meta Subspace OptimizationCode0
URLB: Unsupervised Reinforcement Learning BenchmarkCode1
D2RLIR : an improved and diversified ranking function in interactive recommendation systems based on deep reinforcement learning0
An Adaptable Approach to Learn Realistic Legged Locomotion without Examples0
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives0
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning0
Extracting Expert's Goals by What-if Interpretable Modeling0
Choosing the Best of Both Worlds: Diverse and Novel Recommendations through Multi-Objective Reinforcement Learning0
A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning0
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection0
Stabilising viscous extensional flows using Reinforcement LearningCode0
The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning0
Model based Multi-agent Reinforcement Learning with Tensor Decompositions0
Show:102550
← PrevPage 278 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified