SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 79517975 of 15113 papers

TitleStatusHype
Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings0
Context Meta-Reinforcement Learning via NeuromodulationCode0
Adjacency constraint for efficient hierarchical reinforcement learning0
Learning Coordinated Terrain-Adaptive Locomotion by Imitating a Centroidal Dynamics Planner0
A Decentralized Reinforcement Learning Framework for Efficient Passage of Emergency Vehicles0
Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning0
Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System0
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL0
Adaptive Discretization in Online Reinforcement Learning0
Reinforced Workload Distribution Fairness0
Mixed Cooperative-Competitive Communication Using Multi-Agent Reinforcement Learning0
Open Problem: Tight Online Confidence Intervals for RKHS Elements0
Efficient Meta Subspace OptimizationCode0
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision ProcessesCode0
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives0
An Adaptable Approach to Learn Realistic Legged Locomotion without Examples0
Choosing the Best of Both Worlds: Diverse and Novel Recommendations through Multi-Objective Reinforcement Learning0
Extracting Expert's Goals by What-if Interpretable Modeling0
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning0
Data Informed Residual Reinforcement Learning for High-Dimensional Robotic Tracking Control0
D2RLIR : an improved and diversified ranking function in interactive recommendation systems based on deep reinforcement learning0
Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem0
Enhancing Reinforcement Learning with discrete interfaces to learn the Dyck Language0
A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning0
Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids0
Show:102550
← PrevPage 319 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified