SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 79518000 of 15113 papers

TitleStatusHype
Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings0
Context Meta-Reinforcement Learning via NeuromodulationCode0
Adjacency constraint for efficient hierarchical reinforcement learning0
Learning Coordinated Terrain-Adaptive Locomotion by Imitating a Centroidal Dynamics Planner0
A Decentralized Reinforcement Learning Framework for Efficient Passage of Emergency Vehicles0
Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning0
Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System0
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL0
Adaptive Discretization in Online Reinforcement Learning0
Reinforced Workload Distribution Fairness0
Mixed Cooperative-Competitive Communication Using Multi-Agent Reinforcement Learning0
Open Problem: Tight Online Confidence Intervals for RKHS Elements0
Efficient Meta Subspace OptimizationCode0
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision ProcessesCode0
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives0
An Adaptable Approach to Learn Realistic Legged Locomotion without Examples0
Choosing the Best of Both Worlds: Diverse and Novel Recommendations through Multi-Objective Reinforcement Learning0
Extracting Expert's Goals by What-if Interpretable Modeling0
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning0
Data Informed Residual Reinforcement Learning for High-Dimensional Robotic Tracking Control0
D2RLIR : an improved and diversified ranking function in interactive recommendation systems based on deep reinforcement learning0
Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem0
Enhancing Reinforcement Learning with discrete interfaces to learn the Dyck Language0
A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning0
Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids0
DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention0
A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems0
The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning0
Stabilising viscous extensional flows using Reinforcement LearningCode0
APPTeK: Agent-Based Predicate Prediction in Temporal Knowledge Graphs0
Model based Multi-agent Reinforcement Learning with Tensor Decompositions0
Transfer learning with causal counterfactual reasoning in Decision Transformers0
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection0
Reinforcement Learning in Factored Action Spaces using Tensor Decompositions0
Towards Hyperparameter-free Policy Selection for Offline Reinforcement LearningCode0
The Difficulty of Passive Learning in Deep Reinforcement Learning0
Multi-Agent Advisor Q-LearningCode0
Fragment-based Sequential Translation for Molecular Optimization0
Average-Reward Learning and Planning with Options0
Distributional Reinforcement Learning for Multi-Dimensional Reward FunctionsCode0
Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling0
EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization0
Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective0
Automating Control of Overestimation Bias for Reinforcement Learning0
Learning Robust Controllers Via Probabilistic Model-Based Policy Search0
Applications of Multi-Agent Reinforcement Learning in Future Internet: A Comprehensive Survey0
Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control0
Learning What to Memorize: Using Intrinsic Motivation to Form Useful Memory in Partially Observable Reinforcement Learning0
Can Q-Learning be Improved with Advice?0
Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning0
Show:102550
← PrevPage 160 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified