SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 80518100 of 15113 papers

TitleStatusHype
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement LearningCode1
RewardsOfSum: Exploring Reinforcement Learning Rewards for Summarisation0
Don't Get Yourself into Trouble! Risk-aware Decision-Making for Autonomous Vehicles0
Curriculum Design for Teaching via Demonstrations: Theory and ApplicationsCode0
A Deep Value-network Based Approach for Multi-Driver Order Dispatching0
Learning Markov State Abstractions for Deep Reinforcement LearningCode1
Left Ventricle Contouring in Cardiac Images Based on Deep Reinforcement LearningCode0
Dynamic Sparse Training for Deep Reinforcement LearningCode1
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning0
Towards Practical Credit Assignment for Deep Reinforcement Learning0
Residual Feedback Learning for Contact-Rich Manipulation Tasks with Uncertainty0
Verifiable and Compositional Reinforcement Learning SystemsCode0
Correcting Momentum in Temporal Difference LearningCode0
Entropy Regularized Reinforcement Learning Using Large Deviation TheoryCode0
A Computational Model of Representation Learning in the Brain Cortex, Integrating Unsupervised and Reinforcement Learning0
Concave Utility Reinforcement Learning: the Mean-Field Game Viewpoint0
Causal Influence Detection for Improving Efficiency in Reinforcement LearningCode1
Learning to Guide a Saturation-Based Theorem Prover0
Identifiability in inverse reinforcement learning0
Learning Combinatorial Node Labeling Algorithms0
Task-driven Semantic Coding via Reinforcement LearningCode1
XIRL: Cross-embodiment Inverse Reinforcement LearningCode0
Multi-agent Battery Storage Management using MPC-based Reinforcement Learning0
Towards robust and domain agnostic reinforcement learning competitions0
The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces0
Average-Reward Reinforcement Learning with Trust Region Methods0
Explainable Artificial Intelligence (XAI) for Increasing User Trust in Deep Reinforcement Learning Driven Autonomous Systems0
Learning without Knowing: Unobserved Context in Continuous Transfer Reinforcement Learning0
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement LearningCode1
DisTop: Discovering a Topological representation to learn diverse and rewarding skills0
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning0
Efficient Continuous Control with Double Actors and Regularized CriticsCode1
Distributional Reinforcement Learning with Unconstrained Monotonic Neural NetworksCode1
3D UAV Trajectory and Data Collection Optimisation via Deep Reinforcement Learning0
Control-Oriented Model-Based Reinforcement Learning with Implicit DifferentiationCode1
ScheduleNet: Learn to solve multi-agent scheduling problems with reinforcement learningCode1
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement LearningCode1
Learning Routines for Effective Off-Policy Reinforcement Learning0
Heuristic-Guided Reinforcement Learning0
Same State, Different Task: Continual Reinforcement Learning without InterferenceCode1
Reinforcement Learning for Assignment Problem with Time Constraints0
Resource Allocation in Disaggregated Data Centre Systems with Reinforcement Learning0
Online reinforcement learning with sparse rewards through an active inference capsuleCode1
Model-agnostic and Scalable Counterfactual Explanations via Reinforcement LearningCode2
Differentiable Architecture Search for Reinforcement LearningCode1
Robustifying Reinforcement Learning Policies with L_1 Adaptive Control0
Detecting and Adapting to Novelty in Games0
Be Considerate: Objectives, Side Effects, and Deciding How to Act0
Cross-Trajectory Representation Learning for Zero-Shot Generalization in RLCode0
Celebrating Diversity in Shared Multi-Agent Reinforcement LearningCode1
Show:102550
← PrevPage 162 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified