SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 61516200 of 15113 papers

TitleStatusHype
Argumentative Reward Learning: Reasoning About Human Preferences0
Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter0
DCE: Offline Reinforcement Learning With Double Conservative Estimates0
Neural Frank-Wolfe Policy Optimization for Region-of-Interest Intra-Frame Coding with HEVC/H.2650
Reinforcement Learning with Non-Exponential Discounting0
Safe Reinforcement Learning of Dynamic High-Dimensional Robotic Tasks: Navigation, Manipulation, Interaction0
Reinforcement Learning for Cognitive Delay/Disruption Tolerant Network Node Management in an LEO-based Satellite Constellation0
Paused Agent Replay Refresh0
Overcoming Referential Ambiguity in Language-Guided Goal-Conditioned Reinforcement Learning0
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective0
DEFT: Diverse Ensembles for Fast Transfer in Reinforcement Learning0
Actor-Critic Network for O-RAN Resource Allocation: xApp Design, Deployment, and Analysis0
Improving Document Image Understanding with Reinforcement Finetuning0
Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning0
Deep Reinforcement Learning for Adaptive Mesh Refinement0
Unsupervised Reward Shaping for a Robotic Sequential Picking Task from Visual Observations in a Logistics ScenarioCode0
Opportunities and Challenges from Using Animal Videos in Reinforcement Learning for Navigation0
Reward Learning using Structural Motifs in Inverse Reinforcement Learning0
Explainable Reinforcement Learning via Model TransformsCode0
Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations0
Unified Algorithms for RL with Decision-Estimation Coefficients: PAC, Reward-Free, Preference-Based Learning, and Beyond0
SAFER: Safe Collision Avoidance using Focused and Efficient Trajectory Search with Reinforcement Learning0
Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning0
Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning0
Reinforcement Learning in Computing and Network Convergence Orchestration0
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement LearningCode0
Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation0
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments0
An Investigation of the Bias-Variance Tradeoff in Meta-GradientsCode0
Identifiability and generalizability from multiple experts in Inverse Reinforcement LearningCode0
Computational Discovery of Energy-Efficient Heat Treatment for Microstructure Design using Deep Reinforcement Learning0
Learning from Symmetry: Meta-Reinforcement Learning with Symmetrical Behaviors and Language Instructions0
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games0
ECSAS: Exploring Critical Scenarios from Action Sequence in Autonomous Driving0
Hierarchical Decision Transformer0
Evaluation of Look-ahead Economic Dispatch Using Reinforcement Learning0
Hierarchical Decentralized Deep Reinforcement Learning Architecture for a Simulated Four-Legged AgentCode0
Model-Free Reinforcement Learning for Asset Allocation0
On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies0
Performance Optimization for Variable Bitwidth Federated Learning in Wireless Networks0
Towards Task-Prioritized Policy Composition0
Optimizing Crop Management with Reinforcement Learning and Imitation Learning0
Soft Action Priors: Towards Robust Policy Transfer0
Macro-Action-Based Multi-Agent/Robot Deep Reinforcement Learning under Partial Observability0
A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline RegretCode0
IRS Assisted NOMA Aided Mobile Edge Computing with Queue Stability: Heterogeneous Multi-Agent Reinforcement Learning0
Deep Q-Network for AI Soccer0
A Spiking Neural Network Learning Markov Chain0
Locally Constrained Representations in Reinforcement Learning0
Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning0
Show:102550
← PrevPage 124 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified