SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
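The interaction loop described above can be sketched with a toy example. This is a minimal illustration, not taken from any paper listed here: it assumes a hypothetical five-cell corridor environment and uses tabular Q-learning, which is only one of many ways to learn a policy. The names `step`, `train`, `greedy_policy`, and the hyperparameter values are all assumptions chosen for illustration.

```python
import random

N_STATES = 5      # hypothetical corridor with cells 0..4; cell 4 is the goal
ACTIONS = (0, 1)  # 0 = step left, 1 = step right


def step(state, action):
    """Toy environment dynamics: return (next_state, reward, done)."""
    if action == 0:
        next_state = max(0, state - 1)
    else:
        next_state = min(N_STATES - 1, state + 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # feedback arrives only at the goal
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (assumed settings)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore
            else:
                best = max(q[state])          # exploit, breaking ties randomly
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action in each non-terminal state under the learned values."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
```

Calling `greedy_policy(train())` returns the action the agent prefers in each non-terminal cell. The discount factor `gamma` is what makes the agent value long-term reward: a reward several steps away still raises the value of actions that lead toward it.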

Papers

Showing 10001-10025 of 15113 papers

Title | Status | Hype
Multi-condition multi-objective optimization using deep reinforcement learning | | 0
Multi-criteria Hardware Trojan Detection: A Reinforcement Learning Approach | | 0
Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning | | 0
Multi-Fidelity Policy Gradient Algorithms | | 0
Multi-fidelity reinforcement learning framework for shape optimization | | 0
Multifidelity Reinforcement Learning with Control Variates | | 0
Multi-Flow Transmission in Wireless Interference Networks: A Convergent Graph Learning Approach | | 0
Multi-focus Attention Network for Efficient Deep Reinforcement Learning | | 0
Multi-Issue Bargaining With Deep Reinforcement Learning | | 0
Multi-lane Cruising Using Hierarchical Planning and Reinforcement Learning | | 0
Multi-level Explanation of Deep Reinforcement Learning-based Scheduling | | 0
Multi-Level Policy and Reward Reinforcement Learning for Image Captioning | | 0
Multi-market Energy Optimization with Renewables via Reinforcement Learning | | 0
Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach | | 0
Multimodal Deep Reinforcement Learning for Portfolio Optimization | | 0
Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning | | 0
Multi-modal Feedback for Affordance-driven Interactive Reinforcement Learning | | 0
Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog | | 0
Multimodal Machine Translation with Reinforcement Learning | | 0
Multi-modal reward for visual relationships-based image captioning | | 0
Multimodal Reward Shaping for Efficient Exploration in Reinforcement Learning | | 0
Multi-Modal Transformer and Reinforcement Learning-based Beam Management | | 0
Multi-Objective Autonomous Braking System using Naturalistic Dataset | | 0
Multi-Objective Decision Transformers for Offline Reinforcement Learning | | 0
Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and Stability | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | | Unverified