SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 54015450 of 15113 papers

TitleStatusHype
InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem0
Inferring and Conveying Intentionality: Beyond Numerical Rewards to Logical Intentions0
Infinite-Horizon Offline Reinforcement Learning with Linear Function Approximation: Curse of Dimensionality and Algorithm0
Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning0
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems0
Influence-Based Reinforcement Learning for Intrinsically-Motivated Agents0
InfoRL: Interpretable Reinforcement Learning using Information Maximization0
Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement learning0
Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems0
Information Maximizing Exploration with a Latent Dynamics Model0
INFOrmation Prioritization through EmPOWERment in Visual Model-Based RL0
Information Theoretically Aided Reinforcement Learning for Embodied Agents0
Information-Theoretic Confidence Bounds for Reinforcement Learning0
Information-Theoretic Considerations in Batch Reinforcement Learning0
Information Theoretic Model Predictive Q-Learning0
Information-theoretic Task Selection for Meta-Reinforcement Learning0
Informing Autonomous Deception Systems with Cyber Expert Performance Data0
InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management0
Inherently Explainable Reinforcement Learning in Natural Language0
Injecting Prior Knowledge for Transfer Learning into Reinforcement Learning Algorithms using Logic Tensor Networks0
Innate-Values-driven Reinforcement Learning based Cooperative Multi-Agent Cognitive Modeling0
In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents0
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning0
Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback0
Instabilities of Offline RL with Pre-Trained Neural Representation0
Instance-based Generalization in Reinforcement Learning0
Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective0
Instance-Dependent Confidence and Early Stopping for Reinforcement Learning0
Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design0
Instigating Cooperation among LLM Agents Using Adaptive Information Modulation0
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning0
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning0
Insulin Regimen ML-based control for T2DM patients0
In Support of Over-Parametrization in Deep Reinforcement Learning: an Empirical Study0
Insurance pricing on price comparison websites via reinforcement learning0
Integral Equations and Machine Learning0
Integrated and Adaptive Guidance and Control for Endoatmospheric Missiles via Reinforcement Learning0
Integrated Decision and Control at Multi-Lane Intersections with Mixed Traffic Flow0
Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification0
Integrated Drill Boom Hole-Seeking Control via Reinforcement Learning0
Integrated Learning of Dialog Strategies and Semantic Parsing0
Integrated Longitudinal Speed Decision-Making and Energy Efficiency Control for Connected Electrified Vehicles0
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments0
Integrating Domain Knowledge for handling Limited Data in Offline RL0
Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling0
Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research0
Integrating independent and centralized multi-agent reinforcement learning for traffic signal network optimization0
Integrating Large Language Models and Reinforcement Learning for Non-Linear Reasoning0
Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control0
Integrating LEO Satellite and UAV Relaying via Reinforcement Learning for Non-Terrestrial Networks0
Show:102550
← PrevPage 109 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified