SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 94019450 of 15113 papers

TitleStatusHype
Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions0
Reinforcement Learning with Subspaces using Free Energy Paradigm0
Reinforcement Learning with Supervision from Noisy Demonstrations0
Reinforcement Learning With Temporal Logic Rewards0
Reinforcement Learning with Temporal-Logic-Based Causal Diagrams0
Reinforcement Learning with Time-dependent Goals for Robotic Musicians0
Reinforcement Learning with Trajectory Feedback0
Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation0
Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections0
Reinforcement Learning Your Way: Agent Characterization through Policy Regularization0
Reinforcement Pre-Training0
Reinforcement Speculative Decoding for Fast Ranking0
Reinforce Security: A Model-Free Approach Towards Secure Wiretap Coding0
Single-step Options for Adversary Driving0
Reinforcing Semantic-Symmetry for Document Summarization0
Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models0
Reinforcing User Retention in a Billion Scale Short Video Recommender System0
Relate to Predict: Towards Task-Independent Knowledge Representations for Reinforcement Learning0
Relational Abstractions for Generalized Reinforcement Learning on Symbolic Problems0
Relational Deep Reinforcement Learning for Routing in Wireless Networks0
Relational-Grid-World: A Novel Relational Reasoning Environment and An Agent Model for Relational Information Extraction0
Relation Mention Extraction from Noisy Data with Hierarchical Reinforcement Learning0
Relation-R1: Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relational Comprehension0
Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation0
Relationship Explainable Multi-objective Optimization Via Vector Value Function Based Reinforcement Learning0
Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning0
Relative Importance Sampling for off-Policy Actor-Critic in Deep Reinforcement Learning0
Relative Policy-Transition Optimization for Fast Policy Transfer0
Optimal Actuator Attacks on Autonomous Vehicles Using Reinforcement Learning0
Low-Resource Machine Translation based on Asynchronous Dynamic Programming0
Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling0
Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement Learning0
LPaintB: Learning to Paint from Self-Supervision0
LPMARL: Linear Programming based Implicit Task Assigment for Hiearchical Multi-Agent Reinforcement Learning0
LSD-Net: Look, Step and Detect for Joint Navigation and Multi-View Recognition with Deep Reinforcement Learning0
LSTD with Random Projections0
LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement0
Lyapunov-Based Reinforcement Learning for Decentralized Multi-Agent Control0
Lyapunov-Based Reinforcement Learning State Estimator0
Lyapunov-based uncertainty-aware safe reinforcement learning0
Lyapunov Function Consistent Adaptive Network Signal Control with Back Pressure and Reinforcement Learning0
Lyapunov Robust Constrained-MDPs: Soft-Constrained Robustly Stable Policy Optimization under Model Uncertainty0
Lyceum: An efficient and scalable ecosystem for robot learning0
M3: Mamba-assisted Multi-Circuit Optimization via MBRL with Effective Scheduling0
M^3RL: Mind-aware Multi-agent Management Reinforcement Learning0
MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning0
MACC: Cross-Layer Multi-Agent Congestion Control with Deep Reinforcement Learning0
Machine Learning aided Crop Yield Optimization0
Machine learning and control engineering: The model-free case0
Machine Learning Applications in the Routing in Computer Networks0
Show:102550
← PrevPage 189 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified