SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1100111025 of 15113 papers

TitleStatusHype
Towards Robust Knowledge Graph Embedding via Multi-task Reinforcement Learning0
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption0
Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning0
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses0
Towards Safe Continuing Task Reinforcement Learning0
Towards Safe Control of Continuum Manipulator Using Shielded Multiagent Reinforcement Learning0
Towards Safe, Explainable, and Regulated Autonomous Driving0
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control0
Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review0
Towards sample-efficient episodic control with DAC-ML0
Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning0
Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning0
Towards Smarter Sensing: 2D Clutter Mitigation in RL-Driven Cognitive MIMO Radar0
Towards Socially and Morally Aware RL agent: Reward Design With LLM0
Towards Synthesizing Complex Programs from Input-Output Examples0
Towards Task-Prioritized Policy Composition0
Towards Understanding Chinese Checkers with Heuristics, Monte Carlo Tree Search, and Deep Reinforcement Learning0
Towards Understanding Deep Policy Gradients: A Case Study on PPO0
The Benefits of Being Categorical Distributional: Uncertainty-aware Regularized Exploration in Reinforcement Learning0
Towards Understanding Distributional Reinforcement Learning: Regularization, Optimization, Acceleration and Sinkhorn Algorithm0
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process0
Towards Unknown-aware Deep Q-Learning0
Towards Unraveling and Improving Generalization in World Models0
Fairness-Oriented User Scheduling for Bursty Downlink Transmission Using Multi-Agent Reinforcement Learning0
Towards using Reinforcement Learning for Scaling and Data Replication in Cloud Systems0
Show:102550
← PrevPage 441 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified