SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
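
As a rough illustration of the loop described above, the sketch below trains an agent with tabular Q-learning, one common RL algorithm chosen here only as an example. The corridor environment, the `step`, `train` and `greedy_policy` helpers, and all hyperparameters are hypothetical and not taken from any paper listed on this page.

```python
import random

# Hypothetical toy environment: a corridor of N_STATES cells.
# The agent starts in cell 0 and earns reward +1 on reaching the last cell.
N_STATES = 5
ACTIONS = (-1, +1)  # step left, step right


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, or when both estimates tie.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])
            # Feedback: reward now plus discounted best estimate of the future.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best-known action for each non-terminal cell."""
    return [ACTIONS[0] if row[0] > row[1] else ACTIONS[1] for row in q[:-1]]
```

Calling `greedy_policy(train())` on this toy problem should return a step to the right in every cell, which is the policy that maximizes the discounted long-term reward here.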

Papers

Showing 12251-12300 of 15113 papers

Title | Status | Hype
Search from History and Reason for Future: Two-stage Reasoning on Temporal Knowledge Graphs | | 0
Searching for High-Value Molecules Using Reinforcement Learning and Transformers | | 0
Searching Learning Strategy with Reinforcement Learning for 3D Medical Image Segmentation | | 0
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning | | 0
Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits | | 0
SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving with Analytic Safety Guarantees | | 0
Secure Computation Offloading in Blockchain based IoT Networks with Deep Reinforcement Learning | | 0
Security Analysis of Safe and Seldonian Reinforcement Learning Algorithms | | 0
Security-Aware Virtual Network Embedding Algorithm based on Reinforcement Learning | | 0
SeedNet: Automatic Seed Generation With Deep Reinforcement Learning for Robust Interactive Segmentation | | 0
Seeing by haptic glance: reinforcement learning-based 3D object Recognition | | 0
Seeing-Eye Quadruped Navigation with Force Responsive Locomotion Control | | 0
Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation | | 0
Seeking Visual Discomfort: Curiosity-driven Representations for Reinforcement Learning | | 0
SeekNet: Improved Human Instance Segmentation and Tracking via Reinforcement Learning Based Optimized Robot Relocation | | 0
SEERL: Sample Efficient Ensemble Reinforcement Learning | | 0
Segmenting Action-Value Functions Over Time-Scales in SARSA via TD(Δ) | | 0
Segregation Dynamics with Reinforcement Learning and Agent Based Modeling | | 0
SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition | | 0
Select before Act: Spatially Decoupled Action Repetition for Continuous Control | | 0
Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning | | 0
Selecting Near-Optimal Approximate State Representations in Reinforcement Learning | | 0
Selecting the State-Representation in Reinforcement Learning | | 0
Selective Credit Assignment | | 0
Selective Experience Sharing in Reinforcement Learning Enhances Interference Management | | 0
Selective Particle Attention: Visual Feature-Based Attention in Deep Reinforcement Learning | | 0
Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation | | 0
Selective Reviews of Bandit Problems in AI via a Statistical View | | 0
Selective Token Generation for Few-shot Language Modeling | | 0
Selective Transfer with Reinforced Transfer Network for Partial Domain Adaptation | | 0
Selective Uncertainty Propagation in Offline RL | | 0
Selector-Enhancer: Learning Dynamic Selection of Local and Non-local Attention Operation for Speech Enhancement | | 0
Self-Adapting Goals Allow Transfer of Predictive Models to New Tasks | | 0
Self-Awareness Safety of Deep Reinforcement Learning in Road Traffic Junction Driving | | 0
Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning | | 0
Self-Consistent Models and Values | | 0
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings | | 0
Self-Critical Alternate Learning based Semantic Broadcast Communication | | 0
Self-critical Sequence Training for Automatic Speech Recognition | | 0
Self-Driving Car Racing: Application of Deep Reinforcement Learning | | 0
Self-driving scale car trained by Deep reinforcement learning | | 0
Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning | | 0
Self-evolving Autoencoder Embedded Q-Network | | 0
Self-Evolving Curriculum for LLM Reasoning | | 0
Self-Imitation Advantage Learning | | 0
Self-Imitation Learning by Planning | | 0
Self-Imitation Learning from Demonstrations | | 0
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning | | 0
Self-Inspection Method of Unmanned Aerial Vehicles in Power Plants Using Deep Q-Network Reinforcement Learning | | 0
Self-Learned Formula Synthesis in Set Theory | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | | Unverified