SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 57515800 of 15113 papers

TitleStatusHype
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games0
Lane Change Decision-making through Deep Reinforcement Learning with Rule-based Constraints0
Lane-Merging Using Policy-based Reinforcement Learning and Post-Optimization0
Langevin Dynamics for Adaptive Inverse Reinforcement Learning of Stochastic Gradient Algorithms0
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning0
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game0
Language-based General Action Template for Reinforcement Learning Agents0
Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model0
Language Expansion In Text-Based Games0
Language Guided Exploration for RL Agents in Text Environments0
Language Inference with Multi-head Automata through Reinforcement Learning0
LAPP: Large Language Model Feedback for Preference-Driven Reinforcement Learning0
LARES: Latent Reasoning for Sequential Recommendation0
Large Language Model driven Policy Exploration for Recommender Systems0
Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies0
Large Language Models as Efficient Reward Function Searchers for Custom-Environment Multi-Objective Reinforcement Learning0
Large Language Models (LLMs) Assisted Wireless Network Deployment in Urban Settings0
Large Language Models Prompting With Episodic Memory0
Large scale continuous-time mean-variance portfolio allocation via reinforcement learning0
Large-scale Interactive Recommendation with Tree-structured Policy Gradient0
Large-scale Regional Traffic Signal Control Based on Single-Agent Reinforcement Learning0
Large-scale Reinforcement Learning for Diffusion Models0
Large-Scale Retrieval for Reinforcement Learning0
Large-Scale Traffic Signal Control by a Nash Deep Q-network Approach0
Large-Scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning0
LASER: Learning a Latent Action Space for Efficient Reinforcement Learning0
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning0
Latent forward model for Real-time Strategy game planning with incomplete information0
LatentPoison -- Adversarial Attacks On The Latent Space0
Latent Properties of Lifelong Learning Systems0
Latent Space Policies for Hierarchical Reinforcement Learning0
Latent Space Reinforcement Learning for Steering Angle Prediction0
Latent Variable Representation for Reinforcement Learning0
Launchpad: Learning to Schedule Using Offline and Online RL Methods0
LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization0
Laxity-Aware Scalable Reinforcement Learning for HVAC Control0
Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act0
LBGP: Learning Based Goal Planning for Autonomous Following in Front0
LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning0
Leader Reward for POMO-Based Neural Combinatorial Optimization0
Multitask Neuroevolution for Reinforcement Learning with Long and Short Episodes0
Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning0
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection0
Learnable Triangulation for Deep Learning-based 3D Reconstruction of Objects of Arbitrary Topology from Single RGB Images0
Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes0
LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment0
Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution0
Learned Controllers for Agile Quadrotors in Pursuit-Evasion Games0
Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond0
Learn Fine-grained Adaptive Loss for Multiple Anatomical Landmark Detection in Medical Images0
Show:102550
← PrevPage 116 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified