SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback, given as rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the long-term reward.
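The interaction loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the five-cell corridor environment, the reward of 1.0 at the goal, the hyperparameters, and every name (`step`, `choose`, `train`, `greedy_policy`) are hypothetical and introduced here. Tabular Q-learning is used as one standard way to learn from rewards; the description above does not prescribe any particular algorithm.

```python
import random

N_STATES = 5          # corridor cells 0..4; cell 4 is the goal (terminal)
LEFT, RIGHT = 0, 1


def step(state, action):
    """Hypothetical deterministic environment: move one cell, reward 1.0 at the goal."""
    nxt = min(state + 1, N_STATES - 1) if action == RIGHT else max(state - 1, 0)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def choose(q_row, epsilon, rng):
    """Epsilon-greedy action selection with random tie-breaking."""
    if rng.random() < epsilon:
        return rng.randrange(2)
    best = max(q_row)
    return rng.choice([a for a in (LEFT, RIGHT) if q_row[a] == best])


def train(episodes=300, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn action values from interaction using the Q-learning update."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(200):  # step cap so every episode terminates
            action = choose(q[state], epsilon, rng)
            nxt, reward, done = step(state, action)
            # Target = immediate reward plus discounted value of the next state.
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_policy(q):
    """Best-known action for each non-terminal cell."""
    return [RIGHT if row[RIGHT] > row[LEFT] else LEFT for row in q[:-1]]
```

Calling `greedy_policy(train())` returns the learned action for each non-terminal cell. The discount factor `gamma` is what makes the agent value reward that arrives several steps later, which is the "long-term reward" in the description.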

Papers

Showing 6251-6300 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Limited Query Graph Connectivity Test | | 0 |
| Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs | | 0 |
| Lineage Evolution Reinforcement Learning | | 0 |
| Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions | | 0 |
| Linear Complementarity for Regularized Policy Evaluation and Improvement | | 0 |
| Linear convergence of a policy gradient method for some finite horizon continuous time control problems | | 0 |
| Linear Feature Encoding for Reinforcement Learning | | 0 |
| Linear interpolation gives better gradients than Gaussian smoothing in derivative-free optimization | | 0 |
| Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods | | 0 |
| Logarithmic regret for episodic continuous-time linear-quadratic reinforcement learning over a finite-time horizon | | 0 |
| Linear Reinforcement Learning with Ball Structure Action Space | | 0 |
| Linear Representation Meta-Reinforcement Learning for Instant Adaptation | | 0 |
| Linear Stochastic Approximation: Constant Step-Size and Iterate Averaging | | 0 |
| LISPR: An Options Framework for Policy Reuse with Reinforcement Learning | | 0 |
| Listener-Rewarded Thinking in VLMs for Image Preferences | | 0 |
| LlamaRL: A Distributed Asynchronous Reinforcement Learning Framework for Efficient Large-scale LLM Trainin | | 0 |
| LLM Alignment as Retriever Optimization: An Information Retrieval Perspective | | 0 |
| LLM Augmented Hierarchical Agents | | 0 |
| LLM-Augmented Symbolic Reinforcement Learning with Landmark-Based Task Decomposition | | 0 |
| LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions | | 0 |
| LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble | | 0 |
| LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models | | 0 |
| LLM-hRIC: LLM-empowered Hierarchical RAN Intelligent Control for O-RAN | | 0 |
| LLMs for Engineering: Teaching Models to Design High Powered Rockets | | 0 |
| LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard | | 0 |
| LLMStinger: Jailbreaking LLMs using RL fine-tuned LLMs | | 0 |
| LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning | | 0 |
| Reward Guidance for Reinforcement Learning Tasks Based on Large Language Models: The LMGT Framework | | 0 |
| Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning | | 0 |
| Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning | | 0 |
| Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning | | 0 |
| Local Differential Privacy for Regret Minimization in Reinforcement Learning | | 0 |
| Local Environment Poisoning Attacks on Federated Reinforcement Learning | | 0 |
| LocalEscaper: A Weakly-supervised Framework with Regional Reconstruction for Scalable Neural TSP Solvers | | 0 |
| Local Explanations for Reinforcement Learning | | 0 |
| Local Feature Swapping for Generalization in Reinforcement Learning | | 0 |
| Local-Guided Global: Paired Similarity Representation for Visual Reinforcement Learning | | 0 |
| Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning | | 0 |
| Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations | | 0 |
| Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition | | 0 |
| Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs | | 0 |
| Local Look-Ahead Guidance via Verifier-in-the-Loop for Automated Theorem Proving | | 0 |
| Locally Constrained Representations in Reinforcement Learning | | 0 |
| Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes | | 0 |
| Locally Private Distributed Reinforcement Learning | | 0 |
| Local Navigation and Docking of an Autonomous Robot Mower using Reinforcement Learning and Computer Vision | | 0 |
| Local Nonstationarity for Efficient Bayesian Optimization | | 0 |
| Local Pairwise Distance Matching for Backpropagation-Free Reinforcement Learning | | 0 |
| Local Policy Optimization for Trajectory-Centric Reinforcement Learning | | 0 |
| Local Search for Policy Iteration in Continuous Control | | 0 |
Page 126 of 303

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |