SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 47014750 of 15113 papers

TitleStatusHype
An Invitation to Deep Reinforcement Learning0
An L^2 Analysis of Reinforcement Learning in High Dimensions with Kernel and Neural Network Approximation0
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models0
Annotating Motion Primitives for Simplifying Action Search in Reinforcement Learning0
An ocular biomechanics environment for reinforcement learning0
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning0
An Offline Deep Reinforcement Learning for Maintenance Decision-Making0
An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems0
Anomalous State Sequence Modeling to Enhance Safety in Reinforcement Learning0
Anomaly Detection Under Controlled Sensing Using Actor-Critic Reinforcement Learning0
A non-cooperative meta-modeling game for automated third-party calibrating, validating, and falsifying constitutive laws with parallelized adversarial attacks0
An online evolving framework for advancing reinforcement-learning based automated vehicle control0
An Online Model-Following Projection Mechanism Using Reinforcement Learning0
An Online Prediction Algorithm for Reinforcement Learning with Linear Function Approximation using Cross Entropy Method0
An open source Multi-Agent Deep Reinforcement Learning Routing Simulator for satellite networks0
An Optics Controlling Environment and Reinforcement Learning Benchmarks0
An Optimal Control View of Adversarial Machine Learning0
An Optimal Online Method of Selecting Source Policies for Reinforcement Learning0
An Optimization Framework for Task Sequencing in Curriculum Learning0
An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems0
An Option-Dependent Analysis of Regret Minimization Algorithms in Finite-Horizon Semi-Markov Decision Processes0
An Oracle and Observations for the OpenAI Gym / ALE Freeway Environment0
Reinforcement Learning with Wasserstein Distance Regularisation, with Applications to Multipolicy Learning0
A novel agent with formal goal-reaching guarantees: an experimental study with a mobile robot0
A novel approach for multi-agent cooperative pursuit to capture grouped evaders0
A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances0
A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments0
A Novel Deep Reinforcement Learning Based Automated Stock Trading System Using Cascaded LSTM Networks0
A Novel Deep Reinforcement Learning-based Approach for Enhancing Spectral Efficiency of IRS-assisted Wireless Systems0
A Novel Entropy-Maximizing TD3-based Reinforcement Learning for Automatic PID Tuning0
A Novel Experts Advice Aggregation Framework Using Deep Reinforcement Learning for Portfolio Management0
A Novel Framework for Neural Architecture Search in the Hill Climbing Domain0
A Novel Multi-Agent Deep RL Approach for Traffic Signal Control0
A Novel Multi-Objective Reinforcement Learning Algorithm for Pursuit-Evasion Game0
A Novel Neuromorphic Processors Realization of Spiking Deep Reinforcement Learning for Portfolio Management0
A Novel Reinforcement Learning Model for Post-Incident Malware Investigations0
A novel repetition normalized adversarial reward for headline generation0
A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces0
An overall view of key problems in algorithmic trading and recent progress0
An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models0
An Overview of Natural Language State Representation for Reinforcement Learning0
An RL-Based Adaptive Detection Strategy to Secure Cyber-Physical Systems0
ANS: Adaptive Network Scaling for Deep Rectifier Reinforcement Learning Models0
Answer-driven Deep Question Generation based on Reinforcement Learning0
Answer Set Programming for Non-Stationary Markov Decision Processes0
Answer-Supervised Question Reformulation for Enhancing Conversational Machine Comprehension0
Emotional Contagion-Aware Deep Reinforcement Learning for Antagonistic Crowd Simulation0
Anti-Concentrated Confidence Bonuses for Scalable Exploration0
Antifragile Perimeter Control: Anticipating and Gaining from Disruptions with Reinforcement Learning0
Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System0
Show:102550
← PrevPage 95 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified