SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 91519200 of 15113 papers

TitleStatusHype
Reinforcement Learning for Motor Control: A Comprehensive Review0
Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems0
Reinforcement Learning for Multi-Truck Vehicle Routing Problems0
Reinforcement Learning for Nested Polar Code Construction0
Reinforcement Learning for Node Selection in Branch-and-Bound0
Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system0
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism0
Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP0
Reinforcement Learning for on-line Sequence Transformation0
Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study0
Reinforcement Learning for Optimal Control of a District Cooling Energy Plant0
Reinforcement Learning for Optimal Load Distribution Sequencing in Resource-Sharing System0
Reinforcement learning for optimization of variational quantum circuit architectures0
On-line reinforcement learning for optimization of real-life energy trading strategy0
Reinforcement Learning for Optimized Beam Training in Multi-Hop Terahertz Communications0
Reinforcement Learning for Optimizing RAG for Domain Chatbots0
Reinforcement learning for options on target volatility funds0
Reinforcement Learning for Orientation Estimation Using Inertial Sensors with Performance Guarantee0
Reinforcement Learning for Personalized Dialogue Management0
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective0
Reinforcement learning for port-Hamiltonian systems0
Reinforcement Learning for Predicting Traffic Accidents0
Reinforcement Learning for Predict+Optimize0
Reinforcement Learning for Process Control with Application in Semiconductor Manufacturing0
Reinforcement Learning for Protocol Synthesis in Resource-Constrained Wireless Sensor and IoT Networks0
Reinforcement learning for pursuit and evasion of microswimmers at low Reynolds number0
Reinforcement Learning for Quantitative Trading0
Reinforcement Learning for Rate Maximization in IRS-aided OWC Networks0
Reinforcement Learning for Resilient Power Grids0
Reinforcement Learning for Resource Allocation in Steerable Laser-based Optical Wireless Systems0
Reinforcement Learning for Ridesharing: An Extended Survey0
Reinforcement Learning for Robotics and Control with Active Uncertainty Reduction0
Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots0
Reinforcement Learning for Robust Header Compression under Model Uncertainty0
Reinforcement Learning for Robust Missile Autopilot Design0
Reinforcement Learning for Safe Autonomous Two Device Navigation of Cerebral Vessels in Mechanical Thrombectomy0
Reinforcement Learning for Safe Occupancy Strategies in Educational Spaces during an Epidemic0
Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions0
Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions0
Reinforcement Learning for Scalable Logic Optimization with Graph Neural Networks0
Reinforcement Learning for Semantic Segmentation in Indoor Scenes0
Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning0
Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology0
Reinforcement Learning for Sociohydrology0
Reinforcement Learning for Solving the Pricing Problem in Column Generation: Applications to Vehicle Routing0
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment0
Reinforcement Learning for Standards Design0
Reinforcement Learning for Stock Transactions0
Reinforcement Learning for Strategic Recommendations0
Reinforcement learning for suppression of collective activity in oscillatory ensembles0
Show:102550
← PrevPage 184 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified