SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1042610450 of 15113 papers

TitleStatusHype
End-to-End Policy Gradient Method for POMDPs and Explainable Agents0
End-to-End Race Driving with Deep Reinforcement Learning0
End-to-end Reinforcement Learning of Robotic Manipulation with Robust Keypoints Representation0
End-to-End Vision-Based Adaptive Cruise Control (ACC) Using Deep Reinforcement Learning0
Energy Aware Deep Reinforcement Learning Scheduling for Sensors Correlated in Time and Space0
Energy-Aware Multi-Server Mobile Edge Computing: A Deep Reinforcement Learning Approach0
Energy-aware Scheduling of Jobs in Heterogeneous Cluster Systems Using Deep Reinforcement Learning0
Energy Efficiency in Reinforcement Learning for Wireless Sensor Networks0
Energy Efficiency Optimization for Subterranean LoRaWAN Using A Reinforcement Learning Approach: A Direct-to-Satellite Scenario0
Energy-Efficient Design for a NOMA assisted STAR-RIS Network with Deep Reinforcement Learning0
Energy Efficient Edge Computing: When Lyapunov Meets Distributed Reinforcement Learning0
Energy Expenditure Estimation Through Daily Activity Recognition Using a Smart-phone0
Energy Management Based on Multi-Agent Deep Reinforcement Learning for A Multi-Energy Industrial Park0
Energy Optimization of Wind Turbines via a Neural Control Policy Based on Reinforcement Learning Markov Chain Monte Carlo Algorithm0
Energy Saving in 6G O-RAN Using DQN-based xApp0
Energy Storage Arbitrage in Real-Time Markets via Reinforcement Learning0
Energy Storage Management via Deep Q-Networks0
Energy-Weighted Flow Matching for Offline Reinforcement Learning0
ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment0
Enforcing constraints for time series prediction in supervised, unsupervised and reinforcement learning0
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments0
Enforcing KL Regularization in General Tsallis Entropy Reinforcement Learning via Advantage Learning0
Enforcing the consensus between Trajectory Optimization and Policy Learning for precise robot control0
Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning0
Enhanced Audit Techniques Empowered by the Reinforcement Learning Pertaining to IFRS 16 Lease0
Show:102550
← PrevPage 418 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified