SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 77767800 of 15113 papers

TitleStatusHype
Hallucinating Value: A Pitfall of Dyna-style Planning with Imperfect Environment Models0
Hallucination-Aware Generative Pretrained Transformer for Cooperative Aerial Mobility Control0
Hamiltonian Policy Optimization0
Hamiltonian Policy Optimization in Reinforcement Learning0
On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension0
Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time0
Handling Cold-Start Collaborative Filtering with Reinforcement Learning0
Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control0
Hand-Object Interaction Pretraining from Videos0
Hard instance learning for quantum adiabatic prime factorization0
Hardness in Markov Decision Processes: Theory and Practice0
Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization0
Hardware Trojan Insertion Using Reinforcement Learning0
HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks0
Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems0
Harnessing Causality in Reinforcement Learning With Bagged Decision Times0
Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration0
HARPO: Learning to Subvert Online Behavioral Advertising0
Harvesting energy from turbulent winds with Reinforcement Learning0
HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents0
Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning0
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism0
HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving0
Hebbian Learning of Bayes Optimal Decisions0
Hebbian Synaptic Modifications in Spiking Neurons that Learn0
Show:102550
← PrevPage 312 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified