SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 51515200 of 15113 papers

TitleStatusHype
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation0
Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments0
Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors0
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning0
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning0
Human-Like Autonomous Car-Following Model with Deep Reinforcement Learning0
Human-Like Decision Making: Document-level Aspect Sentiment Classification via Hierarchical Reinforcement Learning0
Human-like Energy Management Based on Deep Reinforcement Learning and Historical Driving Experiences0
Human-Object Interaction from Human-Level Instructions0
Humanoid Whole-Body Locomotion on Narrow Terrain via Dynamic Balance and Reinforcement Learning0
Human-Robot Skill Transfer with Enhanced Compliance via Dynamic Movement Primitives0
Humans are not Boltzmann Distributions: Challenges and Opportunities for Modelling Human Feedback and Interaction in Reinforcement Learning0
Human-Timescale Adaptation in an Open-Ended Task Space0
Machine versus Human Attention in Deep Reinforcement Learning Tasks0
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance0
HVAC-DPT: A Decision Pretrained Transformer for HVAC Control0
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation0
Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving0
Hybrid Adversarial Imitation Learning0
Hybrid Beamforming for mmWave MU-MISO Systems Exploiting Multi-agent Deep Reinforcement Learning0
Hybrid computer approach to train a machine learning system0
Hybrid Cross-domain Robust Reinforcement Learning0
Hybrid Deep Reinforcement Learning and Planning for Safe and Comfortable Automated Driving0
Hybrid Imitation Learning for Real-Time Service Restoration in Resilient Distribution Systems0
Hybrid Indoor Localization via Reinforcement Learning-based Information Fusion0
Hybrid Information-driven Multi-agent Reinforcement Learning0
Hybridization of evolutionary algorithm and deep reinforcement learning for multi-objective orienteering optimization0
Hybrid Learning for Orchestrating Deep Learning Inference in Multi-user Edge-cloud Networks0
Hybrid Learning with New Value Function for the Maximum Common Subgraph Problem0
Hybrid Policies Using Inverse Rewards for Reinforcement Learning0
Hybrid Q-Learning Applied to Ubiquitous recommender system0
Hybrid Reinforcement Learning and Model Predictive Control for Adaptive Control of Hydrogen-Diesel Dual-Fuel Combustion0
Hybrid Reinforcement Learning-Based Eco-Driving Strategy for Connected and Automated Vehicles at Signalized Intersections0
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs0
Hybrid Reinforcement Learning for Optimizing Pump Sustainability in Real-World Water Distribution Networks0
Hybrid Reinforcement Learning for STAR-RISs: A Coupled Phase-Shift Model Based Beamformer0
Hybrid Reinforcement Learning Framework for Mixed-Variable Problems0
Hybrid Reinforcement Learning from Offline Observation Alone0
Hybrid Supervised and Reinforcement Learning for the Design and Optimization of Nanophotonic Structures0
Hybrid Systems Neural Control with Region-of-Attraction Planner0
Mixed Traffic Control and Coordination from Pixels0
Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation0
Hybrid UAV-enabled Secure Offloading via Deep Reinforcement Learning0
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning0
Hybrid Zero Dynamics Inspired Feedback Control Policy Design for 3D Bipedal Locomotion using Reinforcement Learning0
Hyperbolically-Discounted Reinforcement Learning on Reward-Punishment Framework0
Hyperbolic Deep Reinforcement Learning0
Hyperbolic Embeddings for Learning Options in Hierarchical Reinforcement Learning0
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning0
HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem0
Show:102550
← PrevPage 104 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified