SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy: a decision-making strategy that maximizes the long-term reward.
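
The interaction loop described above can be illustrated with a minimal sketch of tabular Q-learning, one common RL algorithm among many. Everything in it is an assumption made for illustration and does not come from any paper listed on this page: the five-state corridor environment, the `step` and `train` helpers, and the hyperparameter values are all hypothetical.

```python
import random

N_STATES = 5      # hypothetical corridor: states 0..4, state 4 is the goal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment (an assumption): returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    # Reward 1.0 only when the goal is reached, 0.0 otherwise.
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn action values by interacting with the toy environment."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap on episode length
            # Epsilon-greedy choice; ties between actions are broken randomly.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.choice(ACTIONS)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Q-learning target: reward plus discounted best next value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


q_table = train()
# Greedy policy for the non-terminal states 0..3 (1 means "move right").
policy = [0 if left > right else 1 for left, right in q_table[:-1]]
print(policy)
```

In this toy setting the learned greedy policy should favor moving right in every non-terminal state, since that is the only way to reach the rewarded goal. The discount factor `gamma` is what makes the agent value long-term reward over immediate reward.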

Papers

Showing 10951–11000 of 15113 papers

Title | Status | Hype
Adaptive Reinforcement Learning through Evolving Self-Modifying Neural Networks | | 0
Towards Automated Safety Coverage and Testing for Autonomous Vehicles with Reinforcement Learning | | 0
Reinforcement learning with human advice: a survey | | 0
Q-NAV: NAV Setting Method based on Reinforcement Learning in Underwater Wireless Networks | | 0
Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension | | 0
Novel Policy Seeking with Constrained Optimization | Code | 0
Two-stage Deep Reinforcement Learning for Inverter-based Volt-VAR Control in Active Distribution Networks | | 0
Reinforcement Learning for Variable Selection in a Branch and Bound Algorithm | | 0
Learning and Reasoning for Robot Dialog and Navigation Tasks | | 0
Deep Reinforcement Learning for High Level Character Control | | 0
Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise | | 0
A reinforcement learning based decision support system in textile manufacturing process | | 0
Batch-Augmented Multi-Agent Reinforcement Learning for Efficient Traffic Signal Optimization | | 0
A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments | | 0
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text | | 0
Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning | | 0
Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding Behaviors using Deep Reinforcement Learning | | 0
Privileged Information Dropout in Reinforcement Learning | | 0
Reinforcement Learning for Caching with Space-Time Popularity Dynamics | | 0
Optimal Charging Method for Effective Li-ion Battery Life Extension Based on Reinforcement Learning | | 0
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | | 0
Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency Maps | Code | 0
Automating Turbulence Modeling by Multi-Agent Reinforcement Learning | | 0
Learning Transferable Concepts in Deep Reinforcement Learning | | 0
A Simple Imitation Learning Method via Contrastive Regularization | | 0
A Distributional View on Multi-Objective Policy Optimization | | 0
Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning | Code | 0
Solve Traveling Salesman Problem by Monte Carlo Tree Search and Deep Neural Network | | 0
Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning | | 0
Probabilistic Guarantees for Safe Deep Reinforcement Learning | | 0
Data-driven Dynamic Multi-objective Optimal Control: An Aspiration-satisfying Reinforcement Learning Approach | | 0
DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics | | 0
From Simulation to Real World Maneuver Execution using Deep Reinforcement Learning | | 0
Explainable Reinforcement Learning: A Survey | | 0
Proxy Experience Replay: Federated Distillation for Distributed Reinforcement Learning | | 0
Unbiased Deep Reinforcement Learning: A General Training Framework for Existing and Future Algorithms | | 0
A New Deep Neural Architecture Search Pipeline for Face Recognition | | 0
Deep Reinforcement Learning for Organ Localization in CT | | 0
A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support | | 0
Reinforcement Learning Based on Real-Time Iteration NMPC | | 0
TOMA: Topological Map Abstraction for Reinforcement Learning | | 0
Maximizing Information Gain in Partially Observable Environments via Prediction Reward | | 0
Optimal PID and Antiwindup Control Design as a Reinforcement Learning Problem | | 0
Reinforcement Learning based Design of Linear Fixed Structure Controllers | | 0
A Reinforcement Learning based approach for Multi-target Detection in Massive MIMO radar | | 0
Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems | | 0
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning | | 0
Reinforcement Learning for Thermostatically Controlled Loads Control using Modelica and Python | | 0
Synthesizing Safe Policies under Probabilistic Constraints with Reinforcement Learning and Bayesian Model Checking | | 0
Is Deep Reinforcement Learning Ready for Practical Applications in Healthcare? A Sensitivity Analysis of Duel-DDQN for Hemodynamic Management in Sepsis Patients | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | | Unverified